Skip to content

Latest commit

 

History

History
126 lines (96 loc) · 17.8 KB

research_papers.md

File metadata and controls

126 lines (96 loc) · 17.8 KB

LLM: Research Papers


Image Description

Come on OpenAI, release the GPT-4 weights 😇

As Large Language Models (LLM) have exploded in popularity since the unveiling of ChatGPT by OpenAI in October 2022, research and commercial interest over LLMs has led to a tsunami of research contributions. This is an attempt to curate the list of research articles, with a description of the main findings of each paper to the best of my knowledge. This list is non-exhaustive.

Summaries

Transformers

Pre-LLM Transformers

Foundational LLMs

Fine-Tuning Methods (RLHF* = SFT** + PPO***)

*Reinforcement Learning with Human Feedback

**Supervised Fine Tuning

***Proximal Policy Optimization

Quantization of LLMs

Alignment Approaches (Alternatives to RLHF)

Robustness, Harmlessness and Ethics

Theoretical Considerations on LLMs

Applications

Healthcare

Financial Markets

Economics

Artificial General Intelligence