11. Further reading
For brevity and readability, footnotes were used in this article, rather than
in-text/parenthetical citations. Primary reference papers are listed below, or please
see http://lifearchitect.ai/papers/ for the major foundational papers in the large
language model space. Papers below are shown in order of appearance in this
article.
Datasheets for Datasets
Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J., Wallach, H., Daumé III, H., &
Crawford, K. (2018). Datasheets for Datasets. https://arxiv.org/abs/1803.09010
GPT-1 paper
Radford, A., & Narasimhan, K. (2018). Improving Language Understanding by
Generative Pre-Training. OpenAI.
https://cdn.openai.com/research-covers/language-unsupervised/language_understan
ding_paper.pdf
GPT-2 paper
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. & Sutskever, I. (2019). Language
Models are Unsupervised Multitask Learners. OpenAI.
https://cdn.openai.com/better-language-models/language_models_are_unsupervised
_multitask_learners.pdf
GPT-3 paper
Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., & Dhariwal, P. et al. (2020).
OpenAI. Language Models are Few-Shot Learners. https://arxiv.org/abs/2005.14165
The Pile v1 paper
Gao, L., Biderman, S., Black, S., Golding, L., Hoppe, T., & Foster, C. et al. (2021). The
Pile: An 800GB Dataset of Diverse Text for Language Modeling. EleutherAI.
https://arxiv.org/abs/2101.00027
GPT-J announcement
Komatsuzak, A., Wang, B. (2021). GPT-J-6B: 6B JAX-Based Transformer.
https://arankomatsuzaki.wordpress.com/2021/06/04/gpt-j/
GPT-NeoX-20B paper
Black, S., Biderman, S., Hallahan, E. et al. (2022). EleutherAI. GPT-NeoX-20B: An
Open-Source Autoregressive Language Model.
http://eaidata.bmk.sh/data/GPT_NeoX_20B.pdf
RoBERTa paper
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., & Chen, D. et al. (2019). RoBERTa: A Robustly
Optimized BERT Pretraining Approach. Meta AI. https://arxiv.org/abs/1907.11692
___________________
Alan D. Thompson. 2022. What’s in my AI? A Comprehensive Analysis of Datasets used to Train GPT…
https://LifeArchitect.ai/whats-in-my-ai 23