notes: large language models

Notes on various LLMs and the techniques used to make them.

LLMs are better thought of as "calculators for words" - retrieval of facts is a by-product of how they are trained, but it's not their core competence at all.

Simon Willison on HN, a point he later expanded on

LLM Families

Source

A more comprehensive list of models:
2023 LifeArchitect.ai data (shared)

Comparisons

hyperopt:
Does anyone know of any good test suites we can use to benchmark these local models? [...]

aiappreciator:
The simplest and quickest benchmark is to do a rap battle between GPT-4 and the local models. [...]

It is instantly clear how strong the model is relative to GPT-4.

From this HN thread

Interesting Articles

We Have No Moat, And Neither Does OpenAI: a leaked internal Google document about the success of open-source models and how Google should change its approach to competing with OpenAI
