
Beating BERT? Small LLMs vs Fine-Tuned Encoders for Classification
I ran 32 experiments comparing small LLMs to BERT on classification tasks. Turns out 2018-era BERT is still really good at what it does.

I ran 32 experiments comparing small LLMs to BERT on classification tasks. Turns out 2018-era BERT is still really good at what it does.

An empirical analysis of LLM application patterns that successfully scale in production systems, focusing on extraction, generation, and classification use cases

A technical dive into the limitations of current RAG approaches, examining architectural challenges and exploring pathways to more integrated knowledge-aware LLM architectures.

A (Very) Simple RAG Tutorial