LLM Fundamentals: Parameters, Embeddings, and Attention
How LLM parameters encode meaning, what embedding dimensions actually represent, and why Transformer attention is computed in parallel.
6 posts
A structured map of GenAI application patterns, with a practical framework for deciding how deep to go in each — calibrated to AI Product Engineer vs. AI Engineer roles.
A comprehensive breakdown of AI pipeline concepts learned building LinguaRAG — a Korean-German textbook AI tutor using RAG, SSE streaming, multi-layer prompts, and pgvector.
How to reliably detect structured unit boundaries in a bilingual PDF, and how to keep boilerplate text from polluting RAG vector chunks.
Hard-won lessons from building a robust PDF chunker for a Korean-German textbook: multiple detection guards, line-level copyright stripping, and RAG behavior verification.
Core RAG concepts understood while planning LinguaRAG: offline/online phase separation, SSE streaming mechanics, prompt assembly, and the role of pgvector.