How and why I built things.
The retrieval pipeline, embedding strategy, vector DB choice, and the prompt engineering that made it useful.
From data ingestion to model validation — the iterative process of building Sakura.
Containerising a model and deploying it as a pay-per-use endpoint. No Kubernetes required.
Why constraints breed creativity — and how a single file produced a better product.