Artificial Intelligence, War and Mass SurveillanceArtificial Intelligence is being deployed at scale in warfare. We have seen this in cases such as the use of Claude (Anthropic) in the capture of Vene...Apr 23, 20266 minRead more
VideoFine-tuning Asclepio-8B and Qwen2.5-VL-3B: Medical Reasoning and Screenshot-to-Code with LoRATraining specialized models doesn't require impossible GPUs or weeks of compute. Asclepio-8B learns clinical medical reasoning with 1.3M examples in 6.7 hours, reaching 76.9% accuracy. Qwen2.5-VL-3B converts UI screenshots into functional HTML/CSS with 94.6% accuracy in 5.5 hours. Both trained on L4 24GB using LoRA, demonstrating that specialized fine-tuning with smaller models can outperform giant general-purpose models. This post documents configurations, data pipelines, real metrics, and why fast iteration matters more than raw parameters.fine-tuningLoRAmedical-ai+6Oct 19, 202516 minRead more
RequestScopedPipeline: Concurrent Inference in Diffusers without Race Conditions or Memory DuplicationDiffusers pipelines weren't designed for concurrency: calling pipe() simultaneously causes race conditions in schedulers, 'Already borrowed' errors in Rust tokenizers, or duplicates entire models in memory. My contribution (#12328) introduces RequestScopedPipeline, which solves these issues by creating lightweight per-request views, cloning only small mutable components, and adding automatic locks to tokenizers. Result: a server that handles multiple concurrent users without exploding GPU memory.diffusersaiserver-async+3Sep 30, 20259 minRead more
lambda-gateway: Building a Serverless Host DemoHave you ever wondered how platforms like Vercel and AWS Lambda work under the hood? I built lambda-gateway, a serverless hosting demo using Docker, FastAPI, and Next.js to discover how serverless architectures work through hands-on experimentation.lambdaserverlessdocker+4Invalid Date15 minRead more
My First Blog PostWelcome to my blog! This is my first article where I'm going to share my experiences as a Full-Stack developer. In this space I'll be writing about web development, artificial intelligence...web developmentnext.jsreact+1Sep 28, 20251 minRead more