Felix Koole

Thinking about AI Security

Exploring the intersection of artificial intelligence and security—adversarial machine learning, prompt injection, model safety, and the systems we build to keep AI honest.

Read the Blog

Latest Writing

5-min Paper: Should you use floats or ints as confidence scores?

When you ask an LLM to rate its confidence, does the format matter? We tested four SOTA models with decimal (0.00–1.00) and integer (0–100) confidence scores across true, dubious, and nonsense labels. Decimal format produced more conservative estimates on ambiguous inputs and dramatically better cross-model agreement. Integer format caused surprising failures — GPT-5.2 alternated between 0 and 100 on obvious nonsense. The culprit? Tokenization. The 0. prefix appears to anchor models into calibrated probability-reasoning mode that integers simply don't activate.

5 minute paper
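The comparison in the post can be sketched as two prompt variants plus a normalizer that maps both reply formats onto the same scale. Everything below (prompt wording, function names) is an illustrative assumption, not the experiment's actual code:

```python
# Hypothetical sketch of the two confidence-elicitation formats.
# Prompt wording and function names are illustrative, not the
# post's actual code.

DECIMAL_PROMPT = (
    "Rate your confidence that this label is correct as a decimal "
    "between 0.00 and 1.00. Reply with the number only."
)
INTEGER_PROMPT = (
    "Rate your confidence that this label is correct as an integer "
    "between 0 and 100. Reply with the number only."
)

def normalize_confidence(raw: str, fmt: str) -> float:
    """Parse a model reply and map it onto [0.0, 1.0] so the two
    formats can be compared directly."""
    value = float(raw.strip())
    if fmt == "decimal":
        scaled = value
    elif fmt == "integer":
        scaled = value / 100.0
    else:
        raise ValueError(f"unknown format: {fmt}")
    # Clamp out-of-range replies rather than discarding them.
    return min(1.0, max(0.0, scaled))
```

Normalizing both formats to the same interval is what lets cross-model agreement be measured on a common scale, regardless of which prompt a given model saw.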