MIKIAS ABERA · TORONTO

AI systems where being wrong is expensive

—— Senior software engineer. I build document and data pipelines with verification gates, eval harnesses, and audit trails, and I write about what breaks along the way.

extract.

verify.

publish.

Proofover promises

Citation Verification Rate

100%

333KPapers triaged

28K+Papers extracted

4/5Fabrications caught

0Unverified claims shipped

Numbers straight from my pipeline databases. Every one is checkable.

No model-authored numbers.

Every output earns trust before it ships.

Verifiedby default

PROJECTS

Systems built to be right

—— Each one carries its own verification story.

COMPLIANCE · ACTIVE

/01

PermitCheck

PermitCheckToronto readiness

15 Overbrook PlRD (f15.0; a550) (x5)

Lot coverage30% max · 41.3%

Building height10.0 m max · 9.3 m

Side yard1.8 m req · 1.24 m

Front setbackneeds manual review

limits resolved per address · flagged, not guessed

A Toronto permit-readiness checker that verifies a package against the property's actual zoning, and says what it can't confirm instead of guessing.

+
Resolves each lot's binding zoning limits from the City's official data
+
Flags variances with the governing by-law section cited on every line
+
Precision-first: when it flags, it's right; otherwise it says verify
+
Graded against real decided applications, the City's own determinations as the key
+
Multi-pass extraction and a measured model choice, every change eval-gated

ENACTED · ACTIVE

/02

Enacted

Enacted.Every change to Canadian law

This week · 9 sectors

FEDERAL · SOR/2026-12+96 / −24

Immigration and Refugee Protection Regulations amended

ONTARIO · O. Reg. 176/26+34 / −0

Work with local police services

BC · B.C. Reg. 43/2026+18 / −5

Business Corporations Regulation

Read the exact diffcitations never AI-authored

Every change to Canadian law, in plain English: daily diffs of nearly 9,000 Ontario and federal regulations and statutes, with AI summaries that are never allowed to invent a fact.

+
Tracks nearly 9,000 consolidated laws across three jurisdictions: Ontario (daily against e-Laws), the entire federal corpus (git-diffed against Justice Canada's official XML repo), and British Columbia (per Gazette issue), plus Ontario proclamations and per-law RSS watch feeds
+
Every amendment becomes an exact, computed line-by-line diff between consolidated versions, never AI output
+
AI writes the plain-English summary of the computed diff only, behind a citation gate that rejects any summary introducing facts not present in its input
+
Citations, dates, and version numbers render from official metadata, never from the model
+
Weekly digest by industry sector, with CASL double opt-in and one-click unsubscribe
+
Seeded with 8 weeks of real history on day one: 128 change events, 120 published summaries, 1 caught by the gate

EXTRACTION · ACTIVE

/03

NoeticMap

✦ NoeticmapNDE Research

8,940

Total Experiences

8,062

AI Analyzed

16.4

Avg. Greyson Score

Audio Experiences

Fear of Death · n=7,600

74% reduced42% eliminated

nderf · oberf · adcrf corpora · verbatim-quoted claims

An extraction pipeline that turns consciousness research, 65,074 papers and 8,940 experience accounts, into structured, citable claims.

+
8,940 experience accounts aggregated from NDERF, OBERF, and ADCRF
+
8,062 fully AI-processed with analysis, embeddings, and Greyson scoring
+
65,074 papers discovered, 2,944 relevant, 2,233 fully extracted
+
10,063 key insights pulled from the academic literature
+
Deep extraction runs on local Qwen models, not cloud GPUs

EVIDENCE · ACTIVE

/04

Naulus

NaulusEvidence, mapped

Fitness & Nutrition · The Evidence, Mapped

Claims, mapped to their evidence.

Getting lean isn’t a mystery. It’s a settled question buried under noise.

Every citation verified · nothing publishes unverified

A public evidence map for fitness and nutrition claims, where nothing publishes with an unverified citation.

+
Citations resolved by DOI against Crossref before they back a claim
+
No verified citation, no publication
+
98.2% resolution rate across a 1,000-DOI sample
+
Evidence tiered A to D per study, not per headline
+
Numbers computed in code, never by the model

DOSSIERS · SHIPPED

/05

Xzema

● xzemaTreatments · Methodology

Research map, not medical advice.

Eczema treatments, mapped to their actual evidence

DupilumabSTRONG EVIDENCE

ProbioticsSTRONG EVIDENCE

Elimination dietPOPULAR BUT UNPROVEN

Browse treatments2 axes · never blended

Eczema treatments mapped to their actual evidence: 25,037 papers and 1.1 million community reports, distilled into human-reviewed dossiers.

+
25,037 papers sourced, 13,222 relevant, 9,257 fully extracted
+
1.14M community posts mined for signal
+
Evidence grades computed from study design, not stated labels
+
Sanity-gated rubric caught grade inflation on run one
+
Citations must resolve at build time or they are dropped and logged

PRACTICE · SHIPPED

/06

Blindfold Lab

BLINDFOLD LAB

A real phenomenon

Learn to See
Without Your Eyes

Begin Training

An audio-guided practice platform for blindfolded perception: once a session starts, everything is voice, keyboard, or swipe.

+
Eyes-free by design: all guidance is spoken, all input is keyboard or swipe
+
Three drills shown full-screen: contrast, colors, and shapes
+
12-trial sessions across three difficulty tiers, 5s down to 2s exposure
+
Tracks accuracy and reaction time, suggests level changes over your last 3 sessions
+
Trust-based practice logs, no verification claims

WRITING

What breaks gets written down

—— Build logs, failure reports, and the fundamentals series. One useful essay a week.

Writing

What breaks needs to be written down.

Essays on making AI systems reliable: pipelines, verification, evals, and the failures that taught me.

+Build logs and case studies
+Fundamentals, learned in public
+Failure reports with fixes

Mikias Aberaverification

2026-06-11

Never let the LLM author the numbers

Mikias Aberapipelines

2026-06-11

Extracting structured claims from 28,000 papers on a desk, not a datacenter

Mikias Aberaverification

2026-06-10

My research agent fabricated 4 of 5 citations, so I built a verification gate

FUNDAMENTALS

Learned in public

—— One fundamental per week. Primary sources, a tiny build that breaks on purpose, and the explainer I wish existed.

How do you wrap deterministic checks around a probabilistic system so fabrications cannot ship?

The build: A citation verifier that validates every cited source against an authority API

Read the essay

Which numbers in an AI system should the model never author, and where does the arithmetic actually live?

The build: A deterministic calculation layer the LLM can invoke but never override

Ships on schedule

What does cosine similarity over embedding space actually measure, and which queries does it quietly fail?

The build: Embed one corpus two ways and show where the retrievals disagree

Ships on schedule

Why does chunk size and overlap dominate retrieval quality more than model choice?

The build: Same corpus, three chunking strategies, measured hit rates

Ships on schedule

All 12 weeks →

FAQ

Asked and answered

—— The questions people actually ask about the work, all in one place.

Document and data pipelines with AI in the loop: extraction systems, retrieval over regulated documents, verification gates that stop fabrications from shipping, and the eval harnesses that catch drift before users do.

Because my own pipeline once fabricated 4 of 5 citations and nothing about the output looked wrong. If a property of the output must always hold, something other than the model has to enforce it. That principle shapes everything I ship.

A 12-week public curriculum: each week I take one fundamental behind systems I already run in production, read the primary sources, rebuild it small enough to break on purpose, and publish the explainer I wish existed.

TypeScript and Python, Next.js, Postgres, and whichever model fits the task. The interesting decisions are rarely the model: they are chunking, retrieval, schemas, evals, and where the deterministic checks live.

Yes. Everything ships to the writing section and the email list, one useful essay a week. The projects pages show the verification story behind each system.

How I work

Verified by default, explained in public.

Every system I ship has a verification story: what must always hold, and the deterministic check that enforces it.

28,000+ papers extracted, 333,000 triaged

Every citation machine-verified against OpenAlex + Crossref.

Read the essays

“Four of the five citations were wrong. Not subtly wrong. Fabricated citations look exactly like real ones, and that is the whole problem: the most plausible-looking thing is a well-formed citation, not a true one. So I stopped asking the model to be trustworthy and built a gate instead.”

The verification gate

Week 01 of Production AI Fundamentals

Read it

AI systems where being wrong is expensive

Citation Verification Rate

Every output earns trust before it ships.

Systems built to be right

/01

PermitCheck

/02

Enacted

/03

NoeticMap

/04

Naulus

/05

Xzema

/06

Blindfold Lab

What breaks gets written down

Writing

Never let the LLM author the numbers

Extracting structured claims from 28,000 papers on a desk, not a datacenter

My research agent fabricated 4 of 5 citations, so I built a verification gate

Learned in public

Verification gates

01

Deterministic computation

02

Embeddings and vector search

03

Chunking

04

Asked and answered

Verified by default, explained in public.

Every system I ship has a verification story: what must always hold, and the deterministic check that enforces it.

“Four of the five citations were wrong. Not subtly wrong. Fabricated citations look exactly like real ones, and that is the whole problem: the most plausible-looking thing is a well-formed citation, not a true one. So I stopped asking the model to be trustworthy and built a gate instead.”