Building AI agents, exploring cognitive science, unraveling bioinformatics

Creative engineering where philosophy meets technology

Latest Posts

Carbon-3B: A 3B DNA Foundation Model That Matches Evo2-7B at 150x the Speed

#genomics #foundation-models #dna-language-models #tokenization #transformers #variant-effect-prediction

Carbon-3B matches Evo2-7B on sequence recovery, variant-effect prediction, and motif-perturbation discrimination while generating DNA over 150 times faster.

May 29, 202625 min read

MIMIC: One 1B Model Across the Central Dogma, and Why Multimodality Beats Scale

#foundation-models #computational-biology #multimodal #splicing #protein-designCited

A 1-billion-parameter model conditioned on RNA chemical-probing reactivity folds a held-out transcript to an F1 of 0.987 against an experimentally guided reference.

May 29, 202636 min read

Structure Without Alignment: How ESM-2 Folds a Single Sequence

#protein-language-models #structure-prediction #esmfold #esm-2 #metagenomics #deep-learningCited

ESMFold predicts atomic-level structure from a single sequence. No multiple sequence alignment, no database search, no Evoformer churning over homologs.

May 29, 202617 min read

Fewer Than 0.1% of Neurons Predict When LLMs Hallucinate

#mechanistic-interpretability #hallucination #llm-internals #neurons #alignmentCited

Fewer than one in a thousand neurons in a large language model can predict whether it's about to hallucinate -- and they encode something unexpected: not factual errors, but a tendency toward compliance over truth.

Feb 25, 202611 min read

AI Safety via Debate: How Adversarial Argumentation Solves RL's Hardest Problem

#ai-safety #reinforcement-learning #scalable-oversight #debate #alignmentCited

Reinforcement learning works when you can check the answer. A chess engine wins or loses. A code-generation model passes or fails the test suite.

Feb 20, 202616 min read

Inverse Graphics as RL Environments: Testing Whether VLMs Can Actually See

#reinforcement-learning #vision-language-models #inverse-graphics #benchmarks #spatial-reasoningCited

Vision-language models can describe a scene in paragraph-length detail and still fail to tell you whether a red cube is in front of or behind a blue cylinder.

Feb 20, 202612 min read

Socrates Was a Terrible Prompt Engineer (That's the Point)

#prompt-engineering #socratic-method #llm-reasoning #ai-philosophy #chain-of-thoughtCited

Six ancient questioning techniques map onto the most effective LLM prompting strategies with uncomfortable precision, and what that reveals about these models is more interesting than the performance gains.

Feb 11, 202620 min read

Making Science Machine-Readable: The Epistemological Challenge of Verifying Knowledge at Scale

#machine-learning #scientific-publishing #ai #knowledge-graphs #epistemologyCited

How do you verify scientific knowledge when there are 2.9 million papers on arXiv alone, with thousands more added every day? A new paper extracts nearly two million claims from 16,087 manuscripts and compares machine evaluation to human peer review, with 81% agreement.

Feb 4, 202619 min read

When AI Writes the Code, Verification Becomes the Job

#formal-verification #ai-code-generation #software-security #devops #llmCited

Over 80% of developers now use AI assistants for code generation, yet at least 62% of AI-generated code contains vulnerabilities. As AI writes code faster than humans can review it, the engineer's primary job shifts from writing code to verifying it through formal methods.

Feb 4, 202611 min read

The Historical Accident That Split Drug Design in Two (And the Contrastive Model That Reunites It)

#drug discovery #contrastive learning #computational biology #virtual screening #protein-ligand interactions #graph neural networks #machine learningCited

Structure-based and ligand-based drug design evolved as separate fields solving the same problem. ConGLUDe, a contrastive geometric learning model, unifies both approaches and outperforms specialist methods on realistic benchmarks without requiring pre-defined binding pockets.

Jan 30, 202612 min read

AlphaGenome: One Model for the Other 98% of Your DNA

#deep-learning #genomics #alphagenome #deepmind #variant-prediction #non-coding-dnaCited

Google DeepMind's AlphaGenome reads 1 million base pairs of DNA and predicts thousands of regulatory functions at single-nucleotide resolution, beating 25 of 26 specialized models.

Jan 29, 202612 min read

How AlphaGenome Tackles Variant Effect Prediction

#genomics #deep learning #variant effect prediction #alphagenome #computational biologyCited

AlphaGenome processes 1 million DNA base pairs to predict variant effects across 7,000+ genomic tracks in one second, outperforming specialized models on 25 of 26 VEP benchmarks.

Jan 29, 20268 min read

How AlphaGenome Models Gene Regulation: 2D Embeddings, Splicing, and the Race to Read Non-Coding DNA

#alphagenome #genomics #deep learning #splicing #computational biology #google deepmind #variant interpretationCited

A technical look at AlphaGenome's architecture, its 2D pairwise embeddings for splicing prediction, and what the model means for clinical variant interpretation.

Jan 29, 202618 min read

EDEN: 28 Billion Parameters for Programming Biology

#foundation models #computational biology #gene therapy #metagenomics #drug discovery #eden #basecamp researchCited

Basecamp Research's EDEN model trains on proprietary environmental metagenomics to design gene-insertion enzymes, antimicrobial peptides, and synthetic microbiomes -- all validated in the wet lab.

Jan 28, 202616 min read

A Bioinformatician's Guide to Choosing Genomic Foundation Models

#foundation models #genomics #bioinformatics #deep learning #protein language models #dna models #esm-2 #dnabert-2 #hyenadna #scgptCited

A practical guide to selecting genomic foundation models for bioinformatics tasks. Covers ESM-2, DNABERT-2, HyenaDNA, Nucleotide Transformer, scGPT, and Evo with specific recommendations for DNA sequence analysis, protein structure prediction, and single-cell analysis based on hardware requirements, inference speed, and task type.

Jan 19, 202622 min read

View All 65 Posts