Doğukan Tuna profile photo

Dogukan Tuna

πŸ“„

Hello, I usually go by the "dt.thinky!" alias, though my name is Doğukan.

Welcome to my website! I'm an AI researcher focused on searching & scaling training recipes toward superhuman generalization & super-efficient learning. Nowadays, I'm busy with my pretraining autoresearch infra, looping LLM orchestrators for hundreds of agents, trying to maximize my net positive impact on shortening the timeline to achieving superhuman capabilities for science. I love to stare at nature 🏞️ & thinking machines πŸ€”

πŸ—“οΈ

I share my daily worklogs here: dthinky.com

πŸ“¬
πŸ“

My notes, experiments and things I find worth sharing:

🧠

Neural Networks & New Kinds!

Compression is how I think about learning. The tighter a model can compress its inputs, the more structure it has actually found. Kolmogorov complexity makes this precise β€” it measures the length of the shortest program that produces a given output, which turns out to be the theoretical floor for any compressor.

The ultimate compressor

K(X) = length of the shortest program that outputs X

For any computable compressor C and all strings X:

K(X) ≀ |C(X)| + K(C) + O(1)

via the simulation argument β€” run C inside a universal machine

The catch

K(X) is uncomputable β€” you can never know the true shortest program.

But a deep network is a finite parallel computer that approximates it with bounded resources.

MAGICAL!

Why neural nets are compressors

Neural nets can simulate arbitrary programs

↓

They are small computers β€” circuits wired by data

↓

SGD searches over the space of programs they can express

Micro-Kolmogorov complexity

Fix an architecture, then fit a network with SGD β€” the bit-length of the resulting weights is a practical proxy for description length:

micro-K(f) β‰ˆ bit-length of weights in a fixed architecture

minf ∈ F [ loss(f) + λ · micro-K(f) ]

Shorter description length β†’ better generalization.

I'm deeply invested in methods that make learning systems compress harder and generalize further.