Stuff that I am currently reading or otherwise working on. If you are looking at this and you aren’t me, you might want to reconsider your hobbies.

1 Refactoring

I need to reclassify the bio-computing links; that section has become confusing, with too many nice ideas jumbled together without clear distinctions.

2 Currently writing

Not all published yet.

  1. So you’ve just joined a union

  2. When is computation statistical, in the sense that we know the dynamics of a population of solutions even when we cannot do the computations ourselves? I am not quite sure of the scope of this, so let’s use some examples to flesh it out:

    1. Trading equities. We cannot know what trades everyone is making, but we can do a good job of pricing options under no-arbitrage assumptions and the like, even though some market participants’ calculations are far more complicated than ours. The no-arbitrage assumptions are not strictly true, but the returns to hunting arbitrage opportunities seem to diminish in compute, or something like that, so in the wash the simple prices come out pretty similar.
    2. Scaling laws: we cannot know what exact computations an LLM will do, but we can predict remarkably well how well it will do them under a given data/parameter/training-compute budget (see the sketch just after this list).
    3. Algorithmic statistics and pseudorandomness concern the statistical behaviour of certain classes of algorithms, which in certain technical senses becomes near-indistinguishable from randomness.
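
    A minimal sketch of the scaling-laws point, using the parametric form and fitted constants reported in the Chinchilla paper (Hoffmann et al. 2022). The constants are theirs, not re-fit by me; treat the numbers as illustrative.

    ```python
    # Chinchilla-style parametric scaling law: predicted loss as a function
    # of parameter count N and training-token count D.
    def predicted_loss(n_params: float, n_tokens: float) -> float:
        """L(N, D) = E + A / N**alpha + B / D**beta."""
        E, A, B, alpha, beta = 1.69, 406.4, 410.7, 0.34, 0.28
        return E + A / n_params**alpha + B / n_tokens**beta

    # The point for this essay: without knowing what the network computes,
    # the population-level statistic (loss) is predictable from the budget.
    # Training compute is roughly C = 6 * N * D FLOPs.
    for n in (1e9, 1e10, 1e11):
        d = 20 * n  # rule of thumb from the same paper: ~20 tokens per parameter
        print(f"N={n:.0e} D={d:.0e} C~{6 * n * d:.1e} FLOPs loss~{predicted_loss(n, d):.3f}")
    ```
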
  3. AI Safety

    1. Metrics that come apart from their goals
    2. Domestication of humans
    3. Causal agency
  4. Foundation models and their world models

  5. Community building

    1. Collective care
    2. Social calendaring
    3. Psychological resilience
  6. Reality gap

  7. Continual learning.

  8. Is academic literary studies actually distinct from the security discipline of studying side-channel attacks?

  9. Goodhart coordination

  10. Structural problems are hard, let’s do training programs

  11. Extraversion

  12. Is residual prediction different from adversarial prediction?

  13. Science communication for ML

  14. Human superorganisms

    1. Moral orbits.
    2. Revisit probability collectives
    3. Movement design
    4. Returns on hierarchy
    5. Effective collectivism
    6. Alignment
    7. Emancipating my tribe, the cruelty of collectivism (and why I love it anyway)
    8. Institutions for angels
    9. Institutional alignment
    10. Beliefs and rituals of tribes
    11. Where to deploy taboo
    12. The Great Society will never feel great, merely be better than the alternatives
    13. Egregores etc
    14. Player versus game
    15. Something about the fungibility of hipness and cash
    16. Monastic traditions
  15. Approximate conditioning

  16. Nested sampling

  17. What even are GFlowNets?

  18. Public sphere business models

  19. How to do house stuff (renovation etc)

  20. Power and inscrutability

  21. Strategic ignorance

  22. What is an energy-based model?? tl;dr: branding for models that handle likelihoods through a potential function that is not normalised to be a density. I do not think there is anything new about that per se? A sketch follows.
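
    For my own benefit, a minimal sketch of what I mean, on a 1-D toy of my own construction (not any particular paper’s): the model is just an energy function E(x), inducing p(x) ∝ exp(−E(x)); unadjusted Langevin dynamics samples from it using only ∇E, never the normaliser Z.

    ```python
    import numpy as np

    # Energy E(x) = x^2 / 2 induces p(x) ∝ exp(-E(x)), i.e. a standard normal,
    # but nothing below ever computes the normalising constant Z.
    def grad_energy(x):
        return x  # ∇E for E(x) = 0.5 * x**2

    def langevin_samples(n_steps=50_000, step=0.01, seed=0):
        """Unadjusted Langevin: x <- x - step * ∇E(x) + sqrt(2 * step) * noise."""
        rng = np.random.default_rng(seed)
        x, out = 0.0, []
        for _ in range(n_steps):
            x = x - step * grad_energy(x) + np.sqrt(2 * step) * rng.standard_normal()
            out.append(x)
        return np.array(out)

    s = langevin_samples()
    print(s.mean(), s.var())  # ~0 and ~1: we recovered N(0, 1) without knowing Z
    ```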

  23. Funny-shaped learning

    1. Causal attention
    2. Graphical ML
    3. Gradient message passing
    4. All inference is already variational inference
  24. Human learner series

    1. Which self?

    2. Is language symbolic?

    3. Our moral wetware

    4. Is is ought

    5. Morality under uncertainty and computational constraint

    6. Superstimuli

    7. Clickbait bandits

    8. Correlation construction

    9. Moral explainability

      1. Burkean conservatism is about unpacking when moral training data is out-of-distribution.
      2. Something about universal grammar and its learnable local approximations, versus universal ethics and its learnable local approximations. Morality by template, and the computational difficulty of moral identification. Leading by example, of necessity.
    10. Righting and wronging

    11. Akrasia in stochastic processes: What time-integrated happiness should we optimise?
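
      A minimal formalisation of the question, in my own gloss and notation: for a happiness process u(X_t), the two obvious objectives below need not agree, and non-ergodic or heavy-tailed processes are exactly where they come apart.

      ```latex
      % Discounted expected utility, discount rate \rho > 0:
      J_{\mathrm{disc}} = \mathbb{E}\left[ \int_0^\infty e^{-\rho t}\, u(X_t)\, \mathrm{d}t \right]
      % versus the ergodic time-average:
      J_{\mathrm{erg}} = \lim_{T \to \infty} \frac{1}{T} \int_0^T u(X_t)\, \mathrm{d}t
      % These can rank the same policy differently, which is (I think)
      % where the akrasia question bites.
      ```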

    12. Comfort traps ✅ Good enough for now

    13. Myths ✅ a few notes are enough

  25. Classification and society series

    1. Constructivist rationalism
    2. Affirming the consequent and evaporative tribalism
    3. Classifications are not very informative
    4. Adversarial categorization
    5. AUC and collateral damage
    6. Bias and base rates
    7. Decision theory
    8. Decision theory and prejudice
  26. Shouting at each other on the internet series (Teleological liberalism)

    1. Modern politics seems to be excellent at reducing the vast spectrum of policy space to two mediocre choices, then arguing about which one is worse. What is this tendency called?
    2. The Activist and decoupling games, and game-changing
    3. On being a good weak learner
    4. Lived evidence deductions and/or ad hominem for discussing genetic arguments.
    5. Diffusion of responsibility — is this distinct from messenger shooting?
    6. Iterative game theory of communication styles
    7. Invasive arguments
    8. Coalition games
    9. All We Need Is Hate
    10. Speech standards
    11. Startup justice warriors/move fast and cancel things
    12. Pluralism
  27. Learning in context

    1. Interaction effects are what we want
    2. Interpolation is what we want
    3. Optimal conditioning is what we want
    4. Correlation construction is easier than causation learning
  28. Epistemic community design

    1. Scientific community
    2. Messenger shooting
    3. On being a good weak learner
    4. Experimental ethics and surveillance
    5. Steps to an ecology of mind
    6. Epistemic bottlenecks probably belongs in this series too.
    7. Ensemble strategies at the population level. I don’t need to guess right; we need a society in which people in aggregate guess in a calibrated way.
  29. Epistemic bottlenecks and bandwidth problems

    1. Information versus learning as a fundamental question of ML. When do we store exemplars on disk? When do we make gradient updates instead? How much compute should we spend on compressing? (A toy sketch follows this list.)
    2. What is special about science? One thing is transmissibility. Can ChatGPT do transmission? Or is it 100% tacit? How does explainability relate to transmissibility?
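
    A toy contrast for the first question, with the setup and names my own invention: memorise exemplars on disk (nearest-neighbour lookup over raw data) versus learn (compress the same data into a few parameters by least squares).

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.uniform(-3, 3, size=200)
    y = np.sin(X) + 0.1 * rng.standard_normal(200)

    def predict_memorised(x):
        """Store all 200 exemplars; zero training compute, O(n) work per query."""
        return y[np.abs(X - x).argmin()]

    # Compress 200 exemplars into 6 polynomial coefficients: training compute
    # spent up front, then cheap queries and ~30x less storage.
    coefs = np.polyfit(X, y, deg=5)

    def predict_learned(x):
        return np.polyval(coefs, x)

    for x in (0.5, 1.5, 2.5):
        print(x, np.sin(x), predict_memorised(x), predict_learned(x))
    ```
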
  30. DIY and the feast of fools

  31. Tail risks and epistemic uncertainty

    1. Black swan farming
    2. Wicked tail risks
    3. Planning under uncertainty
  32. Economic dematerialization via

    1. Enclosing the intellectual commons
    2. Creative economy jobs
  33. Academic publications as Veblen goods

  34. Stein variational gradient descent

  35. Edge of chaos, history of

  36. X is Yer than Z

  37. But what can I do?

    1. Starfish problems
    2. Ethical consumption
    3. Prefigurative politics
  38. Haunting and exchangeability. Connections to interpolation, individuation, legibility, and nonparametrics.

  39. Doing complicated things naively

  40. Conspiracies as simulations

  41. Something about the limits of legible fairness versus metis in common property regimes

  42. The uncanny ally

  43. Elliptical belief propagation

  44. Strategic ignorance

  45. Privilege accountancy

  46. Anthropic principles ✅ Good enough

  47. You can’t talk about us without us ❌ what did I even mean? something about mottes and baileys?

  48. Subculture dynamics ✅ Good enough

  49. Opinion dynamics (memetics for beginners) ✅ Good enough

  50. Table stakes versus tokenism

  51. Iterative game theory under bounded rationality ❌ too general

  52. Memetics ❌ (too big, will never finish)

  53. Cradlesnatch calculator ✅ Good enough

3 Music stuff

4 Misc

5 Workflow optimization

6 Graphical models

7 “Transfer” learning

8 Custom diffusion

9 Commoncog

10 Music skills

11 Internal

12 ICML 2023 workshop

13 Neurips 2022 follow-ups

  1. Arya et al. (2022) — stochastic gradients are more general than deterministic ones because they remain defined for discrete variables (see the sketch after this list)
  2. Rudner et al. (2022)
  3. Phillips et al. (2022) — diffusions in the spectral domain allow us to handle continuous function valued inputs
  4. Gahungu et al. (2022)
  5. Wu, Maruyama, and Leskovec (2022) LE-PDE is a learnable low-rank approximation method
  6. Holl, Koltun, and Thuerey (2022) — physics losses via forward simulations, without the need for sensitivities.
  7. Neural density estimation
  8. Metrics for inverse design and inverse inference problems: the former is in fact easier. Or is it? Can we get away with optimising only the forward prediction loss?
  9. Noise injection in emulator learning (see refs in Su et al. (2022))
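
On the Arya et al. item above: the following is not their construction (the paper develops a different estimator); it is the classic score-function trick, included only to illustrate the bare claim that the gradient of an expectation is well-defined even when the random variable is discrete.

```python
import numpy as np

# d/dp E[f(b)] for b ~ Bernoulli(p): f(b) has no pathwise derivative in b,
# but the score-function (REINFORCE) estimator is unbiased anyway.
rng = np.random.default_rng(0)

def f(b):
    return 3.0 * b + 1.0  # any function of the discrete outcome

p = 0.3
b = rng.random(100_000) < p                # Bernoulli(p) draws
score = np.where(b, 1 / p, -1 / (1 - p))   # d/dp log P(b; p)
print(np.mean(f(b) * score))               # ~3.0 = f(1) - f(0), the exact gradient
```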

14 Conferences, publication venues

15 Neurips 2022

16 Neurips 2021

17 Music

Nestup / cutelabnyc/nested-tuplets: Fancy JavaScript for manipulating nested tuplets.

18 Hot topics

19 Stein stuff

20 Newsletter migration

21 GP research

21.1 Invenia’s GP expansion ideas

22 SDEs, optimization and gradient flows

Nguyen and Malinsky (2020)

Statistical Inference via Convex Optimization.

Conjugate functions illustrated.

Francis Bach on the use of geometric sums and a different take by Julyan Arbel.

Tutorial on approximating differentiable control problems. An extension of this is universal differential equations.

23 Career tips and metalearning

24 Ensembles and particle methods

25 Foundations of ML

So much Michael Betancourt.

26 Nonparametrics

27 References

Arya, Schauer, Schäfer, et al. 2022. “Automatic Differentiation of Programs with Discrete Randomness.” In.
Gahungu, Lanyon, Álvarez, et al. 2022. “Adjoint-Aided Inference of Gaussian Process Driven Differential Equations.” In.
Holl, Koltun, and Thuerey. 2022. “Scale-Invariant Learning by Physics Inversion.” In.
Lai, Takida, Murata, et al. 2022. “Regularizing Score-Based Models with Score Fokker-Planck Equations.” In.
Nguyen, and Malinsky. 2020. “Exploration and Implementation of Neural Ordinary Differential Equations.”
Phillips, Seror, Hutchinson, et al. 2022. “Spectral Diffusion Processes.” In.
Rudner, Chen, Teh, et al. 2022. “Tractable Function-Space Variational Inference in Bayesian Neural Networks.” In.
Su, Kempe, Fielding, et al. 2022. “Adversarial Noise Injection for Learned Turbulence Simulations.” In.
Wu, Maruyama, and Leskovec. 2022. “Learning to Accelerate Partial Differential Equations via Latent Global Evolution.”