Max Wolf's Second Brain

    ensembles


    Backlinks

    • SPARTA - Distributed Training with Sparse Parameter Averaging
    • epistemic uncertainty

    Created with Quartz v4.4.0 © 2025
