Max Wolf's Second Brain
Folder: general
226 items under this folder.
Apr 16, 2025
translation invariance
Apr 16, 2025
unitary
Apr 16, 2025
value function
Apr 16, 2025
value iteration
Apr 16, 2025
variance
Apr 16, 2025
vector
Apr 16, 2025
weight space
Apr 16, 2025
simulated annealing
Apr 16, 2025
singular value decomposition
Apr 16, 2025
songs of life and mind
Apr 16, 2025
span
Apr 16, 2025
specificity
Apr 16, 2025
spectral radius
Apr 16, 2025
spline
Apr 16, 2025
standard normal distribution
Apr 16, 2025
state-value
Apr 16, 2025
stationary distribution
Apr 16, 2025
statistics
Apr 16, 2025
subgroup
Apr 16, 2025
surjective
Apr 16, 2025
surprise
Apr 16, 2025
symmetric group
Apr 16, 2025
symmetric matrix
Apr 16, 2025
symmetric
Apr 16, 2025
temporal difference learning
Apr 16, 2025
test
Apr 16, 2025
the big bang
Apr 16, 2025
the second brain
Apr 16, 2025
transformer
Apr 16, 2025
preimage
Apr 16, 2025
probability density function
Apr 16, 2025
probability distribution
Apr 16, 2025
probability mass function
Apr 16, 2025
probability
Apr 16, 2025
pseudo inverse
Apr 16, 2025
random variable
Apr 16, 2025
random walk
Apr 16, 2025
rank
Apr 16, 2025
reinforcement learning
Apr 16, 2025
reliability
Apr 16, 2025
reproduction
Apr 16, 2025
reservoir computing
Apr 16, 2025
robust
Apr 16, 2025
roots of unity
Apr 16, 2025
row space
Apr 16, 2025
sampling
Apr 16, 2025
scale-free
Apr 16, 2025
score function
Apr 16, 2025
second moment
Apr 16, 2025
self-attention
Apr 16, 2025
sensitivity
Apr 16, 2025
set
Apr 16, 2025
multivariate gaussian distribution
Apr 16, 2025
negative log-likelihood loss
Apr 16, 2025
neighbourhood
Apr 16, 2025
neuron
Apr 16, 2025
neutral element
Apr 16, 2025
norm
Apr 16, 2025
normal subgroup
Apr 16, 2025
note-taking
Apr 16, 2025
null space
Apr 16, 2025
obsidian markdown features
Apr 16, 2025
order
Apr 16, 2025
orthogonal complement
Apr 16, 2025
orthogonal
Apr 16, 2025
permutation equivariance
Apr 16, 2025
permutation invariance
Apr 16, 2025
permutation
Apr 16, 2025
pivot
Apr 16, 2025
poisson distribution
Apr 16, 2025
policy gradient theorem
Apr 16, 2025
policy gradient
Apr 16, 2025
policy iteration
Apr 16, 2025
policy
Apr 16, 2025
potentiation
Apr 16, 2025
preference-based RL
Apr 16, 2025
likelihood
Apr 16, 2025
line segment
Apr 16, 2025
linear least squares regression
Apr 16, 2025
linear systems of equations
Apr 16, 2025
lipschitz continuity
Apr 16, 2025
liquid state machine
Apr 16, 2025
log-likelihood
Apr 16, 2025
log-sum-exp trick
Apr 16, 2025
logarithm
Apr 16, 2025
loss
Apr 16, 2025
machine
Apr 16, 2025
markov chain monte carlo
Apr 16, 2025
matrix minor
Apr 16, 2025
matrix
Apr 16, 2025
mean absolute error
Apr 16, 2025
mean squared error
Apr 16, 2025
mean
Apr 16, 2025
meaning of life
Apr 16, 2025
memory
Apr 16, 2025
message passing
Apr 16, 2025
mind
Apr 16, 2025
mode connectivity
Apr 16, 2025
momentum
Apr 16, 2025
monte carlo methods
Apr 16, 2025
monte carlo tree search
Apr 16, 2025
multi-head attention
Apr 16, 2025
first order optimization
Apr 16, 2025
fisher information
Apr 16, 2025
frobenius norm
Apr 16, 2025
function
Apr 16, 2025
gaussian elimination
Apr 16, 2025
goal
Apr 16, 2025
graph
Apr 16, 2025
homomorphism
Apr 16, 2025
independent
Apr 16, 2025
inductive bias
Apr 16, 2025
initialization
Apr 16, 2025
intelligence
Apr 16, 2025
intrinsic curiosity module
Apr 16, 2025
invariance
Apr 16, 2025
inverse matrix
Apr 16, 2025
inverse
Apr 16, 2025
isomorph
Apr 16, 2025
isotropic gaussian
Apr 16, 2025
isotropic
Apr 16, 2025
law of total probability
Apr 16, 2025
learning
Apr 16, 2025
life
Apr 16, 2025
cross product
Apr 16, 2025
cross-attention
Apr 16, 2025
cross-entropy loss
Apr 16, 2025
cross-entropy
Apr 16, 2025
cumulative distribution function
Apr 16, 2025
curiosity
Apr 16, 2025
cyclic group
Apr 16, 2025
definitions
Apr 16, 2025
determinant
Apr 16, 2025
diagonal matrix
Apr 16, 2025
differentiability
Apr 16, 2025
dot product
Apr 16, 2025
echo state network
Apr 16, 2025
eigendecomposition
Apr 16, 2025
eigenspace
Apr 16, 2025
eigenvalue
Apr 16, 2025
eligibility trace
Apr 16, 2025
epsilon greedy
Apr 16, 2025
equivariance
Apr 16, 2025
evolutionary optimization
Apr 16, 2025
expected value
Apr 16, 2025
extended euclidean algorithm
Apr 16, 2025
factor group
Apr 16, 2025
factorization
Apr 16, 2025
binomial distribution
Apr 16, 2025
bootstrapping
Apr 16, 2025
borel set
Apr 16, 2025
cartesian product
Apr 16, 2025
causal attention
Apr 16, 2025
cellular automata
Apr 16, 2025
central moment
Apr 16, 2025
chain rule of probability
Apr 16, 2025
closure
Apr 16, 2025
cma-es
Apr 16, 2025
column space
Apr 16, 2025
combinatorics
Apr 16, 2025
computational irreducibility
Apr 16, 2025
conditional probability
Apr 16, 2025
congruence class
Apr 16, 2025
congruence
Apr 16, 2025
consciousness
Apr 16, 2025
continuous
Apr 16, 2025
correlation matrix
Apr 16, 2025
coset
Apr 16, 2025
covariance
Apr 16, 2025
credit assignment
Apr 16, 2025
critical state
Apr 16, 2025
TU-Wien ADM Übungen
Apr 16, 2025
The Bitter Lesson
Apr 16, 2025
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
Apr 16, 2025
The Unbearable Slowness of Being
Apr 16, 2025
Transformer Squared - Self-Adaptive LLMs
Apr 16, 2025
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Apr 16, 2025
activation space
Apr 16, 2025
actor critic
Apr 16, 2025
advantage
Apr 16, 2025
agency
Apr 16, 2025
agent
Apr 16, 2025
algebra
Apr 16, 2025
astrocyte
Apr 16, 2025
attention (ML)
Apr 16, 2025
attention (general)
Apr 16, 2025
beauty
Apr 16, 2025
bernoulli
Apr 16, 2025
bias-variance tradeoff
Apr 16, 2025
bias
Apr 16, 2025
bijective
Apr 16, 2025
binomial coefficient
Apr 16, 2025
Motif - Intrinsic Motivation from Artificial Intelligence Feedback
Apr 16, 2025
Multi-Agent Advantage Decomposition Theorem
Apr 16, 2025
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Apr 16, 2025
Neural Ordinary Differential Equations - Paper
Apr 16, 2025
On the Measure of Intelligence
Apr 16, 2025
Open-Endedness is Essential for Artificial Superhuman Intelligence
Apr 16, 2025
PPO
Apr 16, 2025
Pascal's triangle
Apr 16, 2025
Q-Learning
Apr 16, 2025
Q-value
Apr 16, 2025
REINFORCE
Apr 16, 2025
Requirements for self organization
Apr 16, 2025
ResNet
Apr 16, 2025
Reservoir Computing - A New Paradigm for Neural Networks
Apr 16, 2025
SARSA
Apr 16, 2025
SPARTA - Distributed Training with Sparse Parameter Averaging
Apr 16, 2025
Streaming Deep Reinforcement Learning Finally Works
Apr 16, 2025
TD Lambda
Apr 16, 2025
TRPO
Apr 16, 2025
Automating the Search for Artificial Life with Foundation Models
Apr 16, 2025
Bayes Theorem
Apr 16, 2025
Consciousness as a coherence-inducing operator - Consciousness is virtual
Apr 16, 2025
Curiosity-driven Exploration by Self-supervised Prediction
Apr 16, 2025
DQN
Apr 16, 2025
Embracing curiosity eliminates the exploration-exploitation dilemma
Apr 16, 2025
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Apr 16, 2025
High-Dimensional Continuous Control Using Generalized Advantage Estimation
Apr 16, 2025
Increasing Liquid State Machine Performance with Edge-of-Chaos Dynamics Organized by Astrocyte-modulated Plasticity
Apr 16, 2025
Introduction to RL
Apr 16, 2025
KL-divergence
Apr 16, 2025
Large Memory Layers with Product Keys
Apr 16, 2025
LeCun Initialization
Apr 16, 2025
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Apr 16, 2025
Mixture of A Million Experts