Max Wolf's Second Brain
Search
Search
Dark mode
Light mode
Explorer
general
actor critic
advantage
algebra
Bayes Theorem
bernoulli
bias
bias-variance tradeoff
bijective
binomial coefficient
binomial distribution
bootstrapping
borel set
central moment
chain rule of probability
closure
column space
combinatorics
conditional probability
congruence
congruence class
Consciousness as a coherence-inducing operator - Cosciousness is virtual
correlation matrix
coset
covariance
credit assignment
cross-attention
cross-entropy
cross-entropy loss
cumulative distribution function
curiosity
cyclic group
derivatives
determinant
diagonal matrix
DQN
eigendecomposition
eigenspace
eigenvalue
Embracing curiosity eliminates the exploration-exploitation dilemma
epsilon greedy
expected value
factor group
factorization
first order optimization
fisher information
frobenius norm
function
gaussian elimination
graph
High-Dimensional Continuous Control Using Generalized Advantage Estimation
homomorphism
independent
initialization
Introduction to RL
inverse
inverse matrix
isomorph
isotropic
isotropic gaussian
KL-divergence
law of total probability
likelihood
line segment
linear least squares regression
linear systems of equations
log-likelihood
log-sum-exp trick
logarithm
markov chain monte carlo
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
matrix
mean
Mixture of A Million Experts
monte carlo methods
monte carlo tree search
Multi-Agent Advantage Decomposition Theorem
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
multivariate gaussian distribution
negative log-likelihood loss
Neural Ordinary Differential Equations - Paper
neutral element
normal subgroup
note-taking
null space
obsidian markdown features
Open-Endedness is Essential for Artificial Superhuman Intelligence
order
orthogonal
orthogonal complement
Pascal's triangle
permutation
pivot
poisson distribution
policy
policy gradient
policy gradient theorem
policy iteration
potentiation
PPO
preimage
probability
probability density function
probability distribution
probability mass function
pseudo inverse
Q-Learning
Q-value
random variable
random walk
rank
REINFORCE
reinforcement learning
reliability
reproduction
Requirements for self organization
ResNet
robust
roots of unity
row space
sampling
SARSA
score function
second moment
self-attention
sensitivity
set
singular value decomposition
span
SPARTA - Distributed Training with Sparse Parameter Averaging
specificity
standard normal distribution
state-value
stationary distribution
statistics
subgroup
surjective
surprise
symmetric matrix
TD Lambda
temporal difference learning
test
The Bitter Lesson
the second brain
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
The Unbearable Slowness of Being
transformer
Transformer Squared - Self-Adaptive LLMs
TRPO
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
TU-Wien ADM Übungen
unitary
value function
value iteration
Home
❯
general
Folder: general
153 items under this folder.
Jan 23, 2025
function
Jan 23, 2025
gaussian elimination
Jan 23, 2025
graph
Jan 23, 2025
homomorphism
Jan 23, 2025
independent
Jan 23, 2025
initialization
Jan 23, 2025
inverse matrix
Jan 23, 2025
inverse
Jan 23, 2025
isomorph
Jan 23, 2025
isotropic gaussian
Jan 23, 2025
isotropic
Jan 23, 2025
law of total probability
Jan 23, 2025
likelihood
Jan 23, 2025
line segment
ccc
Jan 23, 2025
linear least squares regression
Jan 23, 2025
linear systems of equations
Jan 23, 2025
log-likelihood
Jan 23, 2025
log-sum-exp trick
Jan 23, 2025
logarithm
Jan 23, 2025
markov chain monte carlo
Jan 23, 2025
matrix
Jan 23, 2025
mean
Jan 23, 2025
monte carlo methods
Jan 23, 2025
monte carlo tree search
Jan 23, 2025
multivariate gaussian distribution
Jan 23, 2025
negative log-likelihood loss
Jan 23, 2025
neutral element
Jan 23, 2025
normal subgroup
Jan 23, 2025
note-taking
Jan 23, 2025
null space
Jan 23, 2025
obsidian markdown features
Jan 23, 2025
order
Jan 23, 2025
orthogonal complement
Jan 23, 2025
orthogonal
Jan 23, 2025
permutation
Jan 23, 2025
pivot
Jan 23, 2025
poisson distribution
Jan 23, 2025
policy gradient theorem
Jan 23, 2025
policy gradient
Jan 23, 2025
policy iteration
Jan 23, 2025
policy
Jan 23, 2025
potentiation
Jan 23, 2025
preimage
Jan 23, 2025
probability density function
Jan 23, 2025
probability distribution
Jan 23, 2025
probability mass function
Jan 23, 2025
probability
Jan 23, 2025
pseudo inverse
Jan 23, 2025
random variable
Jan 23, 2025
random walk
Jan 23, 2025
rank
Jan 23, 2025
reinforcement learning
Jan 23, 2025
reliability
Jan 23, 2025
reproduction
Jan 23, 2025
robust
Jan 23, 2025
roots of unity
Jan 23, 2025
row space
Jan 23, 2025
sampling
Jan 23, 2025
score function
Jan 23, 2025
second moment
Jan 23, 2025
self-attention
Jan 23, 2025
sensitivity
Jan 23, 2025
set
Jan 23, 2025
singular value decomposition
Jan 23, 2025
span
Jan 23, 2025
specificity
Jan 23, 2025
standard normal distribution
Jan 23, 2025
state-value
Jan 23, 2025
stationary distribution
Jan 23, 2025
statistics
Jan 23, 2025
subgroup
Jan 23, 2025
surjective
Jan 23, 2025
surprise
Jan 23, 2025
symmetric matrix
Jan 23, 2025
temporal difference learning
Jan 23, 2025
test
Jan 23, 2025
the second brain
Jan 23, 2025
transformer
Jan 23, 2025
unitary
Jan 23, 2025
value function
Jan 23, 2025
value iteration
Jan 23, 2025
Bayes Theorem
Jan 23, 2025
Consciousness as a coherence-inducing operator - Cosciousness is virtual
Jan 23, 2025
DQN
Jan 23, 2025
Embracing curiosity eliminates the exploration-exploitation dilemma
Jan 23, 2025
High-Dimensional Continuous Control Using Generalized Advantage Estimation
Jan 23, 2025
Introduction to RL
Jan 23, 2025
KL-divergence
Jan 23, 2025
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Jan 23, 2025
Mixture of A Million Experts
Jan 23, 2025
Multi-Agent Advantage Decomposition Theorem
Jan 23, 2025
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Jan 23, 2025
Neural Ordinary Differential Equations - Paper
Jan 23, 2025
Open-Endedness is Essential for Artificial Superhuman Intelligence
Jan 23, 2025
PPO
Jan 23, 2025
Pascal's triangle
Jan 23, 2025
Q-Learning
Jan 23, 2025
Q-value
Jan 23, 2025
REINFORCE
Jan 23, 2025
Requirements for self organization
Jan 23, 2025
ResNet
Jan 23, 2025
SARSA
Jan 23, 2025
SPARTA - Distributed Training with Sparse Parameter Averaging
Jan 23, 2025
TD Lambda
Jan 23, 2025
TRPO
Jan 23, 2025
TU-Wien ADM Übungen
Jan 23, 2025
The Bitter Lesson
Jan 23, 2025
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
Jan 23, 2025
The Unbearable Slowness of Being
Jan 23, 2025
Transformer Squared - Self-Adaptive LLMs
Jan 23, 2025
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Jan 23, 2025
actor critic
Jan 23, 2025
advantage
Jan 23, 2025
algebra
Jan 23, 2025
bernoulli
Jan 23, 2025
bias-variance tradeoff
Jan 23, 2025
bias
Jan 23, 2025
bijective
Jan 23, 2025
binomial coefficient
Jan 23, 2025
binomial distribution
Jan 23, 2025
bootstrapping
Jan 23, 2025
borel set
Jan 23, 2025
central moment
Jan 23, 2025
chain rule of probability
Jan 23, 2025
closure
Jan 23, 2025
column space
Jan 23, 2025
combinatorics
Jan 23, 2025
conditional probability
Jan 23, 2025
congruence class
Jan 23, 2025
congruence
Jan 23, 2025
correlation matrix
Jan 23, 2025
coset
Jan 23, 2025
covariance
Jan 23, 2025
credit assignment
Jan 23, 2025
cross-attention
Jan 23, 2025
cross-entropy loss
Jan 23, 2025
cross-entropy
Jan 23, 2025
cumulative distribution function
Jan 23, 2025
curiosity
Jan 23, 2025
cyclic group
Jan 23, 2025
derivatives
Jan 23, 2025
determinant
Jan 23, 2025
diagonal matrix
Jan 23, 2025
eigendecomposition
Jan 23, 2025
eigenspace
Jan 23, 2025
eigenvalue
Jan 23, 2025
epsilon greedy
Jan 23, 2025
expected value
Jan 23, 2025
factor group
Jan 23, 2025
factorization
Jan 23, 2025
first order optimization
Jan 23, 2025
fisher information
Jan 23, 2025
frobenius norm