New posts in neural-networks

Application of the chain rule to a $3$-layer neural network

Which are good books for applications of Shannon Information Theory?

Category Theory & Artificial Intelligence (AI)

Why can't the set of algebraic polynomials of degree at most $k$ be dense in $C(\mathbb{R}^n)$?

What is meant by "function on the unit cube"?

CS231N Backpropagation gradient

Is there a connection between topological mixing and squashing functions used in neural networks?

Why does the sup norm make the results of approximation theory independent from the unknown distribution of the input data?

How do I solve $\frac{\partial}{\partial w_{jk}} \sum_j w_{jk} \cdot o_j$?

Why use the kernel trick in an SVM as opposed to just transforming the data?

Scaling factor and weights in Unscented Transform (UKF)

How can I derive the backpropagation formula in a more elegant way?

Tricky proof of a result in Michael Nielsen's book "Neural Networks and Deep Learning"

Neural Network matrix calculus

What areas of math can be tackled by artificial intelligence?

Why do deep neural networks work well?

Derivative of a vector with respect to a matrix

How does Variational AutoEncoder (VAE) get mean and variance?

What are the best books to study Neural Networks from a purely mathematical perspective?