New posts in neural-network

scale the loss value according to "badness" in caffe

In which cases is the cross-entropy preferred over the mean squared error? [closed]

Keras LSTM: predict multiple sequences from single input array

Can neural networks approximate any function given enough hidden neurons?

gensim word2vec: Find number of words in vocabulary

Why does prediction needs batch size in Keras?

What is a projection layer in the context of neural networks?

Why do we need to explicitly call zero_grad()? [duplicate]

Octave : logistic regression : difference between fmincg and fminunc

PyTorch / Gensim - How to load pre-trained word embeddings

DCGAN debugging. Getting just garbage

Tensorflow: How to replace or modify gradient?

Python scikit learn MLPClassifier "hidden_layer_sizes"

What is the number of filter in CNN?

Meaning of an Epoch in Neural Networks Training

Does Caffe need data to be shuffled?

How useful is Turing completeness? are neural nets turing complete?

caffe data layer example step by step

How to log Keras loss output to a file

How does one debug NaN values in TensorFlow?