These commits are when the Protocol Buffers files have changed: (only the last 100 relevant commits are shown)
Commit: | 25e0cec | |
---|---|---|
Author: | honggui |
add support acl batch normal,direct conv, local connect, concat layers
The documentation is generated from this commit.
Commit: | 11930f1 | |
---|---|---|
Author: | Jonathan R. Williford |
Clarify batch norm parameter documentation.
Commit: | 929135b | |
---|---|---|
Author: | Evan Shelhamer | |
Committer: | GitHub |
Merge pull request #5210 from ftokarev/patches Obsolete reference to `bool solver` in caffe.proto
Commit: | 3a0b6c6 | |
---|---|---|
Author: | Fyodor Tokarev | |
Committer: | Fyodor Tokarev |
Update a comment in caffe.proto
Commit: | 3ba2054 | |
---|---|---|
Author: | Cyprien Noel | |
Committer: | Cyprien Noel |
Switched multi-GPU to NCCL
Commit: | db66432 | |
---|---|---|
Author: | Zhou Mo |
fix many typos by using codespell
Commit: | 3d62e3c | |
---|---|---|
Author: | Evan Shelhamer | |
Committer: | Evan Shelhamer |
sigmoid cross-entropy loss: normalize loss by different schemes sig-ce loss handles all the same normalizations as the softmax loss; refer to #3296 for more detail. this preserves the default normalization for sig-ce loss: batch size.
Commit: | bdb9457 | |
---|---|---|
Author: | Alican Bozkurt |
add default value for rms_decay
Commit: | 5f2d845 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add RecurrentLayer: an abstract superclass for other recurrent layer types
Commit: | c419f85 | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
add parameter layer for learning any bottom
Commit: | 859cf6e | |
---|---|---|
Author: | Kun Wang |
Fix an error in the example of ReshapeParameter. * this small mistake may confuse newer.
Commit: | 77cde9c | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Net: setting `propagate_down: true` forces backprop
Commit: | 952fd17 | |
---|---|---|
Author: | max argus | |
Committer: | max argus |
Extend Crop to N-D, changed CropParameter.
Commit: | ca9fa49 | |
---|---|---|
Author: | max argus | |
Committer: | max argus |
Crop: fixes, tests and negative axis indexing.
Commit: | 64e78bd | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | max argus |
add CropLayer: crop blob to another blob's dimensions with offsets configure offset(s) through proto definition.
Commit: | bddd04b | |
---|---|---|
Author: | Evan Shelhamer | |
Committer: | Evan Shelhamer |
deprecate input fields and upgrade automagically
Commit: | 00598ca | |
---|---|---|
Author: | Evan Shelhamer | |
Committer: | Evan Shelhamer |
add InputLayer for Net input Create an input layer to replace oddball Net `input` fields.
Commit: | 8f847fa | |
---|---|---|
Author: | Youssef Kashef | |
Committer: | Youssef Kashef |
tranpose parameter added to IP layer to support tied weights in an autoencoder. Arguments to matrix multiplication function are conditioned on this parameter, no actual transposing takes place. test ip gradient computation with transpose on
Commit: | 0816907 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Separation and generalization of ChannelwiseAffineLayer into BiasLayer and ScaleLayer. The behavior of ChannelwiseAffineLayer can be reproduced by a ScaleLayer with `scale_param { bias_term: true }`. BiasLayer and ScaleLayer each take 1 or 2 bottoms, with the output having the same shape as the first. The second input -- either another bottom or a learned parameter -- will have its axes (virtually) broadcast and tiled to have the same shape as the first, after which elementwise addition (Bias) or multiplication (Scale) is performed.
Commit: | ec04197 | |
---|---|---|
Author: | Dmytro Mishkin | |
Committer: | Jeff Donahue |
Add ChannelwiseAffine for batch norm
Commit: | a7ac8bc | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #3388 from mohomran/exponential_linear_units Exponential Linear Units
Commit: | 3e3e9ce | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
add short description of dilation to caffe.proto
Commit: | 93bfcb5 | |
---|---|---|
Author: | Fisher Yu | |
Committer: | Jonathan L Long |
add support for 2D dilated convolution
Commit: | a668194 | |
---|---|---|
Author: | Mohamed Omran | |
Committer: | Mohamed Omran |
ELU layer with basic tests
Commit: | 8b2aa70 | |
---|---|---|
Author: | Carl Doersch | |
Committer: | Carl Doersch |
Better normalization options for SoftmaxWithLoss layer.
Commit: | 39f69fb | |
---|---|---|
Author: | Jeff Donahue |
Merge pull request #3229 from cdoersch/batchnorm2 Yet another batch normalization PR
Commit: | a52ee65 | |
---|---|---|
Author: | Carl Doersch | |
Committer: | Carl Doersch |
Cleanup batch norm layer, include global stats computation
Commit: | 0eea815 | |
---|---|---|
Author: | Ronghang Hu | |
Committer: | Ronghang Hu |
Change solver type to string and provide solver registry
Commit: | 321720d | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #3160 from shelhamer/cudnnV3 Basic cuDNN v3 support
Commit: | ecac7ff | |
---|---|---|
Author: | Simon Layton | |
Committer: | Evan Shelhamer |
Initial cuDNN v3 support
Commit: | 6c02c8b | |
---|---|---|
Author: | Tim Meinhardt | |
Committer: | Tim Meinhardt |
Add argmax_param axis
Commit: | 9d8206e | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Im2col and Convolution layers support N spatial axes
Commit: | 4c2ff16 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
caffe.proto: generalize ConvolutionParameter to N spatial axes
Commit: | 251e67a | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add TileLayer
Commit: | 80579b8 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #2032 from jeffdonahue/embed-layer Embed layer for lookup table of one hot encodings
Commit: | 4e4c89b | |
---|---|---|
Author: | PatWie | |
Committer: | Ronghang Hu |
Adam solver This commit implements the Adam solver by Kingma et. al for CPU and GPU. All solver parameters are defined in the caffe.proto. This also adds an example for the MNIST dataset.
Commit: | bb0a90e | |
---|---|---|
Author: | Ronghang Hu |
Merge pull request #2903 from ronghanghu/multi_gpu Multi-GPU Data Parallelism
Commit: | 0d34d5b | |
---|---|---|
Author: | Ronghang Hu | |
Committer: | Ronghang Hu |
Data Layers Parallel for Multi-GPU Allow data layers (and also PythonLayer when used as data layer) to be shared among worker solver's training net, and also test net for future-proof if one wants to do Multi-GPU testing. Data layers are locked during forward to ensure sequential forward.
Commit: | 1ce3380 | |
---|---|---|
Author: | Mohamed Omran | |
Committer: | Matthias Plappert |
Implement AdaDelta; add test cases; add mnist examples
Commit: | bcc8f50 | |
---|---|---|
Author: | Cyprien Noel | |
Committer: | Evan Shelhamer |
Add DataReader for parallel training with one DB session - Make sure each solver accesses a different subset of the data - Sequential reading of DB for performance - Prefetch a configurable amount of data to host memory - Distribute data to solvers in round-robin way for determinism
Commit: | abe99e8 | |
---|---|---|
Author: | Eren Golge | |
Committer: | Ronghang Hu |
Implement RMSProp Solver Implement RMSProp solver and cleaned up to adjust to new solver interface that uses accumulated gradients and refactored regularization.
Commit: | 4d299c3 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add EmbedLayer for inner products with sparse input (one-hot vectors), with unit tests
Commit: | 4227828 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
temporarily switch the snapshot_format default back to BINARYPROTO out of anticipation for user issues due to issue #2885, which causes Caffe to crash when it attempts to snapshot nets with duplicate layer names
Commit: | ada055b | |
---|---|---|
Author: | Eric Tzeng | |
Committer: | Eric Tzeng |
Snapshot model weights/solver state to HDF5 files. Summary of changes: - HDF5 helper functions were moved into a separate file util/hdf5.cpp - hdf5_save_nd_dataset now saves n-d blobs, can save diffs instead of data - Minor fix for memory leak in HDF5 functions (delete instead of delete[]) - Extra methods have been added to both Net/Solver enabling snapshotting and restoring from HDF5 files - snapshot_format was added to SolverParameters, with possible values HDF5 or BINARYPROTO (default HDF5) - kMaxBlobAxes was reduced to 32 to match the limitations of HDF5
Commit: | f973819 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Eric Tzeng |
add double_data, double_diff to BlobProto for weights/snapshots saved when using Dtype == double
Commit: | a756cfe | |
---|---|---|
Author: | Takuya Narihira | |
Committer: | Evan Shelhamer |
PythonLayer takes parameters by string
Commit: | e7b2b4e | |
---|---|---|
Author: | philkr |
ImageData layer default batch size of 1, and check for zero batch size
Commit: | 823d055 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add ReductionLayer to reduce any number of "tail" axes to a scalar value Currently implements operations SUM, MEAN, ASUM (sum of absolute values), and SUMSQ (sum of squares)
Commit: | eb442b9 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
FlattenLayer gets a FlattenParameter with an axis, end_axis
Commit: | 8c72fe3 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add LogLayer
Commit: | aeef453 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #1977 from shelhamer/accum-grad Decouple the computational batch size and minibatch size by accumulating gradients
Commit: | 8b05a02 | |
---|---|---|
Author: | Jeff Donahue |
Merge pull request #2410 from sguada/datum_transform Datum transform
Commit: | 41cf06c | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Evan Shelhamer |
zero-init param diffs and accumulate gradients (With layers whose backward accumulates gradients), this effectively decouples the computational batch from the SGD minibatch. Each iteration accumulates gradients over iter_size batches, then parameters are updated.
Commit: | c255709 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #1946 from nickcarlevaris/msra_init Add MSRAFiller, an Xavier-like filler designed for use with ReLUs
Commit: | 65af68d | |
---|---|---|
Author: | Nick Carlevaris-Bianco | |
Committer: | Evan Shelhamer |
Added MSRAFiller, an Xavier-like filler designed for use with ReLUs ...instead of tanh. Based on paper: He et al, "Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification," 2015. - add VarianceNorm option to FillerParameters which allows one to normalize by fan_in, fan_out or their average. - update XavierFiller to use the VarianceNorm option (default behavior unchanged). - add tests for MSRAFiller and XavierFiller.
Commit: | dbd8319 | |
---|---|---|
Author: | Jonathan L Long |
clean up redundant message comments
Commit: | 352aef4 | |
---|---|---|
Author: | Jeff Donahue |
Merge pull request #2466 from ducha-aiki/mvn-less Remove unnecessary variance computation from backward in MVN layer
Commit: | e8d93cb | |
---|---|---|
Author: | Jeff Donahue |
Merge pull request #2095 from mtamburrano/skip_propagate_down_param Added param skip_propagate_down to LayerParameter
Commit: | b866d14 | |
---|---|---|
Author: | Dmytro Mishkin |
Remove unnecessary variance computation from backward in MVN layer
Commit: | c7c4c64 | |
---|---|---|
Author: | manuele |
Added "propagate_down" param to LayerParameter
Commit: | 21032b2 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add ReshapeParameter axis and num_axes to reshape only a particular span of the input shape
Commit: | fa6169e | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
ReshapeLayer fixups for ND blobs
Commit: | 4fb3c9e | |
---|---|---|
Author: | Simon Safar | |
Committer: | Jeff Donahue |
Added a Reshape layer for copying-free modification of blob dimensions.
Commit: | 35a5df5 | |
---|---|---|
Author: | Jeff Donahue |
Merge pull request #2177 from pgao/spp_layer Spatial Pyramid Pooling Layer
Commit: | 438cf0e | |
---|---|---|
Author: | PETER_GAO | |
Committer: | PETER_GAO |
Spatial Pyramid Pooling Layer
Commit: | ca673fd | |
---|---|---|
Author: | Nick Carlevaris-Bianco |
Added support for original implementation, using (margin - d^2), through the legacy_version parameter.
Commit: | b963008 | |
---|---|---|
Author: | Sergio Guadarrama | |
Committer: | Sergio Guadarrama |
Allow Transform of encoded datum. Allow initialize transformed_blob from datum or transform params. Allow force_color and force_gray as transform params.
Commit: | 6fe2b04 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
HDF5DataLayer shuffle: minor cleanup; clarification in HDF5DataParameter
Commit: | 249aba4 | |
---|---|---|
Author: | wieschol | |
Committer: | Jeff Donahue |
shuffle data
Commit: | bb5bf43 | |
---|---|---|
Author: | Takuya Narihira | |
Committer: | Takuya Narihira |
PReLU Layer and its tests described in Kaiming He et al, "Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification", arxiv 2015. Belows are commit message histories that I had while developing. PReLULayer takes FillerParameter for init PReLU testing consistency with ReLU Fix : PReLU test concistency check PReLU tests in-place computation, and it failed in GPU Fix: PReLU in-place backward in GPU PReLULayer called an incorrect API for copying data (caffe_gpu_memcpy). First argment of `caffe_gpu_memcpy` should be size of memory region in byte. I modified to use `caffe_copy` function. Fix: style errors Fix: number of axes of input blob must be >= 2 Use 1D blob, zero-D blob. Rename: hw -> dim
Commit: | 6ea7a66 | |
---|---|---|
Author: | max argus | |
Committer: | Jeff Donahue |
AccuracyLayer: add ignore_label param
Commit: | 7a40f74 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Fixup AccuracyLayer like SoftmaxLossLayer in #1970 -- fixes #2063
Commit: | 7462c84 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
DummyDataLayer outputs blobs of arbitrary shape
Commit: | 8afdcd0 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
ConcatLayer: generalized Blob axes
Commit: | b868916 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
SliceLayer: generalized Blob axes
Commit: | abec302 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
SoftmaxLayer: generalized Blob axes
Commit: | 5407f82 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add BlobShape message; use for Net input shapes
Commit: | 29581e6 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
InnerProductLayer can multiply along any axis
Commit: | 1434e87 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Blobs are ND arrays (for N not necessarily equals 4). vector<int> shape_ instead of (num, channels, height, width).
Commit: | 9114424 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #1910 from philkr/encoded add force_encoded_color flag to the data layer and warn about mixed encoding
Commit: | a2f7f47 | |
---|---|---|
Author: | philkr | |
Committer: | philkr |
added a force_encoded_color flag to the data layer. Printing a warning if images of different channel dimensions are encoded together
Commit: | 6eb0931 | |
---|---|---|
Author: | Evan Shelhamer | |
Committer: | Evan Shelhamer |
give phase to Net and Layer Give the responsibility for phase to Net and Layer, making phase an immutable choice at instantiation and dropping it from the Caffe singleton.
Commit: | d94f107 | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
[pycaffe] allow Layer to be extended from Python This is done by adding PythonLayer as a boost::python HeldType.
Commit: | f38ddef | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Add gradient clipping -- limit L2 norm of parameter gradients
Commit: | 6a22697 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
fix for layer-type-str: loss_param and DECONVOLUTION type should have been included in V1LayerParameter, get upgraded
Commit: | 62d1d3a | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
get rid of NetParameterPrettyPrint as layer is now after inputs (whoohoo)
Commit: | 2e6a82c | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
automagic upgrade for v1->v2
Commit: | 11a4c16 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
start layer parameter field IDs at 100 (always want them printed at the end, and want to allow more fields to be added in the future, so reserve fields 10-99 for that purpose)
Commit: | af37eac | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
'layers' -> 'layer'
Commit: | bb5ba1b | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
restore upgrade_proto
Commit: | 78b02e5 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
add message ParamSpec to replace param name, blobs_lr, weight_decay, ...
Commit: | 3b13846 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Layer type is a string
Commit: | 9767b99 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #1615 from longjon/deconv-layer Add deconvolution layer with refactoring of convolution layer to share code
Commit: | cff3007 | |
---|---|---|
Author: | Evan Shelhamer |
Merge pull request #1654 from longjon/softmax-missing-values Add missing value support to SoftmaxLossLayer
Commit: | 3519d05 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
debug_info in NetParameter so it can be enabled outside training
Commit: | 1304173 | |
---|---|---|
Author: | Jeff Donahue | |
Committer: | Jeff Donahue |
Make comments for sparse GaussianFiller match actual behavior (Fixes #1497 reported by @denizyuret)
Commit: | 3617352 | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
add DeconvolutionLayer, using BaseConvolutionLayer
Commit: | 34321e4 | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
add spatial normalization option to SoftmaxLossLayer With missing values (and batches of varying spatial dimension), normalizing each batch across instances can inappropriately give different instances different weights, so we give the option of simply normalizing by the batch size instead.
Commit: | 5843b52 | |
---|---|---|
Author: | Jonathan L Long | |
Committer: | Jonathan L Long |
add missing value support to SoftmaxLossLayer
Commit: | 18749f8 | |
---|---|---|
Author: | Sergio | |
Committer: | Sergio |
Added Multistep, Poly and Sigmoid learning rate decay policies Conflicts: include/caffe/solver.hpp src/caffe/proto/caffe.proto src/caffe/solver.cpp