Get desktop application:
View/edit binary Protocol Buffers messages
Used in:
Group norm parameters.
Dropout features.
Dropout rate.
Path to the dataset.
Dataset name.
Save folder.
Checkpoint folder (if different from save folder).
Split name for train.
Split name for validation.
Split name for test.
Split name for few-shot training.
Split name for few-shot validationn.
Split name for few-shot test.
Image resize size.
Image crop size.
Whether perform random crop.
Whether perform random flip.
Whether perform random color.
Whether perform random rotate.
Mean pixel values for each channel.
Std pixel values for each channel.
Base data sampler. Choice of `fewshot`, `incremental`, `constant_prob`, `crp`.
Number of classes
Number of query examples per class.
Maximum shots for incremental sampler.
Maximum number of support images per class.
Whether an episode is supervised or not.
Target label ratio for semisupervised episodes.
Maximum episode length.
Whether to allow repeating images in an episode.
Whether to enable query set in an episode.
Whether to fix the unknown class ID.
Whether making it a hierarchical sampler.
Whether to use dataset class hierarchy.
Number of sequential stages for hierarchical sampler.
Blender for blending multiple episodes.
Blur blender window size.
Blur blender stride size
Number of blur operation runs.
Markov switching process probability.
Whether to mix class hierarchy and non class hierarchy.
Whether to use the same family for different context.
Whether to shuffle temporal ordering.
Whether to use new class hierarchy.
New class parameter for constant prob sampler.
Alpha parameter for CRP sampler.
Theta parameter for CRP sampler.
Number of queries for distractor images.
Number of shots for distractor images.
Number of classes for distractor images.
Add random box occluder.
Add random background. Choice of `none`, `uppsala`, `uppsala_double`.
Whether to randomly drop the background hue.
When random apply is true, use this probability to sample.
When random apply is true, this will randomly choose a background and ignore the stage iD.
Add gaussian noise to the random background.
CNN backbone.
Class name of model.
Data type.
Total number of output classes.
Number of training examples.
Memory class.
Whether or not to freeze backbone.
Whether to set a different learning rate for the backbone.
Set a learning rate multiplier for the backbone.
Whether to have a fixed unknown class ID.
Multiplier for the unknown class loss.
Whether or not to renormalize the logits.
Number of episodes for evaluation.
Whether to feed in stage ID.
Whether perform in stage training / testing.
Whether to perform ROI pooling with mask attention.
Sub memory class.
Sub memory class 2.
Whether to store SSL with a schedule.
Disable binary xent for self-predicted unknowns.
"softmax" or "sigmoid".
4-layer CNN config.
ResNet config.
ProtoNet config.
Memory network config.
LSTM memory module config.
Memory augmented module config.
OML config.
Hybrid model config.
Optimizer config.
Training config.
Used in:
New ablation options:
Used in:
Used in:
Used in:
Maximum number of classes.
Initialization for radius.
Not currently used.
For example based storage, maximum number of items.
Not currently used.
Not currently used.
Not currently used.
Not currently used.
Not currently used.
Similarity function.
Initial value of the beta write.
Choice of "radii" or "max"
For GRU based models.
For GRU based models, initialize forget gate.
Whether to normalize feature prior to read/write.
Going to be coupled with hybrid_config.
Going to be coupled with main config.
ID for unknown.
Used in:
Number of channels for each layer.
Number of output classes.
Inner loop learning rate.
Inner loop gradient truncation.
Inner loop gradient truncation.
Inner loop loss, "softmax" or "sigmoid" or "mix".
How to compute unknown logits, "sum", "max", or "radii".
How to compute unknown logits, "softmax" or "sigmoid".
Select active classes.
Learn initial weight.
Use cosine.
Use bias.
Whether to run semi-supervised.
Used in:
Optimizer name.
Learning rate decay values.
Learning rate decay steps.
Total number of training steps.
Training batch size.
Training number of GPUs.
T-BPTT
Scale learning rate by the number of GPUs.
Used in:
Similarity function, euclidean or cosine.
Whether to use cosine softmax in task A.
Whether to add a learnable scaling factor in task A.
Whether to use cosine softmax in task B.
Whether to add a learnable scaling factor in task B
Whether to use cosine attention mechanism.
Whether to gate the prototypes with learned parameters.
Whether to reinit tau variable for cosine A.
Whether to freeze backbone network weights.
Which scaling module to use.
Static scaling factor on the prototypes.
Used in:
Input image height.
Input image width.
Input image number of channels.
Number of residual units.
Input image number of filters.
Stride for the initial convolution of each resolution stage.
Initial convolution strides.
Whether doing max pooling in the initial convolution.
Number of filters in the initial convolution.
Whether to use bottleneck layer in each residual unit.
Weight decay.
Normalization scheme used in every layer.
Number of groups (used by GroupNorm).
Whether perform global average pooling in the end.
Data format, NCHW or NHWC.
Add leaky relu.
Add the final ReLU to the feature map.
Dropout features.
Dropout rate.
Dropout features.
Dropout rate.
Activation scaling.
Used in:
Number of steps for one validation run.
Number of steps per output printing.
Number of steps per mode checkpointing.
Number of steps to pretrain task A.