Get desktop application:
View/edit binary Protocol Buffers messages
Complete specification for a single task.
Used in:
Name for this component: this is used in linked features via the "source_component" field.
TransitionSystem to use.
Resources that this component depends on. These are copied to TaskInputs when calling SAFT code.
Feature space configurations.
Neural Network builder specification.
The registered C++ implementation of the dragnn::Component class; e.g. "SyntaxNetComponent".
Number of possible actions from every state.
Specify the name of the lower level component on which it has attention.
Options for the ComponentBuilder. If this is empty, the regular tf.while_loop based builder is assumed.
Default max number of active states for beam training.
Default max number of active states for beam inference.
The trace for a single step of a single Component.
Used in:
A caption/description to describe this step. This should fit in a graphical node rendered to the screen.
An *HTML-language* representation of the current state.
The scores for each potential decision. (The mapping from index to name is managed by the component.)
Set to true once the step is finished. (This allows us to open a step after each transition, without having to know if it will be used.)
The traces for all steps for a single Component.
Used in:
Name of the component; should match the ComponentSpec.
The steps that have been taken by this Component.
Main proto being used to trace parsing.
For each sentence, there is a sequence of state sets storing tracing information.
Specification for a feature channel that is a *fixed* function of the input. NEXT_ID: 10
Used in:
Interpretable name for this feature channel. NN builders might depend on this to determine how to hook different channels up internally.
String describing the FML for this feature channel.
Dimensions of embedding space, or -1 if the feature should not be embedded.
No. of possible values returned.
No. of different feature templates in the channel, i.e. the # of features that will be concatenated but share the embedding for this channel.
Whether the embeddings for this channel should be held constant at their pretrained values, instead of being trained. Pretrained embeddings are required when true.
Predicate map for compacting feature values.
Pointer to a pretrained embedding matrix for this feature set.
Vocab file, containing all vocabulary words one per line.
Describes single embedding "group", e.g., 'words', 'tags'. Each group shares an embedding space.
Used in:
string-valued name of the group, e.g., 'words'.
The feature functions active in this embedding group.
A fixed sparse bag of features in DRAGNN. The id, weight, and description fields are all aligned if present (ie, any of these that are non-empty should have the same # items). If weight is omitted, 1.0 is used. These features as interepreted as multiple firings of a single feature template: e.g., for a single focus word, a bag of ngrams.
Used in:
string-valued description of each *feature value*. (Only used for debugging.)
string-valued name of feature. (Only used for debugging.)
A hyperparameter configuration for a training run. NEXT ID: 22
Used in: ,
Global learning rate initialization point.
Momentum coefficient when using MomentumOptimizer.
Decay rate and base for global learning rate decay. The learning rate is reduced by a factor of |decay_base| every |decay_steps|.
Whether to decay the learning rate in a "staircase" manner. If true, the rate is adjusted exactly once every |decay_steps|. Otherwise, the rate is adjusted in smaller increments on every step, such that the overall rate of decay is still |decay_base| every |decay_steps|.
Random seed to initialize parameters.
Specify the optimizer used in training, the default is MomentumOptimizer.
Whether or not to use a moving average of the weights in inference time.
Rolling average update co-efficient.
The dropout *keep* probability rate used in the model. 1.0 = no dropout.
The dropout *keep* probability rate for recurrent connections. If < 0.0, recurrent connections should use |dropout_rate| instead. 1.0 = no dropout.
Gradient clipping threshold, applied if greater than zero. A value in the range 1-20 seems to work well to prevent large learning rates from causing problems for updates at the start of training.
Parameters for Adam training.
Coefficient for global L2 regularization.
Coefficient for global self normalization regularization. A value of zero turns it off.
Comma separated list of components to which self_norm_alpha should be restricted. If left empty, no filtering will take place. Typically a single component.
A spec for using multiple optimization methods.
Used in:
First optimizer.
Second optimizer.
After this number of steps, switch from first to second.
A feature in DRAGNN thats link a component to another or a component to itself recurrently. If batch_idx or beam_idx are omitted, 0 is used.
Used in:
Index into the {step x batch x beam} activations workspace generated by the previous computation.
Values in the original feature space. This is ignored in TensorFlow.
string-valued name of feature. (Only used for debugging.)
Specification for a feature channel that *links* to component activations. Note that the "vocabulary" of these features is the activations that they are linked to, so it is determined by the other components in the spec.
Used in:
Interpretable name for this feature channel. NN builders might depend on this to determine how to hook different channels up internally.
Feature function specification. Note: these should all be of type LinkedFeatureType.
Embedding dimension, or -1 if the link should not be embedded.
No. of different feature templates in the channel, i.e. the # of features that will be concatenated but share the embedding for this channel.
Component to use for translation, e.g. "tagger"
Translator target, e.g. "token" or "last_action", to translate raw feature values into indices. This must be interpretable by the Component referenced by source_component.
Layer that these features should connect to.
Trace for an entire linked feature channel.
Used in:
Name of the embedding space.
The component that this feature links to.
The string-valued name of the translator function that maps a feature value to a step index.
The name of the layer that we are extracting from the identified step.
Individual features within this group.
Proto to specify a set of DRAGNN components (transition systems) that are trained and evaluated jointly. Each component gets one ComponentSpec. The order of component is important: a component can only link to components that come before (for now.) NEXT ID: 6
Whether to extract debug traces.
The traces for all Components.
Used in:
The Parts here should be more or less compatible with TaskInput.
Used in:
Super generic container for any registered sub-piece of DRAGNN.
Used in:
Name of the registered class.
Parameters to set while initializing this system; these are copied to Parameters in a TaskSpec when calling SAFT code, or via kwargs in TF Python code.
Fixed resources that will be converted into TaskInput's when calling SAFT code.
Used in: ,
Training target to be built into the graph.
Used in:
Name for this target. This should be unique across all targets.
Specify the weights for different components. This should be the same size as the number of components in the spec, or empty (defaults to equal weights). Weights are normalized across the components being trained to sum to one.
Specify whether to train a component using supervised signal or not. This should be the same size as the number of components in the spec, or empty (defaults to all true).
Maximum length of the pipeline to train. E.g. if max_index is 1, then only the first component will be trained via this target.
A vector of hyperparameter configurations to search over.
Grid points to search over.
Training targets to create in the graph builder stage.