Get desktop application:
View/edit binary Protocol Buffers messages
Redap's public API.
--- Catalog --- Mostly a 1:1 proxy at the moment.
Register new partitions with the Dataset
Unimplemented.
Returns the schema of the partition table (i.e. the dataset manifest) itself, *not* the underlying dataset. * To inspect the data of the partition table, use `ScanPartitionTable`. * To retrieve the schema of the underlying dataset, use `GetDatasetSchema` instead.
Inspect the contents of the partition table (i.e. the dataset manifest). The returned data will follow the schema specified by `GetPartitionTableSchema`.
Returns the schema of the dataset. This is the union of all the schemas from all the underlying partitions. It will contain all the indexes, entities and components present in the dataset.
Creates a custom index for a specific column (vector search, full-text search, etc).
List of specific partitions that will be indexed (all if left empty).
Specify behavior when index for a partition was already created.
Recreate an index with the same configuration but (potentially) new data.
Search a previously created index.
Index column that is queried
Query data - type of data is index specific. Caller must ensure to provide the right type. For vector search this should be a vector of appropriate size, for inverted index this should be a string. Query data is represented as a unit (single row) RecordBatch with 1 column.
Index type specific properties
Scan parameters
Perform Rerun-native queries on a dataset, returning the matching chunk IDs. These Rerun-native queries include: * Filtering by specific partition and chunk IDs. * Latest-at, range and dataframe queries. * Arbitrary Lance filters. To fetch the actual chunks themselves, see `GetChunks`. Passing chunk IDs to this method effectively acts as a IF_EXIST filter.
Client can specify what partitions are queried. If left unspecified (empty list), all partitions will be queried.
Client can specify specific chunk ids to include. If left unspecified (empty list), all chunks that match other query parameters will be included.
Which entity paths are we interested in? Leave empty to query all of them.
Generic parameters that will influence the behavior of the Lance scanner.
Perform Rerun-native queries on a dataset, returning the underlying chunks. These Rerun-native queries include: * Filtering by specific partition and chunk IDs. * Latest-at, range and dataframe queries. * Arbitrary Lance filters. To fetch only the actual chunk IDs rather than the chunks themselves, see `QueryDataset`.
Client can specify from which partitions to get chunks. If left unspecified (empty list), data from all partition (that match other query parameters) will be included.
Client can specify chunk ids to include. If left unspecified (empty list), all chunks (that match other query parameters) will be included.
Which entity paths are we interested in? Leave empty to query all of them.
Query details
TODO(jleibs): support ScanParameters iff we can plumb them into Datafusion TableProvider Otherwise, just wait for Arrow Flight rerun.common.v1alpha1.ScanParameters scan_parameters = 2;
--- Tasks service --- Query the status of submitted tasks
Fetch the output of a completed task
Query the status of submitted tasks as soon as they are no longer pending