Proto commits in google/yggdrasil-decision-forests

These commits are when the Protocol Buffers files have changed: (only the last 100 relevant commits are shown)

Commit:1d120b9
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add total_max_num_nodes_hyperparameter to C++ PiperOrigin-RevId: 756751520

The documentation is generated from this commit.

Commit:2e669f4
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Expose the sum of abs values of the shape values as a measure of variable importance. PiperOrigin-RevId: 756641315

Commit:6b84b4a
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Allow custom fields in model metadata PiperOrigin-RevId: 755822672

Commit:b9e968c
Author:Richard Stotz
Committer:Copybara-Service

Optimize handling of indicator labels in NDCG PiperOrigin-RevId: 745955921

Commit:4ddbf46
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Clarify NDCG implementation Fix typos, add comments, bring notation closer to the paper. More meaningful tests that no longer rely on outcome of random values. PiperOrigin-RevId: 745609685

Commit:f71ff3d
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Configure the features and feature orders in the model & prediction analysis. PiperOrigin-RevId: 742296003

Commit:d41c040
Author:Richard Stotz
Committer:Copybara-Service

[YDF] [TF-DF] Fix handling of byte values in CategoricalSpec PiperOrigin-RevId: 733329637

Commit:03f69f4
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Replace a directory move operation (executed by the manager) by multiple file move operations (executed by workers) for distributed GBT training. PiperOrigin-RevId: 730769512

Commit:c94bc47
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Save / Load deep models PiperOrigin-RevId: 728685775

Commit:83220b5
Author:Mathieu Guillame-Bert
Committer:Richard Stotz

Remove CHECK / FATAL from training code. PiperOrigin-RevId: 723457491

Commit:a845b13
Author:Mathieu Guillame-Bert
Committer:Richard Stotz

Fix OSS build. PiperOrigin-RevId: 720170531

Commit:750504d
Author:Mathieu Guillame-Bert
Committer:Richard Stotz

mplement Anchor forests (Part 2) PiperOrigin-RevId: 715752402

Commit:e539028
Author:Richard Stotz
Committer:Richard Stotz

[YDF] Add PDPs for deep models PiperOrigin-RevId: 725207520

Commit:b98a119
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add PDPs for deep models PiperOrigin-RevId: 725207520

Commit:a7fde2a
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Remove CHECK / FATAL from training code. PiperOrigin-RevId: 723457491

Commit:b89f193
Author:Mathieu Guillame-Bert
Committer:Richard Stotz

Fix OSS build. PiperOrigin-RevId: 720170531

Commit:e17668b
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

mplement Anchor forests (Part 2) PiperOrigin-RevId: 715752402

Commit:ca346cb
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add sparse_oblique_max_num_projections hyperparameter PiperOrigin-RevId: 701891284

Commit:fd34b13
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Oblique: Add integer weights PiperOrigin-RevId: 701890429

Commit:bd66459
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Oblique: Add power of two weights PiperOrigin-RevId: 701863564

Commit:e7c0eb7
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Deprecate SparseObliqueSplit.binary_weights hyperparameter In favor of `SparseObliqueSplit.weights`. PiperOrigin-RevId: 701854322

Commit:82dc873
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add standalone evaluation PiperOrigin-RevId: 700701433

Commit:a4cccb6
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Fix regression / ranking CEP PiperOrigin-RevId: 700678738

Commit:075e567
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Feature selection (part 2) PiperOrigin-RevId: 699129506

Commit:2e16cb8
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Implement Anchor forests (slow condition eval; p1) PiperOrigin-RevId: 698739620

Commit:52c6ae0
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for sequence vector features PiperOrigin-RevId: 688968619

Commit:e315892
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Remove TF from YDF build process. PiperOrigin-RevId: 688884179

Commit:1c59e9d
Author:Mathieu Guillame-Bert
Committer:Richard Stotz

Read avro files without dependencies (part 7; last one!) PiperOrigin-RevId: 684390907

Commit:9ae9347
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Read avro files without dependencies (part 7; last one!) PiperOrigin-RevId: 684390907

Commit:a95ab83
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Enable distributed training for Ranking GBT. PiperOrigin-RevId: 676466613

Commit:a7c8db2
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Store loss options in the model PiperOrigin-RevId: 676427940

Commit:7dafafd
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Move loss options definition Old: learner/gradient_boosted_trees/gradient_boosted_trees.proto New: model/gradient_boosted_trees/gradient_boosted_trees.proto PiperOrigin-RevId: 676396458

Commit:0d66aba
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Fix Hyperparameter definition of NDCG truncation This change adds a few fixes to the NDCG truncation - Add a Python test that the learner correctly truncates - Simplify the proto by re-using the existing LambdMart options proto - Set the different ranking options as mutually exclusive hyperparameters - Fix the definition of the truncation hyperparameters as integers PiperOrigin-RevId: 676387366

Commit:ec48a7b
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add parameter to control the maximum duration of the model analysis. Default to 10 seconds. PiperOrigin-RevId: 675552053

Commit:8b04210
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Allow configuring the truncation of NDCG losses PiperOrigin-RevId: 675173198

Commit:d5630e6
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Rename LAMBDA_MART_NDCG5 to LAMBDA_MART_NDCG We want to make the truncation parameter configurable. Renaming the loss is the first step. PiperOrigin-RevId: 675162089

Commit:2c77d15
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Multi-threaded model prediction and evaluation PiperOrigin-RevId: 675092823

Commit:0b87800
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for compressed tfrecords without tf dependencies. PiperOrigin-RevId: 673388515

Commit:b080eab
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Allow prediction and evaluation with slow engine in Python PiperOrigin-RevId: 671272015

Commit:579c3b8
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Clean up PYDF learner parameters PYDF exposes some parameters on every learner that are only supported on some of them. This change cleans up the lists of hyperparameters. This change also fixes some of the documentation of the learners. PiperOrigin-RevId: 668420630

Commit:00f1c20
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add anomaly detection quality tests PiperOrigin-RevId: 665378493

Commit:cb01a4d
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Detect user-supplied hyperparameters, Also fixes hyperparameter handling in Isolation Forests PiperOrigin-RevId: 655910265

Commit:4a01e16
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Large amount of clean-up and refactors in preparation to layer-wise learning PiperOrigin-RevId: 653984280

Commit:86f7edf
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Anomaly detection; Enable model and prediction analysis of isolation forest models (part 6) PiperOrigin-RevId: 644041112

Commit:887b122
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Anomaly detection; Isolation forest c++ learner (part 4) PiperOrigin-RevId: 643984861

Commit:933e38d
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Anomaly detection; Isolation forest c++ model (part 3) PiperOrigin-RevId: 643984185

Commit:f319589
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Anomaly detection; ccleaner + decision tree utilities (part 2) PiperOrigin-RevId: 642919627

Commit:33c15ad
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Anomaly detection; Create the ANOMALY_DETECTION task (part 1) PiperOrigin-RevId: 642918719

Commit:2ec3887
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Internal representation of a forest before beeing converted to Jax code PiperOrigin-RevId: 627328319

Commit:d6c9ce3
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Store the training dataset feature dtypes and use it when exporting a model to TensorFlow. PiperOrigin-RevId: 624114118

Commit:68053b9
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Expose max_num_projections hyperparameter Also improve documentation of hyperparameters for oblique splits, in particular fixing the incorrect documentation for the density parameter PiperOrigin-RevId: 615017123

Commit:f1ea1e5
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for model serialization / unserialization to std::string in the C++ API. PiperOrigin-RevId: 610774439

Commit:50a3f02
Author:Richard Stotz
Committer:Copybara-Service

Prepare release of TF-DF 1.9.0rc0 and YDF 1.9.0rc0 PiperOrigin-RevId: 610665881

Commit:dfe9124
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

No change PiperOrigin-RevId: 602751469

Commit:e69b925
Author:Richard Stotz
Committer:Copybara-Service

[PYDF] Fix TF-DF model loading when using categorical columns PiperOrigin-RevId: 599743316

Commit:65316b7
Author:Richard Stotz
Committer:Copybara-Service

[PYDF] Fix confusion table. Checked that the result now matches the CLI output. PiperOrigin-RevId: 589119576

Commit:639861a
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add actionnable error when training with monotonic constraints and keep_non_leaf_label_distribution=false. PiperOrigin-RevId: 588829949

Commit:46b7cbb
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Remove unused proto import PiperOrigin-RevId: 586238575

Commit:4639143
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Add allow_tokenization_for_inference_as_categorical_set to dataspec guide PiperOrigin-RevId: 584879316

Commit:ead1524
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Prediction analysis for categorical and boolean features, and all supported task types. PiperOrigin-RevId: 584652426

Commit:d52fd0f
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for prediction analysis of classification and regression models with numerical features. PiperOrigin-RevId: 584651846

Commit:8e94e51
Author:Richard Stotz
Committer:Copybara-Service

[PYDF] Expose initial predictions on GBT PiperOrigin-RevId: 583056952

Commit:00a83a3
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Can compile YDF with support for TF.Example and TFRecord without depending on TensorFlow. PiperOrigin-RevId: 582370567

Commit:31487d9
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Write TFRecord without TF dependency. PiperOrigin-RevId: 581261801

Commit:91f63f2
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Support reading TFRecord without TF dependency. PiperOrigin-RevId: 581187755

Commit:9519de1
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Remove old snapshots automatically PiperOrigin-RevId: 578441359

Commit:9c32afb
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Support for MAE loss for GBT (part2) PiperOrigin-RevId: 571341495

Commit:9dda105
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for mean-average-error (MAE) metric. PiperOrigin-RevId: 571267619

Commit:b4f039f
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Display one variable importance at a time in the report. PiperOrigin-RevId: 571042367

Commit:0206955
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add tabs in model analysis report. PiperOrigin-RevId: 570941191

Commit:4e61fbd
Author:Copybara-Service

Merge pull request #64 from achoum:main PiperOrigin-RevId: 569438067

Commit:0755057
Author:Mathieu Guillame-Bert

wip

Commit:39bebc1
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for monotonic constraints. PiperOrigin-RevId: 569096752

Commit:04246c2
Author:Richard Stotz
Committer:Copybara-Service

Fix typos PiperOrigin-RevId: 568168727

Commit:a2fee47
Author:TensorFlow Decision Forests Team
Committer:Copybara-Service

Internal Code Change PiperOrigin-RevId: 565013859

Commit:d174668
Author:Richard Stotz

Merge pull request #56 from achoum:gbm_fix_win PiperOrigin-RevId: 561650237

Commit:5c09134
Author:Richard Stotz
Committer:Copybara-Service

Improve documentation PiperOrigin-RevId: 552440631

Commit:02bcb97
Author:Richard Stotz
Committer:Copybara-Service

[YDF] Fix error messages for uplifting - Make it clear GBTs are not supported for uplifting - Make it clear non-binary treatments are not supported - Fix a crash when using invalid uplifting options PiperOrigin-RevId: 544369877

Commit:dd29886
Author:TensorFlow Decision Forests Team
Committer:Copybara-Service

Change exported java classname to avoid name conflicts with ../model/decision_tree/decision_tree.proto PiperOrigin-RevId: 543418503

Commit:8cd50ec
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add support for MHLD oblique decision trees. PiperOrigin-RevId: 539999166

Commit:9a55c13
Author:TensorFlow Decision Forests Team
Committer:Copybara-Service

Extend YDF benchmark to support customized metrics. PiperOrigin-RevId: 531530838

Commit:d88c6c4
Author:Richard Stotz
Committer:Copybara-Service

Add Poisson Log Loss (5/n): Integration test, overall wiring PiperOrigin-RevId: 522080116

Commit:456420b
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Multi-task learning where the output of models trained as "secondary" as used as input for the models trained as "primary". PiperOrigin-RevId: 520289672

Commit:3e177db
Author:TensorFlow Decision Forests Team
Committer:Copybara-Service

Add a customized metrics field to uplift message. PiperOrigin-RevId: 516295646

Commit:ee2f142
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Example of manual hyper-parameter swipe using the c++ API. PiperOrigin-RevId: 512618472

Commit:015c6cb
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Distributed training with YDF GRPC distributed server PiperOrigin-RevId: 495555577

Commit:7cd724d
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Allow for the user to override the most frequent item / global imputation item. PiperOrigin-RevId: 493262639

Commit:b8b0621
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Improve the default HP tuning space of GBT and RF, and makes default HP spaces available in the OSS version of the hyper-parameter optimizer (both in YDF and in TF-DF). PiperOrigin-RevId: 492971812

Commit:f346e0b
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Internal chang. PiperOrigin-RevId: 487952192

Commit:1702269
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Open source the analyze_model_and_dataset tool (experimental). PiperOrigin-RevId: 487907328

Commit:b61710f
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Multitask learner. Part 1. PiperOrigin-RevId: 487346981

Commit:aed0492
Author:Richard Stotz
Committer:Copybara-Service

Add early_stopping_initial_iteration to the list of generic hyperparameters. PiperOrigin-RevId: 487267030

Commit:d157b6e
Author:Richard Stotz
Committer:Copybara-Service

Add confusion table on the training dataset to the GBT (Binomial log loss) training logs. PiperOrigin-RevId: 486656934

Commit:b04bdff
Author:Richard Stotz
Committer:Copybara-Service

Add option not to tokenize strings PiperOrigin-RevId: 483760397

Commit:7e0ca81
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

The address/port of GRPC workers can be changed without stopping the full pipeline. PiperOrigin-RevId: 482829391

Commit:8f0fab9
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Remove the need for a shared file-system to use GRPC workers. PiperOrigin-RevId: 482827958

Commit:da1b8a8
Author:Richard Stotz
Committer:Copybara-Service

Cleanup: More early stopping into its own folder PiperOrigin-RevId: 481615749

Commit:300b710
Author:Richard Stotz
Committer:Copybara-Service

Add Warning for too early stopping PiperOrigin-RevId: 481611268

Commit:df24e21
Author:Richard Stotz
Committer:Copybara-Service

Fix typo PiperOrigin-RevId: 474263913

Commit:b51cac6
Author:Mathieu Guillame-Bert
Committer:Copybara-Service

Add customization of the number of IO threads. In YDF, add `num_io_threads` in the deployment proto. In TF-DF, add a num_io_threads argument to the `fit_on_dataset_path` function. PiperOrigin-RevId: 472656462