Skip to content

Latest commit

 

History

History
175 lines (116 loc) · 5.7 KB

CHANGELOG.md

File metadata and controls

175 lines (116 loc) · 5.7 KB

Changelog

[unreleased][unreleased]

Changed

  • The Retrieval task now accepts a list of factorized metrics instead of a single optional metric.

[0.7.2][2022-09-28]

  • Improved support for using TPUEmbedding under parameter server strategy.

[0.7.0][2022-07-07]

A number of changes to make factorized top-K metric computation more accurate and less prone to user error.

Changed

  • tfrs.layers.embedding.TPUEmbedding now supports input features with dynamic shape. batch_size argument is deprecated and no longer required.

  • tfrs.layers.embedding.TPUEmbedding now supports running on different versions of TPU.

  • Pinned TensorFlow to >= 2.9.0 which works with Scann 1.2.7.

  • tfrs.tasks.Ranking.call now accepts a compute_batch_metrics argument to allow switching off batch metric computation. Following this change, 'compute_metrics'argument does not impact computation of batch metrics.

Breaking changes

  • tfrs.metrics.FactorizedTopK requires the candidate ids for positive candidates to be supplied when using approximate top-K sources. Each top-K layer now has an exact method to broadcast its ability to return exact or approximate top-K results.
  • Removed metrics constructor parameter for tfrs.metrics.FactorizedTopK. FactorizedTopK only makes sense with top-k metrics, and this change enforces this.
  • Replaced the k constructor argument in tfrs.metrics.FactorizedTopK with ks: a list of k values at which to compute the top k metric.

Changed

  • The tfrs.metrics.FactorizedTopK metric can now compute candidate-id based metrics when given the true_candidate_ids argument in its call method.

Added

  • The Retrieval task now also accepts a loss_metrics argument.

[0.6.0][2021-08-23]

Changed

  • Pinned TensorFlow to >= 2.6.0, which works with Scann 1.2.3.

Breaking changes

  • TopK layer indexing API changed. Indexing with datasets is now done via the index_from_dataset method. This change reduces the possibility of misaligning embeddings and candidate identifiers when indexing via indeterministic datasets.

[0.5.2][2021-07-15]

Fixed

  • Fixed error in default arguments to tfrs.experimental.models.Ranking (tensorflow#311).
  • Fix TPUEmbedding layer to use named parameters.

Added

  • Added batch_metrics to tfrs.tasks.Retrieval for measuring how good the model is at picking out the true candidate for a query from other candidates in the batch.
  • Added tfrs.experimental.layers.embedding.PartialTPUEmbedding layer, which uses tfrs.layers.embedding.TPUEmbedding for large embedding lookups and tf.keras.layers.Embedding for smaller embedding lookups.

[0.5.1][2021-05-14]

Changed

  • Supplying incompatibly-shaped candidates and identifiers inputs to factorized_top_k layers will now raise (to prevent issues similar to tensorflow#286).

[0.5.0][2021-05-06]

Changed

  • Fixed the bug in tfrs.layers.loss.SamplingProbablityCorrection that logits should subtract the log of item probability.
  • tfrs.experimental.optimizers.CompositeOptimizer: an optimizer that composes multiple individual optimizers which can be applied to different subsets of the model's variables.
  • tfrs.layers.dcn.Cross and DotInteraction layers have been moved to tfrs.layers.feature_interaction package.

Added

  • tfrs.experimental.models.Ranking, an experimental pre-built model for ranking tasks. Can be used as DLRM like model with Dot Product feature interaction or DCN like model with Cross layer.

[0.4.0][2021-01-20]

Added

  • TopK layers now come with a query_with_exclusions method, allowing certain candidates to be excluded from top-k retrieval.
  • TPUEmbedding Keras layer for accelerating embedding lookups for large tables with TPU.

Changed

  • factorized_top_k.Streaming layer now accepts a query model, like other factorized_top_k layers.

  • Updated ScaNN to 1.2.0, which requires TensorFlow 2.4.x. When not using ScaNN, any TF >= 2.3 is still supported.

[0.3.2][2020-12-22]

Changed

  • Pinned TensorFlow to >= 2.3 when ScaNN is not being installed. When ScaNN is being installed, we pin on >= 2.3, < 2.4. This allows users to use TFRS on TF 2.4 when they are not using ScaNN.

[0.3.1][2020-12-22]

Changed

  • Pinned TensorFlow to 2.3.x and ScaNN to 1.1.1 to ensure TF and ScaNN versions are in lockstep.

[0.3.0][2020-11-18]

Added

  • Deep cross networks: efficient ways of learning feature interactions.
  • ScaNN integration: efficient approximate maximum inner product search for fast retrieval.

[0.2.0][2020-10-15]

Added

  • tfrs.tasks.Ranking.call now accepts a compute_metrics argument to allow switching off metric computation.
  • tfrs.tasks.Ranking now accepts label and prediction metrics.
  • Add metrics setter/getters on tfrs.tasks.Retrieval.

Breaking changes

  • Corpus retrieval metrics and layers have been reworked.

    tfrs.layers.corpus.DatasetTopk has been removed, tfrs.layers.corpus.DatasetIndexedTopK renamed to tfrs.layers.factorized_top_k.Streaming, tfrs.layers.ann.BruteForce renamed to tfrs.layers.factorized_top_k.BruteForce. All top-k retrieval layers (BruteForce, Streaming) now follow a common interface.

Changed

  • Dataset parallelism enabled by default in DatasetTopK and DatasetIndexedTopK layers, bringing over 2x speed-ups to evaluations workloads.
  • evaluate_metrics argument to tfrs.tasks.Retrieval.call renamed to compute_metrics.