activation_celu()
activation_glu()
activation_hard_shrink()
activation_hard_tanh()
activation_log_sigmoid()
activation_soft_shrink()
activation_squareplus()
activation_tanh_shrink()
config_disable_flash_attention()
config_enable_flash_attention()
config_is_flash_attention_enabled()
initializer_stft()
layer_max_num_bounding_boxes()
layer_stft_spectrogram()
loss_circle()
metric_concordance_correlation()
metric_pearson_correlation()
op_celu()
op_exp2()
op_glu()
op_hard_shrink()
op_hard_tanh()
op_ifft2()
op_inner()
op_soft_shrink()
op_squareplus()
op_tanh_shrink()
callback_backup_and_restore()
: Addeddouble_checkpoint
argument to save a fallback checkpointcallback_tensorboard()
: Added support forprofile_batch
argumentlayer_group_query_attention()
: Addedflash_attention
andseed
argumentslayer_multi_head_attention()
: Addedflash_attention
argumentmetric_sparse_top_k_categorical_accuracy()
: Addedfrom_sorted_ids
argument
- Added native Flash Attention support for GPU (via cuDNN) and TPU (via Pallas kernel) in JAX backend
- Added opt-in native Flash Attention support for GPU in PyTorch backend
- Enabled additional kernel fusion via bias_add in TensorFlow backend
- Added support for Intel XPU devices in PyTorch backend
install_keras()
changes: if a GPU is available, the default is now to install a CPU build of TensorFlow and a GPU build of JAX. To use a GPU in the current session, calluse_backend("jax")
.
- When using
get_file()
withextract = TRUE
oruntar = TRUE
, the return value is now the path of the extracted directory, rather than the path of the archive.
-
Logging is now asynchronous in
fit()
,evaluate()
, andpredict()
. This enables 100% compact stacking oftrain_step
calls on accelerators (e.g. when running small models on TPU).- If you are using custom callbacks that rely on
on_batch_end
, this will disable async logging. You can re-enable it by addingself$async_safe <- TRUE
to your callbacks. Note that the TensorBoard callback is not considered async-safe by default. Default callbacks like the progress bar are async-safe.
- If you are using custom callbacks that rely on
-
New bitwise operations:
op_bitwise_and()
op_bitwise_invert()
op_bitwise_left_shift()
op_bitwise_not()
op_bitwise_or()
op_bitwise_right_shift()
op_bitwise_xor()
-
New math operations:
op_logdet()
op_trunc()
op_histogram()
-
New neural network operation:
op_dot_product_attention()
-
New image preprocessing layers:
layer_auto_contrast()
layer_solarization()
-
New Model functions
get_state_tree()
andset_state_tree()
, for retrieving all model variables, including trainable, non-trainable, optimizer variables, and metric variables. -
New
layer_pipeline()
for composing a sequence of layers. This class is useful for building a preprocessing pipeline. Compared to akeras_model_sequential()
,layer_pipeline()
has a few key differences:- It's not a Model, just a plain layer.
- When the layers in the pipeline are compatible with
tf.data
, the pipeline will also remaintf.data
compatible, regardless of the backend you use.
-
New argument:
export_savedmodel(verbose = )
-
New argument:
op_normalize(epsilon = )
-
Various documentation improvements and bug fixes.
-
Added compatibility with Keras v3.5.0. User facing changes:
- New functions:
op_associative_scan()
op_searchsorted()
optimizer_lamb()
keras$DTypePolicy
instances can now be supplied todtype
argument for losses, metrics, and layers.- Add integration with the Hugging Face Hub. You can now save models to
Hugging Face Hub directly
save_model()
and load .keras models directly from Hugging Face Hub withload_model()
. - Added compatibility with NumPy 2.0.
- Improved
keras$distribution
API support for very large models. - Bug fixes and performance improvements.
- Add
data_format
argument tolayer_zero_padding_1d()
layer. - Miscellaneous documentation improvements.
- Bug fixes and performance improvements.
- New functions:
-
Fixed issue where GPUs would not be found when running on Windows under WSL Linux. (reported in #1456, fixed in #1459)
-
keras_shape
objects (as returned bykeras3::shape()
) gain==
and!=
methods. -
Fixed warning from
tfruns::training_run()
being unable to log optimizer learning rate. -
Added compatibility with Keras v3.4.1 (no R user facing changes).
-
Added compatibility with Keras v3.4.0. User facing changes:
-
New functions:
op_argpartition()
op_map()
op_scan()
op_switch()
op_dtype()
op_lstsq()
op_image_hsv_to_rgb()
op_image_rgb_to_hsv()
-
Changes:
- Added support for arbitrary, deeply nested input/output structures in Functional models (e.g. lists of lists of lists of inputs or outputs...)
- Add support for
optional
Functional inputs.keras_input()
gains anoptional
argument.keras_model_sequential()
gains ainput_optional
argument.
- Add support for
float8
inference forDense
andEinsumDense
layers. - Enable
layer_feature_space()
to be used in a{tfdatasets}
pipeline even when the backend isn't TensorFlow. layer_string_lookup()
can now taketf$SparseTensor()
as input.layer_string_lookup()
returns"int64"
dtype by default in more modes now.Layer()
instances gain attributespath
andquantization_mode
.Metric()$variables
is now recursive.- Add
training
argument toModel$compute_loss()
. split_dataset()
now supports nested structures in dataset.- All applications gain a
name
argument, accept a custom name. layer_multi_head_attention()
gains aseed
argument.- All losses gain a
dtype
argument. loss_dice()
gains anaxis
argument.op_ctc_decode()
, new default formask_index = 0
- All
op_image_*
functions now use defaultdata_format
value toconfig_image_data_format()
op_isclose()
gains argumentsrtol
,atol
,equal_nan
.save_model()
gains argumentzipped
.- Bugs fixes and performance improvements.
-
-
Chains of
layer_*
calls with|>
now instantiate layers in the same order as%>%
pipe chains: left-hand-side first (#1440). -
iterate()
,iter_next()
andas_iterator()
are now reexported from reticulate.
User facing changes with upstream Keras v3.3.3:
-
new functions:
op_slogdet()
,op_psnr()
-
clone_model()
gains new args:call_function
,recursive
Updated example usage. -
op_ctc_decode()
strategy argument has new default:"greedy"
. Updated docs. -
loss_ctc()
default name fixed, changed to"ctc"
User facing changes with upstream Keras v3.3.2:
-
new function:
op_ctc_decode()
-
new function:
op_eigh()
-
new function:
op_select()
-
new function:
op_vectorize()
-
new function:
op_image_rgb_to_grayscale()
-
new function:
loss_tversky()
-
new args:
layer_resizing(pad_to_aspect_ratio, fill_mode, fill_value)
-
new arg:
layer_embedding(weights)
for providing an initial weights matrix -
new args:
op_nan_to_num(nan, posinf, neginf)
-
new args:
op_image_resize(crop_to_aspect_ratio, pad_to_aspect_ratio, fill_mode, fill_value)
-
new args:
op_argmax(keepdims)
andop_argmin(keepdims)
-
new arg:
clear_session(free_memory)
for clearing without invoking the garbage collector. -
metric_kl_divergence()
andloss_kl_divergence()
clip inputs (y_true
andy_pred
) to the[0, 1]
range. -
new
Layer()
attributes:metrics
,dtype_policy
-
Added initial support for float8 training
-
layer_conv_*d()
layers now support LoRa -
op_digitize()
now supports sparse tensors. -
Models and layers now return owned metrics recursively.
-
Add pickling support for Keras models. (e.g., via
reticulate::py_save_object()
) Note that pickling is not recommended, prefer using Keras saving APIs.
New functions:
-
quantize_weights()
: quantize model or layer weights in-place. Currently, onlyDense
,EinsumDense
, andEmbedding
layers are supported (which is enough to cover the majority of transformers today) -
layer_mel_spectrogram()
-
layer_flax_module_wrapper()
-
layer_jax_model_wrapper()
-
loss_dice()
-
random_beta()
-
random_binomial()
-
config_set_backend()
: change the backend after Keras has initialized. -
config_dtype_policy()
-
config_set_dtype_policy()
-
New Ops
op_custom_gradient()
op_batch_normalization()
op_image_crop()
op_divide_no_nan()
op_normalize()
op_correlate()
- `
-
New family of linear algebra ops
op_cholesky()
op_det()
op_eig()
op_inv()
op_lu_factor()
op_norm()
op_erfinv()
op_solve_triangular()
op_svd()
-
audio_dataset_from_directory()
,image_dataset_from_directory()
andtext_dataset_from_directory()
gain averbose
argument (defaultTRUE
) -
image_dataset_from_directory()
gainspad_to_aspect_ratio
argument (defaultFALSE
) -
to_categorical()
,op_one_hot()
, andfit()
can now accept R factors, offset them to be 0-based (reported in#1055
). -
op_convert_to_numpy()
now returns unconverted NumPy arrays. -
op_array()
andop_convert_to_tensor()
no longer error when casting R doubles to integer types. -
export_savedmodel()
now works with a Jax backend. -
Metric()$add_variable()
method gains arg:aggregration
. -
Layer()$add_weight()
method gains args:autocast
,regularizer
,aggregation
. -
op_bincount()
,op_multi_hot()
,op_one_hot()
, andlayer_category_encoding()
now support sparse tensors. -
op_custom_gradient()
now supports the PyTorch backend -
layer_lstm()
andlayer_gru()
gain arguse_cudnn
, default'auto'
. -
Fixed an issue where
application_preprocess_inputs()
would error if supplied an R array as input. -
Doc improvements.
- The package has been rebuilt for Keras 3.0. Refer to https://blogs.rstudio.com/ai/posts/2024-05-21-keras3/ for an overview and https://keras3.posit.co for the current up-to-date documentation.
-
Default TF version installed by
install_keras()
is now 2.13. -
Updated layers:
layer_batch_normalization()
updated signature, with changes to options for distributed training.layer_embedding()
gains asparse
argument.
-
Fixed deadlock when an R generator was passed to
fit()
,predict()
, and other endpoints. -
When
fit(verbose = "auto")
is evaluated in the context of a knitr document (e.g., quarto or rmarkdown document being rendered), verbose will now default to2
, showing one line per epoch.
-
Update S3 method formals per new CRAN requirement (
r_to_py.keras_layer_wrapper()
) -
Fixed an issue where
get_file()
would place incorrectly save files in the current working directory. (#1365)
-
Default TensorFlow version installed by
install_keras()
is now 2.11. -
All optimizers have been updated for keras/tensorflow version 2.11. Arguments to all the optimizers have changed. To access the previous optimizer implementations, use the constructors available at
keras$optimizers$legacy
. For example, usekeras$optimizers$legacy$Adam()
for the previous implementation ofoptimizer_adam()
. -
New optimizer
optimizer_frtl()
. -
updates to layers:
layer_attention()
gainsscore_mode
anddropout
arguments.layer_discretization()
gainsoutput_mode
andsparse
arguments.layer_gaussian_dropout()
andlayer_gaussian_noise()
gain aseed
argument.layer_hashing()
gainsoutput_mode
andsparse
arguments.layer_integer_lookup()
gainsvocabulary_dtype
andidf_weights
arguments.layer_normalization()
gains aninvert
argument.layer_string_lookup()
gains anidf_weights
argument.
-
Fixed issue where
input_shape
supplied to custom layers defined withnew_layer_class()
would result in an error (#1338) -
New
callback_backup_and_restore()
, for resuming an interruptedfit()
call. -
The merging family of layers (
layer_add
,layer_concatenate
, etc.) gain the ability to accept layers in...
, allowing for easier composition of residual blocks with the pipe%>%
. e.g. something like this now works:block_1_output <- ... block_2_output <- block_1_output %>% layer_conv_2d(64, 3, activation = "relu", padding = "same") %>% layer_add(block_1_output)
-
model$get_config()
method now returns an R object that can be safely serialized to rds. -
keras_array()
now reflects unconverted Python objects. This enables passing objects likepandas.Series()
tofit()
andevaluate()
methods. (#1341)
-
New functions for constructing custom keras subclasses:
new_model_class()
new_layer_class()
new_callback_class()
new_metric_class()
new_loss_class()
new_learning_rate_schedule_class()
.
Also provided is
mark_active()
, a decorator for indicating a class method should be an active binding (i.e., decorated with Python's@property
).mark_active()
can be used in thenew_*_class
family of class constructors as well as%py_class%
. -
r_to_py()
method for R6 classes and%py_class%
gain support forprivate
fields and methods. Any R objects stored inprivate
will only be available to methods, and will not be converted to Python. -
New family of functions for controlling optimizer learning rates during training:
learning_rate_schedule_cosine_decay()
learning_rate_schedule_cosine_decay_restarts()
learning_rate_schedule_exponential_decay()
learning_rate_schedule_inverse_time_decay()
learning_rate_schedule_piecewise_constant_decay()
learning_rate_schedule_polynomial_decay()
Also, a function for constructing custom learning rate schedules:
new_learning_rate_schedule_class()
. -
New L2 unit normilization layer:
layer_unit_normalization()
. -
New
regularizer_orthogonal
, a regularizer that encourages orthogonality between the rows (or columns) or a weight matrix. -
New
zip_lists()
function for transposing lists, optionally matching by name. -
New
plot()
S3 method for models. -
pydot
is now included in the packages installed byinstall_keras()
. -
The
png
package is now listed under Suggests. -
The
%<>%
assignment pipe from magrittr is exported. -
format()
method for keras models (and derivative methodsprint()
,summary()
,str()
, andpy_str()
):- gain a new arg
compact
. IfTRUE
(the default) white-space only lines are stripped out ofmodel.summary()
. - If any layers are marked non-trainable or frozen, the model summary now includes a "Trainable" column, indicating if a layer is frozen.
- gain a new arg
-
freeze_weights()
andunfreeze_weights()
:- gain a flexible
which
argument that can accept layer names (as character strings), an integer vector, a boolean vector, or a function that returns a boolean when called with a layer. (see updated examples in?freeze_weights
from
andto
arguments gain the ability to accept negative integers, to specify layers counting from the end of the layers list.
- gain a flexible
-
get_weights()
gains atrainable
argument that can acceptTRUE
orFALSE
, allowing for returning only the unfrozen or frozen weights, respectively. -
timeseries_dataset_from_array()
:- R arrays are now cast to the floatx dtype ("float32" by default)
start_index
andend_index
now are 1-based.
-
image_dataset_from_directory()
gains acrop_to_aspect_ratio
argument which can be used to prevent distorting images when resizing to a new aspect ratio. -
Layer
is deprecated, superseded bynew_layer_class()
. -
load_model_tf()
argumentcustom_objects
gains the ability to accept an unnamed list (e.g, of objects returned bynew_layer_class()
or similar). Appropriate names for the supplied objects are automatically inferred. -
Fixed an issue where negative values less than -1 supplied to
axis
arguments were selecting the wrong axis. -
get_layer()
gains the ability to accept negative values for theindex
argument. -
Fixed warning from
create_layer_wrapper()
when the custom layer didn't have an overriddeninitialize
or__init__
method. -
Backend functions:
- k_clip()
min_value
andmax_value
gain default values ofNULL
, can be omitted.NULL
is taken as -Inf or Inf, respectively. - k_squeeze():
axis
argument can be omitted, in which case all axes of size 1 are dropped. - k_tile():
n
argument can now be supplied as a tensor. - New function
k_unstack()
.
- k_clip()
-
KerasTensor objects (e.g, returned by
layer_input()
) now inherit S3 methods for"tensorflow.tensor"
. -
plot.keras_training_history()
no longer issues message`geom_smooth()` using formula 'y ~ x'
whenmethod = "ggplot2"
. -
print
and related methods for models (format
,summary
) now accept awidth
argument. -
evaluate()
,fit()
, andpredict()
methods for keras Models now default toverbose = "auto"
, with verbosity adjusted appropriately based on calls tokeras$utils$disable_interactive_logging()
, and contexts likeParameterServerStrategy
. -
install_keras()
now acceptsversion = "release-cpu"
as a valid specification.
-
Breaking change: The semantics of passing a named list to
keras_model()
have changed.Previously,
keras_model()
wouldunname()
suppliedinputs
andoutputs
. Then, if a named list was passed to subsequentfit()
/evaluate()
/call()
/predict()
invocations, matching ofx
andy
was done to the model's input and outpttensor$name
's. Now, matching is done tonames()
ofinputs
and/oroutputs
supplied tokeras_model()
. Callunname()
oninputs
andoutputs
to restore the old behavior, e.g.:keras_model(unname(inputs), unname(outputs))
keras_model()
can now accept a named list for multi-input and/or multi-output models. The named list is converted to adict
in python. (Requires Tensorflow >= 2.4, Python >= 3.7).If
inputs
is a named list:call()
,fit()
,evaluate()
, andpredict()
methods can also accept a named list forx
, with names matching to the names ofinputs
when the model was constructed. Positional matching ofx
is still also supported (requires python 3.7+).
If
outputs
is a named list:fit()
andevaluate()
methods can only accept a named list fory
, with names matching to the names ofoutputs
when the model was constructed.
-
New layer
layer_depthwise_conv_1d()
. -
Models gain
format()
andprint()
S3 methods for compatibility with the latest reticulate. Both are powered bymodel$summary()
. -
summary()
method for Models gains argumentsexpand_nested
andshow_trainable
, both default toFALSE
. -
keras_model_custom()
is soft deprecated. Please define custom models by subclassingkeras$Model
directly using%py_class%
orR6::R6Class()
. -
Fixed warning issued by
k_random_binomial()
. -
Fixed error raised when
k_random_binomial()
was passed a non-floating dtype. -
Added
k_random_bernouli()
as an alias fork_random_binomial()
. -
image_load()
gains acolor_mode
argument. -
Fixed issue where
create_layer_wrapper()
would not include arguments with aNULL
default value in the returned wrapper. -
Fixed issue in
r_to_py.R6ClassGenerator
(and%py_class%
) where single-expressioninitialize
functions defined without{
would error. -
Deprecated functions are no longer included in the package documentation index.
-
Default Tensorflow + Keras version is now 2.7.
-
New API for constructing RNN (Recurrent Neural Network) layers. This is a flexible interface that complements the existing RNN layers. It is primarily intended for advanced / research applications, e.g, prototyping novel architectures. It allows you to compose a RNN with a custom "cell", a Keras layer that processes one step of a sequence. New symbols:
layer_rnn()
, which can compose with builtin cells:rnn_cell_gru()
rnn_cell_lstm()
rnn_cell_simple()
rnn_cells_stack()
To learn more, including how to make a custom cell layer, see the new vignette: "Working with RNNs".
-
New dataset functions:
text_dataset_from_directory()
timeseries_dataset_from_array()
-
New layers:
layer_additive_attention()
layer_conv_lstm_1d()
layer_conv_lstm_3d()
-
layer_cudnn_gru()
andlayer_cudnn_lstm()
are deprecated.layer_gru()
andlayer_lstm()
will automatically use CuDNN if it is available. -
layer_lstm()
andlayer_gru()
: default value forrecurrent_activation
changed from"hard_sigmoid"
to"sigmoid"
. -
layer_gru()
: default valuereset_after
changed fromFALSE
toTRUE
-
New vignette: "Transfer learning and fine-tuning".
-
New applications:
- MobileNet V3:
application_mobilenet_v3_large()
,application_mobilenet_v3_small()
- ResNet:
application_resnet101()
,application_resnet152()
,resnet_preprocess_input()
- ResNet V2:
application_resnet50_v2()
,application_resnet101_v2()
,application_resnet152_v2()
andresnet_v2_preprocess_input()
- EfficientNet:
application_efficientnet_b{0,1,2,3,4,5,6,7}()
- MobileNet V3:
-
Many existing
application_*()
's gain argumentclassifier_activation
, with default'softmax'
. Affected:application_{xception, inception_resnet_v2, inception_v3, mobilenet, vgg16, vgg19}()
-
New function
%<-active%
, a ergonomic wrapper aroundmakeActiveBinding()
for constructing Python@property
decorated methods in%py_class%
. -
bidirectional()
sequence processing layer wrapper gains abackwards_layer
arguments. -
Global pooling layers
layer_global_{max,average}_pooling_{1,2,3}d()
gain akeepdims
argument with default valueFALSE
. -
Signatures for layer functions are in the process of being simplified. Standard layer arguments are moving to
...
where appropriate (and will need to be provided as named arguments). Standard layer arguments include:input_shape
,batch_input_shape
,batch_size
,dtype
,name
,trainable
,weights
. Layers updated:layer_global_{max,average}_pooling_{1,2,3}d()
,time_distributed()
,bidirectional()
,layer_gru()
,layer_lstm()
,layer_simple_rnn()
-
All the backend function with a shape argument
k_*(shape =)
that now accept a a mix of integer tensors and R numerics in the supplied list. -
All layer functions now accept
NA
as a synonym forNULL
in arguments that specify shape as a vector of dimension values, e.g.,input_shape
,batch_input_shape
. -
k_random_uniform()
now automatically castsminval
andmaxval
to the output dtype. -
install_keras()
gains arg with defaultpip_ignore_installed = TRUE
.
-
New family of preprocessing layers. These are the spiritual successor to the
tfdatasets::step_*
family of data transformers (to be deprecated in a future release). Added a new vignette: "Working with Preprocessing Layers". New functions:Image preprocessing:
layer_resizing()
layer_rescaling()
layer_center_crop()
Image augmentation:
layer_random_crop()
layer_random_flip()
layer_random_translation()
layer_random_rotation()
layer_random_zoom()
layer_random_contrast()
layer_random_height()
layer_random_width()
Categorical features preprocessing:
layer_category_encoding()
layer_hashing()
layer_integer_lookup()
layer_string_lookup()
Numerical features preprocessing:
layer_normalization()
layer_discretization()
These join the previous set of text preprocessing functions, each of which have some minor changes:
layer_text_vectorization()
(changed arguments)get_vocabulary()
set_vocabulary()
adapt()
-
adapt()
changes:- Now accepts all features preprocessing layers, previously
only
layer_text_vectorization()
instances were valid. reset_state
argument is removed. It only ever accepted the default value ofTRUE
.- New arguments
batch_size
andsteps
. - Now returns the adapted layer invisibly for composability with
%>%
(previously returnedNULL
)
- Now accepts all features preprocessing layers, previously
only
-
get_vocabulary()
gains ainclude_special_tokens
argument. -
set_vocabulary()
:- Now returns the adapted layer invisibly for composability with
%>%
(previously returnedNULL
) - Signature simplified. Deprecated arguments (
df_data
oov_df_value
) are now subsumed in...
.
- Now returns the adapted layer invisibly for composability with
-
layer_text_vectorization()
:- valid values for argument
output_mode
change:"binary"
is renamed to"multi_hot"
and"tf-idf"
is renamed to"tf_idf"
(backwards compatibility is preserved). - Fixed an issue where valid values of
output_mode = "int"
would incorrectly return a ragged tensor output shape.
- valid values for argument
-
Existing layer instances gain the ability to be added to sequential models via a call. E.g.:
layer <- layer_dense(units = 10) model <- keras_model_sequential(input_shape = c(1,2,3)) %>% layer()
-
Functions in the merging layer family gain the ability to return a layer instance if the first argument
inputs
is missing. (affected:layer_concatenate()
,layer_add()
,layer_subtract()
,layer_multiply()
,layer_average()
,layer_maximum()
,layer_minimum()
,layer_dot()
) -
%py_class%
gains the ability to delay initializing the Python session until first use. It is now safe to implement and export%py_class%
objects in an R package. -
Fixed an issue in
layer_input()
where passing a tensorflowDType
objects to argumentdtype
would throw an error. -
Fixed an issue in
compile()
where passing an R function via an in-line call would result in an error from subsequentfit()
calls. (e.g.,compile(loss = function(y_true, y_pred) my_loss(y_true, y_pred))
now succeeds) -
clone_model()
gains aclone_function
argument that allows you to customize each layer as it is cloned. -
Bumped minimum R version to 3.4. Expanded CI to test on all supported R version. Fixed regression that prevented package installation on R <= 3.4
Breaking changes (Tensorflow 2.6):
-
Note: The following breaking changes are specific to Tensorflow version 2.6.0. However, the keras R package maintains compatibility with multiple versions of Tensorflow/Keras. You can upgrade the R package and still preserve the previous behavior by installing a specific version of Tensorflow:
keras3::install_keras(tensorflow="2.4.0")
-
predict_proba()
andpredict_classes()
were removed. -
model_to_yaml()
andmodel_from_yaml()
were removed. -
default changed:
layer_text_vectorization(pad_to_max_tokens=FALSE)
-
set_vocabulary()
argumentsdf_data
andoov_df_value
are removed. They are replaced by the new argumentidf_weights
.
New Features:
-
Default Tensorflow/Keras version is now 2.6
-
Introduced
%py_class%
, an R-language constructor for Python classes. -
New vignettes:
- Subclassing Python classes: How to use
%py_class%
. - Making new layers and models via subclassing.
- Customizing what happens in fit (example of how to define a model, like a GAN, with a custom train step).
- Writing your own callbacks.
- Subclassing Python classes: How to use
-
The
keras
Python module is exported -
Major changes to the underlying handling of custom R6 layer classes.
- A new
r_to_py()
method is provided forR6ClassGenerator
objects. - R6 custom layers can now inherit directly from Python layer classes or other R6 custom layer classes.
- Custom R6 layers can now be instantiated directly after conversion of the class generator with
r_to_py()
, without going throughcreate_layer()
. KerasLayer
is deprecated (new classes should inherit directly fromkeras$layers$Layer
).KerasWrapper
is deprecated (new classes should inherit directly fromkeras$layers$Wrapper
).create_wrapper()
is deprecated (no longer needed, usecreate_layer()
directly).- All layer class methods provided as R functions now have a
super
in scope that resolves to the Python super class object. - Methods of
super
can be accessed in the 3 common ways:- (Python 3 style):
super()$"__init__"()
- (Python 2 style):
super(ClassName, self)$"__init__"()
- (R6 style):
super$initialize()
- (Python 3 style):
- User defined custom classes that inherit from a Python type are responsible for calling
super()$`__init__`(...)
if appropriate. - Custom layers can now properly handle masks (#1225)
supports_masking = TRUE
attribute is now supportedcompute_mask()
user defined method is now supported
call()
methods now support atraining
argument, as well as any additional arbitrary user-defined arguments
- A new
-
Layer()
custom layer constructor is now lazy about initializing the Python session and safe to use on the top level of an R package (#1229). -
New function
create_layer_wrapper()
that can create a composing R function wrapper around a custom layer class. -
Refactored
install_keras()
(along withtensorflow::install_tensorflow()
). Installation should be more reliable for more users now. If you encounter installation issues, please file an issue: https://github.com/rstudio/keras/issues/new-
Potentially breaking change: numeric versions supplied without a patchlevel now automatically pull the latest patch release. (e.g.
install_keras(tensorflow="2.4")
will install tensorflow version "2.4.2". Previously it would install "2.4.0") -
pandas is now a default extra packages installed by
install_keras()
-
pyyaml is no longer a installed by default if the Tensorflow version >= 2.6.
-
-
Loss functions:
-
All the loss functions gain the ability to return a callable (a
keras$losses$Loss
instance) ify_true
andy_pred
arguments are missing. -
New builtin loss functions:
loss_huber()
loss_kl_divergence()
-
-
Metric functions:
-
All the metric functions gain the ability to return a
keras$metrics$Metric
instance if called withouty_true
andy_pred
-
Each metric function is now documented separately, with a common
?Metric
topic demonstrating example usage. -
New built-in metrics:
metric_true_negatives()
metric_true_positives()
metric_false_negatives()
metric_false_positives()
metric_specificity_at_sensitivity()
metric_sensitivity_at_specificity()
metric_precision()
metric_precision_at_recall()
metric_sum()
metric_recall()
metric_recall_at_precision()
metric_root_mean_squared_error()
metric_sparse_categorical_accuracy()
metric_mean_tensor()
metric_mean_wrapper()
metric_mean_iou()
metric_mean_relative_error()
metric_logcosh_error()
metric_mean()
metric_cosine_similarity()
metric_categorical_hinge()
metric_accuracy()
metric_auc()
-
-
keras_model_sequential()
gains the ability to accept arguments that define the input layer likeinput_shape
anddtype
. See?keras_model_sequential
for details and examples. -
Many layers gained new arguments, coming to parity with the interface available in the latest Python version:
layer name new argument layer_gru
time_major
layer_lstm
time_major
layer_max_pooling_1d
data_format
layer_conv_lstm_2d
return_state
layer_depthwise_conv_2d
dilation_rate
layer_conv_3d_transpose
dilation_rate
layer_conv_1d
groups
layer_conv_2d
groups
layer_conv_3d
groups
layer_locally_connected_1d
implementation
layer_locally_connected_2d
implementation
layer_text_vectorization
vocabulary
-
The
compile()
method for keras models has been updated:optimizer
is now an optional argument. It defaults to"rmsprop"
for regular keras models. Custom models can specify their own default optimizer.loss
is now an optional argument.- New optional arguments:
run_eagerly
,steps_per_execution
. target_tensors
andsample_weight_mode
must now be supplied as named arguments.
-
Added activation functions swish and gelu. (#1226)
-
set_vocabulary()
gains aidf_weights
argument. -
All optimizer had argument
lr
renamed tolearning_rate
. (backwards compatibility is preserved, an R warning is now issued). -
The glue package was added to Imports
-
Refactored automated tests to closer match the default installation procedure and compute environment of most user.
-
Expanded CI test coverage to include R devel, oldrel and 3.6.
- Use compat module when using
set_session
andget_session
. (#1046) - Allows passing other arguments to
keras_model
egname
. (#1045) - Fixed bug when serializing models with the plaidml backends.(#1084)
- Install keras no longer tries to install scipy because it's already installed by tensorflow (#1081)
- Fixed bug with
layer_text_vectorization
with TensorFlow >= 2.3 (#1131) - Handle renamed argument
text
toinput_text
intext_one_hot
(#1133) - Added TensorFlow 2.3 to the CI (#1102)
- Fix C stack error when using Image Data Generators and Time Series generators with TensorFlow <= 2.0.1 (#1135)
- Fixed warning raised in the initial epoch (@gsteinbu #1130)
- Consistent result when using
text_hashing_trick
with missing values (@topepo #1048) - Added a custom error message for
k_logsumexp
as it was removed from Keras (#1137) - Fixed bug when printing models that are not built yet. (#1138)
- Fix drop_duplicates DeprecationWarning with tf 2.3 (@gsteinbu #1139 #1141)
- Fixed bug when plotting the model history if the model used an early stopping callback (#1140)
install_keras
now installs a fixed version of h5py, because newer versions are backward incompatible. (#1142)- Simplify testing utilities by using a
helper-*
file. (#1173) - Deprecated
hdf5_matrix
if using TF >= 2.4 (#1175) - Fixed TensorFlow nightly installation on CI (#1176)
- Support for TensorFlow v2.4: just small fixes for custom classes. (#1177)
- Added
untar
argument toget_file
as it seems to be slightly different fromextract
(#1179) - Warn when not using the tensorflow implementation of Keras (#1181)
- Added
layer_layer_normalization
(#1183) - Added
layer_multihead_attention
(#1184) - Added
image_dataset_from_directory
(#1185) - Fixed bug when using a custom layer with a time distributed adverb. (#1188)
- Added the
ragged
argument tolayer_input
. (#1193) - Fixed
*_generator
deadlocks with recent versions of TensorFlow (#1197)
- Added
layer_attention
(#1000) by @atroiano. - Fixed issue regarding the KerasMetricsCallback with TF v2.2 (#1020)
-
Added
layer_dense_features
. -
Added
on_test_*
,on_test_batch_*
,on_predict_*
andon_predict_*
to callback options. -
Search for the right optimizers and initializers on TensorFlow 2.0
-
Fixed bug in function generators when using models with multiple inputs. (#740)
-
Added
export_savedmodel
support for TensorFlow 2.0 (#773) -
Fixed bug when using
metric_
functions. (#804) -
Allow users to pass additional arguments to
install_keras
(#808) -
Enabled calling Keras models with R arrays. (#806)
-
Allow passing
data.frames
as inputs to Keras models. (#822) -
Fixed bug when passing a fixed validation set to
fit_generator
(#837) -
Fixed bug when passing a TensorFlow dataset to
fit
within atf$distribute
scope. (#856) -
install_keras
will now install Keras dependencies (#856). It won't re-install TensorFlow if it's already installed. -
Fixed deprecation messages showed with TensorFlow v1.14.
-
Largely reduced tests verbosity.
-
Use
tf.keras
as default implementation module. -
Added AppVeyor to test on Windows.
-
Added
flow_images_from_dataframe
function (#658). -
Allow for unknown
input_shape
inapplication_*
functions. -
Added
save_model_tf
andload_model_tf
to save/load models in the TensorFlow's SavedModel format.
-
Improve handling of
timeseries_generator()
in calls tofit_generator()
-
Add support for
input_shape
argument tolayer_dropout()
-
Improve error message for data frames passed to
fit()
, etc. -
Use 1-based axis indices for
k_gather()
-
Added
version
parameter toinstall_keras()
for installing alternate/older versions -
Added
activation_exponential()
function. -
Added
threshold
parameter toactivation_relu()
-
Added
restore_best_weights
parameter tocallback_model_checkpoint()
-
Added
update_freq
parameter tocallback_tensorboard()
-
Added
negative_slope
andthreshold
parameters tolayer_activation_relu()
-
Added
output_padding
anddilation_rate
parameters tolayer_conv_2d_transpose()
-
Added
output_padding
argument tolayer_conv_3d_transpose()
-
Added
data_format
argument tolayer_separable_conv_1d()
,layer_average_pooling_1d()
,layer_global_max_pooling_1d()
, andlayer_global_average_pooling_1d()
-
Added
interpolation
argument tolayer_upsampling_1d()
andlayer_upsampling_2d()
-
Added
dtype
argument toto_categorical()
-
Added
layer_activation_selu()
function. -
Added
KerasWrapper
class and correspondingcreate_wrapper
function.
-
Fix issue with serializing models that have constraint arguments
-
Fix issue with
k_tile
that needs an integer vector instead of a list as then
argument. -
Fix issue with user-supplied
output_shape
inlayer_lambda()
not being supplied to tensorflow backends -
Filter out metrics that were created for callbacks (e.g.
lr
) -
Added
application_mobilenet_v2()
pre-trained model -
Added
sample_weight
parameter toflow_images_from_data()
-
Use native Keras implementation (rather than SciPy) for
image_array_save()
-
Default
layer_flatten()
data_format
argument toNULL
(which defaults to global Keras config). -
Add
baseline
argument tocallback_early_stopping()
(stop training if a given baseline isn't reached). -
Add
data_format
argument tolayer_conv_1d()
. -
Add
layer_activation_relu()
, making the ReLU activation easier to configure while retaining easy serialization capabilities. -
Add
axis = -1
argument in backend crossentropy functions specifying the class prediction axis in the input tensor. -
Handle symbolic tensors and TF datasets in calls to
fit()
,evaluate()
, andpredict()
-
Add
embeddings_data
argument tocallback_tensorboard()
-
Support for defining custom Keras models (i.e. custom
call()
logic for forward pass) -
Handle named list of model output names in
metrics
argument ofcompile()
-
New
custom_metric()
function for defining custom metrics in R -
Provide typed wrapper for categorical custom metrics
-
Provide access to Python layer within R custom layers
-
Don't convert custom layer output shape to tuple when shape is a list or tuple of other shapes
-
Re-export
shape()
function from tensorflow package -
Re-export
tuple()
function from reticulate package -
Indexes for
get_layer()
are now 1-based (for consistency w/freeze_weights()
) -
Accept named list for
sample_weight
argument tofit()
-
Fix issue with single-element vectors passed to text preprocessing functions
-
Compatibility with TensorFlow v1.7 Keras implementation
-
Support
workers
parameter for native Keras generators (e.g.flow_images_from_directory()
) -
Accept tensor as argument to
k_pow()
-
In
callback_reduce_lr_on_plateau()
, renameepsilon
argument tomin_delta
(backwards-compatible). -
Add
axis
parameter tok_softmax()
-
Add
send_as_json
parameter tocallback_remote_monitor()
-
Add
data_format
method tolayer_flatten()
-
In
multi_gpu_model()
, add argumentscpu_merge
andcpu_relocation
(controlling whether to force the template model's weights to be on CPU, and whether to operate merge operations on CPU or GPU). -
Record correct loss name for tfruns when custom functions are provided for
loss
-
Support for custom constraints from R
-
Added
timeseries_generator()
utility function -
New layer
layer_depthwise_conv_2d()
-
Added
brightness_range
andvalidation_split
arguments to [image_data_generator()].
-
Added support for
remove_learning_phase
inexport_savedmodel()
to avoid removing learning phase. -
Normalize validation data to Keras array in
fit()
andfit_generator()
-
Ensure that custom layers return a tuple from
compute_output_shape()
-
Added Nasnet and Densenet pre-trained models
-
New layers
layer_activation_softmax()
andlayer_separable_conv_1d()
-
Added
amsgrad
parameter tooptimizer_adam()
-
Fix incompatibility with Progbar.update() method in Keras 2.1.4
-
Models saved via
export_savedmodel()
that make use of learning phases can now be exported without having to manually reload the original model. -
Ensure that models saved via
export_savedmodel()
can be served from CloudML -
Run image data generators with R preprocessing functions on the main thread
-
Return R list from
texts_to_sequences()
-
Various fixes for
use_implementation()
function
-
Added
theme_bw
option to plot method for training history -
Support TF Dataset objects as generators for
fit_generator()
, etc. -
Added
use_implementation()
anduse_backend()
functions as alternative to settingKERAS_IMPLEMENATION
andKERAS_BACKEND
environment variables. -
Added R wrappers for Keras backend functions (e.g.
k_variable()
,k_dot()
, etc.) -
Use 1-based axis for
normalize
function. -
Fix issue with printing training history after early stopping.
-
Experimental support for using the PlaidML backend.
-
Correct handling for R functions specified in
custom_objects
-
Added
with_custom_object_scope()
function. -
Automatically provide name to loss function during compile (enables save/load of models with custom loss function)
-
Provide global
keras.fit_verbose
option (defaults to 1)
-
Added
multi_gpu_model()
function. -
Automatically call
keras_array()
on the results of generator functions. -
Ensure that
steps_per_epoch
is passed as an integer -
Import
evaluate()
generic from tensorflow package -
Handle
NULL
when converting R arrays to Keras friendly arrays -
Added
dataset_imbd_word_index()
function -
Ensure that
sample_weight
is passed tofit()
as an array. -
Accept single function as
metrics
argument tocompile()
-
Automatically cast
input_shape
argument to applications to integer -
Allow Keras models to be composable within model pipelines
-
Added
freeze_weights()
andunfreeze_weights()
functions. -
Implement
export_savedmodel()
generic from TensorFlow package -
Convert R arrays to row-major before image preprocessing
-
Use
tensorflow.keras
for tensorflow implementation (TF v1.4) -
Added
application_inception_resnet_v2()
pre-trained model -
Added
dataset_fashion_mnist()
dataset -
Added
layer_cudnn_gru()
andlayer_cudnn_lstm()
(faster recurrent layers backed by CuDNN) -
Added
layer_minimum()
function -
Added
interpolation
parameter toimage_load()
function -
Add
save_text_tokenizer()
andload_text_tokenizer()
functions. -
Fix for progress bar output in Keras >= 2.0.9
-
Remove deprecated
implementation
argument from recurrent layers -
Support for passing generators for validation data in
fit_generator()
-
Accept single integer arguments for kernel sizes
-
Add standard layer arguments to
layer_flatten()
andlayer_separable_conv_2d()
-
Added
image_array_resize()
andimage_array_save()
for 3D image arrays. -
Allow custom layers and lambda layers to accept list parameters.
-
Expose
add_loss()
function for custom layers
-
Add
use_session_with_seed()
function that establishes a random seed for the Keras session. Note that this should not be used when training time is paramount, as it disables GPU computation and CPU parallelism by default for more deterministic computations. -
Fix for plotting training history with early stopping callback (thanks to @JamesAllingham).
-
Return R training history object from
fit_generator()
-
Rename
to_numpy_array()
function tokeras_array()
reflecting automatic use of Keras default backend float type and "C" ordering. -
Add standard layer arguments (e.g.
name
,trainable
, etc.) to merge layers -
Better support for training models from data tensors in TensorFlow (e.g. Datasets, TFRecords). Add a related example script.
-
Add
clone_model()
function, enabling to construct a new model, given an existing model to use as a template. Works even in a TensorFlow graph different from that of the original model. -
Add
target_tensors
argument incompile()
, enabling to use custom tensors or placeholders as model targets. -
Add
steps_per_epoch
argument infit()
, enabling to train a model from data tensors in a way that is consistent with training from arrays. Similarly, addsteps
argument inpredict()
andevaluate()
. -
Add
layer_subtract()
layer function. -
Add
weighted_metrics
argument in compile to specify metric functions meant to take into accountsample_weight
orclass_weight
. -
Enable stateful RNNs with CNTK.
-
install_keras()
function which installs both TensorFlow and Keras -
Use keras package as default implementation rather than tf.contrib.keras
-
Training metrics plotted in realtime within the RStudio Viewer during fit
-
serialize_model()
andunserialize_model()
functions for saving Keras models as 'raw' R objects. -
Automatically convert 64-bit R floats to backend default float type
-
Ensure that arrays passed to generator functions are normalized to C-order
-
to_numpy_array()
utility function for custom generators (enables custom generators to yield C-ordered arrays of the correct float type) -
Added
batch_size
andwrite_grads
arguments tocallback_tensorboard()
-
Added
return_state
argument to recurrent layers. -
Don't re-export
install_tensorflow()
andtf_config()
from tensorflow package. -
is_keras_available()
function to probe whether the Keras Python package is available in the current environment. -
as.data.frame()
S3 method for Keras training history -
Remove names from
keras_model()
inputs -
Return result of
evaluate()
as named list -
Write run metrics and evaluation data to tfruns
-
Provide hint to use r-tensorflow environment when importing keras
- Initial CRAN release