From dc5f71c6c95c19edf540b201c5b48c42b023efd8 Mon Sep 17 00:00:00 2001
From: Joshua David
Date: Thu, 18 Jul 2024 22:04:25 -0700
Subject: [PATCH] Update README.md to be more detailed

---
 README.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/README.md b/README.md
index d2c58e5..eb70f06 100644
--- a/README.md
+++ b/README.md
@@ -193,6 +193,10 @@ print(output.shape) # Expected shape: (batch_size, target_length, d_model)
 
 ## Advanced Usage
 
+Custom Dataset Training
+
+To train on a custom dataset:
+
 ## Results
 
 My implementation of LongRoPE achieves the following results: