
Book 2 (load/save) #259

Merged: 3 commits into main from book_2, Aug 1, 2023

Conversation

@Narsil (Collaborator) commented Jul 27, 2023

Depends on #258. Drafted until that is merged.

@Narsil marked this pull request as draft July 27, 2023 13:40
use candle::{DType, Device, Result, Tensor};

struct Model {
    first: Tensor,
}
Collaborator:
Maybe it makes more sense to use candle_nn::Linear here?
Or maybe we really want to go back to basics, in which case we should mention that Linear exists, but say that for the sake of this tutorial we won't use it and will work with explicit tensors instead.

Collaborator (author):

Great idea. I was wondering how to make this example more complex without going into too much detail, to keep it hello-world-like.

I will keep this example simple, then expand by building your own Linear, and finally refer to candle_nn for layers in general.

For Conv1d and Linear, for instance. Wdyt?
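A hand-rolled Linear, as suggested above, also makes the later "no magic" point: a linear layer is just a matrix multiply plus a bias. Here is a dependency-free sketch in plain Rust; the struct, field names, and shapes are illustrative assumptions, not the book's candle code (which would operate on Tensor with something like matmul plus a broadcast add).

```rust
/// Hypothetical minimal linear layer over plain slices, for illustration only.
struct Linear {
    weight: Vec<Vec<f32>>, // shape: [out_features][in_features]
    bias: Vec<f32>,        // shape: [out_features]
}

impl Linear {
    /// Forward pass for a single input vector x of length in_features:
    /// y[o] = sum_i weight[o][i] * x[i] + bias[o]
    fn forward(&self, x: &[f32]) -> Vec<f32> {
        self.weight
            .iter()
            .zip(&self.bias)
            .map(|(row, b)| row.iter().zip(x).map(|(w, xi)| w * xi).sum::<f32>() + b)
            .collect()
    }
}

fn main() {
    // 2 inputs -> 3 outputs, with simple weights for easy checking.
    let layer = Linear {
        weight: vec![vec![1.0, 0.0], vec![0.0, 1.0], vec![1.0, 1.0]],
        bias: vec![0.5, 0.5, 0.5],
    };
    let y = layer.forward(&[2.0, 3.0]);
    println!("{:?}", y); // [2.5, 3.5, 5.5]
}
```

In the book itself, the same idea would be expressed with candle Tensors, with candle_nn::Linear only introduced afterwards.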

Collaborator (author):

BTW, this comment is more about #258. (I converted this PR to draft because it is more about the save/load surface, to make the simple cheatsheet more complete on that front.)

Collaborator:

Sounds good. You could even leave out backprop/gradient descent for a tensor 101, though maybe that's too simplistic. But it's certainly good to only introduce candle_nn later in the process (so that users can understand that there is no magic under the hood there).

Collaborator (author):

Yup, I want to keep the training loop in a different place.

IMO, after "hello world" plus "there's no magic", you have several options:

  • Run a real model (90% of people)
  • Train a model (10% of people)

It's an old stat; we're probably closer to 99% vs 1% now that AI is much more mainstream (but there are still a lot of fine-tuners out there, mostly using scripts already written by other people).

@Narsil changed the title from "Adding new surface for savetensors (global load, global save)." to "Adding new surface for safetensors (global load, global save)." Jul 27, 2023
@Narsil marked this pull request as ready for review August 1, 2023 13:00
@Narsil changed the title from "Adding new surface for safetensors (global load, global save)." to "Book 2 (load/save)" Aug 1, 2023
@Narsil merged commit babee9f into main Aug 1, 2023
10 checks passed
@Narsil deleted the book_2 branch August 1, 2023 15:27