Commit 4013381
Fix a2a type (#3311)
Summary:
Pull Request resolved: #3311
VBE initializes the dist, but the KJT sets the ctx flag at runtime.
So if the batch sizes happen to match across all features, we assume a fixed batch size, which results in a runtime error.
In this diff, I fix (pin) the dist's a2a type once it is initialized.
As a follow-up, we should drive this from config.
https://fb.workplace.com/groups/1699838000485189/permalink/2222934654842185/
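
To make the failure mode in the summary concrete, here is a minimal sketch in plain Python. It is not the TorchRec code: `ToyDist`, `looks_variable`, and both method names are hypothetical, and the strings "variable" and "fixed" merely stand in for the two a2a types. It assumes the dist is created for variable batch sizes and contrasts re-deriving the type from each batch with keeping the type chosen at initialization.

```python
from typing import List


def looks_variable(batch_size_per_feature: List[int]) -> bool:
    """Hypothetical runtime check: True if batch sizes differ across features."""
    return len(set(batch_size_per_feature)) > 1


class ToyDist:
    """Hypothetical stand-in for a dist module; not a TorchRec class."""

    def __init__(self, variable_batch: bool) -> None:
        # The a2a type is decided here, when the dist is initialized.
        self.variable_batch = variable_batch

    def a2a_type_from_batch(self, batch_size_per_feature: List[int]) -> str:
        # Problematic pattern: re-derive the type from every incoming batch.
        # A batch whose sizes happen to match is treated as fixed-size.
        return "variable" if looks_variable(batch_size_per_feature) else "fixed"

    def a2a_type_pinned(self, batch_size_per_feature: List[int]) -> str:
        # Pinned pattern: ignore the per-batch coincidence and keep the
        # type chosen at initialization.
        return "variable" if self.variable_batch else "fixed"


dist = ToyDist(variable_batch=True)
uniform_batch = [4, 4, 4]  # sizes happen to match for all features
mixed_batch = [4, 2, 8]

print(dist.a2a_type_from_batch(uniform_batch))  # disagrees with how dist was set up
print(dist.a2a_type_pinned(uniform_batch))      # stays consistent with initialization
print(dist.a2a_type_pinned(mixed_batch))
```

In this toy version the mismatch only shows up as a different string; in the real module the summary says it surfaces as a runtime error.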
Differential Revision: D80742183
fbshipit-source-id: 1898040fd436a54742f78594f996a7f4e5e0225c
1 parent 1b1e2b3 commit 4013381
2 files changed in torchrec/distributed/sharding
+23 -14 lines changed
File 1: original lines 367-386 become 367-387 (+12 -11)
File 2: original lines 504-517 become 504-524 (+9 -2); original lines 525-531 become 532-539 (+2 -1)