-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* check for nans when unsharding * don't cast root forward inputs * use all_gather_into_tensor when possible * fix * fix * fix dtype * clean up * clean up * assert dtypes in `summon_full_params` context * no write back * more tests * explicit cast when checking * updates * ensure `cast` and `writeback` not both set * cast in other direction * revert * Add mp option to train script * update * adjust LR * update prefetching logic in forward pass * update backward prefetch logic * define stream in top-level of package * debugging * add to stream test * more test * clean up * Try recording stream * Add comment * clean up * updates * fix how many mods are prefetched * don't check for nan loss * clean up
- Loading branch information
Showing
12 changed files
with
272 additions
and
102 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.