🚀 Release 0.4.0 #134

jean-francoisreboud · 2024-09-01T20:12:17Z

Features

🚀 examples: integrate Gemma2-2B (#132)
✨ layer_seq: LLM sliding window (#131)
🚀 examples: 3 LLMs examples (#130)
✨ layer_seq: LLM generate (128)
✨ layer_seq: MultiplySeq, SiLU & LLM test (127)
✨ layer_seq: ValueCausalSeq (126)
✨ layer_seq: QueryCausalSeq (125)
✨ layer_seq: RoPESeq (124)
✨ layer_seq: RMSNormSeq (123)
✨ layer_seq: EmbeddingSeq (122)
🪜 feat: LayerCAM2D -> VQGrad2D, LayerCAMSeq -> VQGradSeq (#117)
⚙️ core: GELU vs GELUApprox (113)
🚀 perf: QuerySelf & ValueSelf (112)
🚀 perf: benchmark ViT base model (111)
⚙️ core: initForward,Backward model API (109)
🪜 layer_1d: Dropout1D (#108)
🪜 feat: VQGrad, VQGradSeq (#107)

Bug Fixes

🐛 fix: run on Apple Silicon (110)

Miscellaneous Tasks

📚 docs: LLM doc & split tests (129)
🚀 perf: use half in Metal kernels (121)
🔨 refactor: handle float16 along float on GPU (#120)
🚀 perf: copy & generate weights faster (119)
🚀 perf: Convolution2D (118)

jean-francoisreboud added 24 commits September 18, 2023 11:34

✨ feat: VQGrad, VQGradSeq (#107)

064392b

✨ feat: Dropout1D (#108)

3130f05

✨ feat(core): initForward,Backward model API (#109)

516833d

🐛 fix: run on Apple Silicon (#110)

63934a9

🚀 perf: benchmark ViT base model (#111)

c2988f1

🚀 perf: QuerySelf & ValueSelf (#112)

4969db6

✨ feat(core): GELU vs GELUApprox (#113)

096b95d

✨ feat: LayerCAM2D -> VQGrad2D, LayerCAMSeq -> VQGradSeq (#117)

3d3191d

🚀 perf: Convolution2D (#118)

192f994

🚀 perf: copy & generate weights faster (#119)

a9d176c

🔨 refactor: handle float16 along float on GPU (#120)

52ab4df

🚀 perf: use half in Metal kernels (#121)

ceff714

✨ feat(layer_seq): EmbeddingSeq (#122)

d97e520

✨ feat(layer_seq): RMSNormSeq (#123)

2d65e95

✨ feat(layer_seq): RoPESeq (#124)

03e2617

✨ feat(layer_seq): QueryCausalSeq (#125)

6dd84dd

✨ feat(layer_seq): ValueCausalSeq (#126)

8ab07d5

✨ layer_seq: MultiplySeq, SiLU & LLM test (#127)

0e34be3

✨ feat(layer_seq): LLM generate (#128)

6a188fd

📚 docs: LLM doc & split tests (#129)

c3a8ade

🚀 test(examples): 3 LLMs examples (#130)

723b021

✨ feat(layer_seq): LLM sliding window (#131)

54b4a30

🚀 test(examples): integrate Gemma2-2B (#132)

838e922

🔧 chore: update changelog (#133)

6f8720a

jean-francoisreboud self-assigned this Sep 1, 2024

jean-francoisreboud merged commit a6ca885 into main Sep 1, 2024
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚀 Release 0.4.0 #134

🚀 Release 0.4.0 #134

jean-francoisreboud commented Sep 1, 2024 •

edited

Loading

🚀 Release 0.4.0 #134

🚀 Release 0.4.0 #134

Conversation

jean-francoisreboud commented Sep 1, 2024 • edited Loading

Features

Bug Fixes

Miscellaneous Tasks

jean-francoisreboud commented Sep 1, 2024 •

edited

Loading