Commit b40eb84
llama : support for
* feat: initial support for llama.cpp
* fix: lint
* refactor: better refactor
* Update src/llama.cpp
Co-authored-by: compilade <git@compilade.net>
* Update src/llama.cpp
Co-authored-by: compilade <git@compilade.net>
* fix: address comments
* Update convert_hf_to_gguf.py
Co-authored-by: compilade <git@compilade.net>
* fix: add more cleanup and harmonization
* fix: lint
* Update gguf-py/gguf/gguf_writer.py
Co-authored-by: compilade <git@compilade.net>
* fix: change name
* Apply suggestions from code review
Co-authored-by: compilade <git@compilade.net>
* add in operator
* fix: add `dt_b_c_rms` in `llm_load_print_meta`
* fix: correct printf format for bool
* fix: correct print format
* Update src/llama.cpp
Co-authored-by: compilade <git@compilade.net>
* llama : quantize more Mamba tensors
* llama : use f16 as the fallback of fallback quant types
---------
Co-authored-by: compilade <git@compilade.net>falcon-mamba architecture (#9074)1 parent f63f603 commit b40eb84
File tree
5 files changed
+36
-24
lines changed- gguf-py/gguf
- src
5 files changed
+36
-24
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
| 109 | + | |
109 | 110 | | |
110 | 111 | | |
111 | 112 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
295 | 295 | | |
296 | 296 | | |
297 | 297 | | |
| 298 | + | |
298 | 299 | | |
299 | 300 | | |
300 | 301 | | |
| |||
2711 | 2712 | | |
2712 | 2713 | | |
2713 | 2714 | | |
2714 | | - | |
| 2715 | + | |
2715 | 2716 | | |
2716 | 2717 | | |
2717 | 2718 | | |
| |||
2742 | 2743 | | |
2743 | 2744 | | |
2744 | 2745 | | |
2745 | | - | |
| 2746 | + | |
| 2747 | + | |
| 2748 | + | |
| 2749 | + | |
2746 | 2750 | | |
2747 | 2751 | | |
2748 | 2752 | | |
2749 | 2753 | | |
2750 | 2754 | | |
2751 | 2755 | | |
2752 | 2756 | | |
2753 | | - | |
| 2757 | + | |
2754 | 2758 | | |
2755 | 2759 | | |
2756 | 2760 | | |
2757 | 2761 | | |
2758 | 2762 | | |
| 2763 | + | |
2759 | 2764 | | |
2760 | 2765 | | |
2761 | 2766 | | |
| |||
2782 | 2787 | | |
2783 | 2788 | | |
2784 | 2789 | | |
2785 | | - | |
2786 | | - | |
2787 | | - | |
2788 | | - | |
2789 | | - | |
2790 | | - | |
2791 | | - | |
2792 | | - | |
2793 | | - | |
2794 | | - | |
2795 | | - | |
2796 | | - | |
2797 | | - | |
2798 | | - | |
2799 | | - | |
2800 | | - | |
2801 | | - | |
2802 | 2790 | | |
2803 | 2791 | | |
2804 | 2792 | | |
| |||
3792 | 3780 | | |
3793 | 3781 | | |
3794 | 3782 | | |
3795 | | - | |
| 3783 | + | |
3796 | 3784 | | |
3797 | 3785 | | |
3798 | 3786 | | |
| |||
3855 | 3843 | | |
3856 | 3844 | | |
3857 | 3845 | | |
3858 | | - | |
3859 | 3846 | | |
| 3847 | + | |
3860 | 3848 | | |
3861 | 3849 | | |
3862 | 3850 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
130 | 130 | | |
131 | 131 | | |
132 | 132 | | |
| 133 | + | |
133 | 134 | | |
134 | 135 | | |
135 | 136 | | |
| |||
1372 | 1373 | | |
1373 | 1374 | | |
1374 | 1375 | | |
| 1376 | + | |
1375 | 1377 | | |
1376 | 1378 | | |
1377 | 1379 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
730 | 730 | | |
731 | 731 | | |
732 | 732 | | |
| 733 | + | |
| 734 | + | |
| 735 | + | |
733 | 736 | | |
734 | 737 | | |
735 | 738 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
328 | 328 | | |
329 | 329 | | |
330 | 330 | | |
| 331 | + | |
331 | 332 | | |
332 | 333 | | |
333 | 334 | | |
| |||
426 | 427 | | |
427 | 428 | | |
428 | 429 | | |
| 430 | + | |
429 | 431 | | |
430 | 432 | | |
431 | 433 | | |
| |||
2237 | 2239 | | |
2238 | 2240 | | |
2239 | 2241 | | |
| 2242 | + | |
2240 | 2243 | | |
2241 | 2244 | | |
2242 | 2245 | | |
| |||
2286 | 2289 | | |
2287 | 2290 | | |
2288 | 2291 | | |
| 2292 | + | |
2289 | 2293 | | |
2290 | 2294 | | |
2291 | 2295 | | |
| |||
5052 | 5056 | | |
5053 | 5057 | | |
5054 | 5058 | | |
| 5059 | + | |
5055 | 5060 | | |
5056 | 5061 | | |
5057 | 5062 | | |
| |||
5907 | 5912 | | |
5908 | 5913 | | |
5909 | 5914 | | |
| 5915 | + | |
5910 | 5916 | | |
5911 | 5917 | | |
5912 | 5918 | | |
| |||
12161 | 12167 | | |
12162 | 12168 | | |
12163 | 12169 | | |
| 12170 | + | |
| 12171 | + | |
| 12172 | + | |
| 12173 | + | |
12164 | 12174 | | |
12165 | 12175 | | |
12166 | 12176 | | |
| |||
12241 | 12251 | | |
12242 | 12252 | | |
12243 | 12253 | | |
| 12254 | + | |
| 12255 | + | |
| 12256 | + | |
| 12257 | + | |
| 12258 | + | |
| 12259 | + | |
| 12260 | + | |
12244 | 12261 | | |
12245 | 12262 | | |
12246 | 12263 | | |
| |||
16105 | 16122 | | |
16106 | 16123 | | |
16107 | 16124 | | |
| 16125 | + | |
| 16126 | + | |
| 16127 | + | |
16108 | 16128 | | |
16109 | 16129 | | |
16110 | 16130 | | |
| |||
16433 | 16453 | | |
16434 | 16454 | | |
16435 | 16455 | | |
16436 | | - | |
16437 | | - | |
16438 | 16456 | | |
16439 | 16457 | | |
16440 | 16458 | | |
| |||
0 commit comments