YiChen Lv
d789527482
spec : Support Step3.5/3.7 flash mtp3 (#24340)
* add mtp_layer_offset + include nextn flags in graph reuse
* add llama_set_mtp_layer_offset + llama_model_n_nextn_layer API
* offset head select + require all MTP blocks
* speculative multi-head process()
* speculative multi-head draft()
* gather outputs via inp_out_ids
* cleanup
* fix core
* minor cleanup
* merged draft_multi_head into draft()
* mtp rename nextn
* Apply suggestions from code review
* clean-up comments
* fix for multi seq
* apply suggestions && chain-heads comment
* add a reference for chain_heads discussion
---------
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
2026-06-21 11:33:18 +03:00