Raincleared commited on
Commit
7003030
·
verified ·
1 Parent(s): 988214f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -12,3 +12,17 @@ This is the 3B BlockFFN model used in the paper *BlockFFN: Towards End-Side Acce
12
  It is directly adaptable to the `inference` implementation of our [codes](https://github.com/thunlp/BlockFFN).
13
 
14
  Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  It is directly adaptable to the `inference` implementation of our [codes](https://github.com/thunlp/BlockFFN).
13
 
14
  Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
15
+
16
+ ### Citation
17
+
18
+ If you find our work useful for your research, please kindly cite our paper as follows:
19
+
20
+ ```
21
+ @article{song2025blockffn,
22
+ title={{BlockFFN}: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity},
23
+ author={Chenyang Song and Weilin Zhao and Xu Han and Chaojun Xiao and Yingfa Chen and Yuxuan Li and Zhiyuan Liu and Maosong Sun},
24
+ journal={arXiv preprint arXiv:2507.08771},
25
+ year={2025},
26
+ url={https://arxiv.org/pdf/2507.08771},
27
+ }
28
+ ```