- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 89
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 66
- Table-GPT: Table-tuned GPT for Diverse Table Tasks
  Paper • 2310.09263 • Published • 41
- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 104
Collections
Collections including paper arxiv:2404.05961
- McGill-NLP/LLM2Vec-Meta-Llama-31-8B-Instruct-mntp-supervised
  Sentence Similarity • Updated • 47 • 4
- McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised
  Sentence Similarity • Updated • 6.3k • 49
- McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised
  Sentence Similarity • Updated • 99 • 13
- McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp-supervised
  Sentence Similarity • Updated • 6 • 3