-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 13.8k • 320 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 10.8k • • 211 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.96M • 729 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 443k • 1.54k
Gain.Energy
company
Verified
AI & ML interests
At Gain Energy, we are committed to harnessing the power of Artificial Intelligence (AI) and Machine Learning (ML) to revolutionize the oil and gas industry. Our focus spans a wide range of AI and ML applications aimed at enhancing efficiency, safety, and sustainability.
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 29 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 9 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 13
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 668 • 158 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.72M • • 1.84k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 290k • 930 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 220k • • 1.55k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 31 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 86 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 61 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 45
-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 13.8k • 320 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 10.8k • • 211 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.96M • 729 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 443k • 1.54k
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 29 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 9 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 13
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 668 • 158 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.72M • • 1.84k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 290k • 930 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 220k • • 1.55k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 31 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 86 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 61 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 45