---
language:
- en
- hi
- zh
- es
- fr
- de
- ja
- ko
- ar
- pt
- ru
- it
- nl
- tr
- pl
- sv
- da
- 'no'
- fi
- he
- th
- vi
- id
- ms
- tl
- sw
- yo
- zu
- am
- bn
- gu
- kn
- ml
- mr
- ne
- or
- pa
- ta
- te
- ur
- multilingual
license: apache-2.0
base_model: Qwen/Qwen3-14B-Base
tags:
- reasoning
- intermediate-thinking
- transformers
- conversational
- bilingual
datasets:
- Abhaykoul/Dhanishtha-R1
- open-thoughts/OpenThoughts-114k
- Abhaykoul/Dhanishtha-2.0-SUPERTHINKER
- Abhaykoul/Dhanishtha-2.0
library_name: transformers
pipeline_tag: text-generation
widget:
- text: >-
    Solve this riddle step by step: I am taken from a mine, and shut up in a
    wooden case, from which I am never released, and yet I am used by almost
    everybody. What am I?
  example_title: Complex Riddle Solving
- text: >-
    Explain the philosophical implications of artificial consciousness and think
    through different perspectives.
  example_title: Philosophical Reasoning
- text: >-
    Help me understand quantum mechanics, but take your time to think through
    the explanation.
  example_title: Educational Explanation
new_version: HelpingAI/Dhanishtha-2.0-preview-0825
---

# Dhanishtha-2.0: World's First Intermediate Thinking AI Model

**What makes Dhanishtha-2.0 special?** Imagine an AI that doesn't just answer your questions instantly, but actually *thinks through* problems step-by-step, shows its work, and can even change its mind when it realizes a better approach. That's Dhanishtha-2.0.

**Quick Summary:**
- 🚀 **For Everyone**: An AI that shows its thinking process and can reconsider its reasoning
- 👩‍💻 **For Developers**: The first model with intermediate thinking capabilities and support for 39+ languages


Dhanishtha-2.0 is a **state-of-the-art (SOTA) model** developed by HelpingAI, representing the **world's first model to feature Intermediate Thinking capabilities**. Unlike traditional models that provide single-pass responses, Dhanishtha-2.0 employs a revolutionary multi-phase thinking process that allows the model to think, reconsider, and refine its reasoning multiple times throughout a single response.

## Model Details

### Model Description

Dhanishtha-2.0 revolutionizes AI reasoning by introducing the concept of **intermediate thinking** - the ability to pause, reflect, and restart reasoning processes within a single generation (the model can think up to 50 times in a single response without relying on external tools, prompting tricks, or MCP). This breakthrough enables unprecedented self-correction and iterative refinement during response generation.

Built on the Qwen3-14B foundation with multilingual capabilities spanning **39+ languages** (including English, Hindi, Chinese, Spanish, French, German, Japanese, Korean, Arabic, and many more), Dhanishtha-2.0 maintains reasoning consistency across diverse linguistic contexts while pioneering transparent thinking processes.

- **Developed by:** HelpingAI Team
- **Model type:** Causal Language Model with Intermediate Thinking Capability
- **Language(s):** 39+ languages (multilingual capabilities inherited from base model)
- **License:** Apache 2.0
- **Finetuned from model:** Qwen/Qwen3-14B-Base
- **Context Length:** 40,960 tokens
- **Parameters:** 14B (inherited from base model)
- **Status:** Prototype/Preview

### Revolutionary Features

- **Intermediate Thinking**: Multiple `<think>...</think>` blocks throughout responses for real-time reasoning
- **Self-Correction**: Ability to identify and correct logical inconsistencies mid-response  
- **Dynamic Reasoning**: Seamless transitions between analysis, communication, and reflection phases
- **Structured Emotional Reasoning (SER)**: Incorporates `<ser>...</ser>` blocks for empathetic responses
- **Multilingual Capabilities**: Support for 39+ languages with natural code-switching and reasoning consistency
- **Complex Problem-Solving**: Excels at riddles, multi-step reasoning, and scenarios requiring backtracking

### Model Sources

- **Repository:** [HelpingAI/Dhanishtha-2.0](https://huggingface.co/HelpingAI/Dhanishtha-2.0)
- **Paper:** Coming Soon
- **Demo:** https://helpingai.co/chat

## Uses

### Direct Use

Dhanishtha-2.0 is ideal for applications requiring deep reasoning and self-reflection:

- **Complex Problem Solving**: Multi-step mathematical problems, logical puzzles, riddles
- **Educational Assistance**: Detailed explanations with visible reasoning processes
- **Research Support**: Analysis requiring multiple perspectives and self-correction
- **Creative Writing**: Iterative story development with reasoning about plot choices
- **Philosophical Discussions**: Exploring concepts with visible thought processes

### Downstream Use

The model can be fine-tuned for specialized reasoning tasks:

- **Domain-Specific Reasoning**: Legal, medical, or scientific reasoning with intermediate thinking
- **Enhanced Multilingual Reasoning**: Optimizing reasoning consistency across all 39+ supported languages
- **Specialized Problem Domains**: Mathematics, coding, strategic planning
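
If you want to adapt the model to one of these domains without full retraining, parameter-efficient fine-tuning is a common starting point. The sketch below uses the PEFT library to attach LoRA adapters; the target modules and hyperparameters are illustrative assumptions for a Qwen3-style architecture, not the recipe used to train Dhanishtha-2.0, and your training data should keep the `<think>`/`<ser>` block format if you want to preserve the intermediate-thinking style.

```python
# Minimal sketch of adapter-based (LoRA) fine-tuning setup with the PEFT library.
# This is NOT the official Dhanishtha-2.0 training recipe; the hyperparameters and
# target modules below are illustrative assumptions for a Qwen3-style architecture.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "HelpingAI/Dhanishtha-2.0-preview"
tokenizer = AutoTokenizer.from_pretrained(model_name)
base_model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto",
)

# Attach low-rank adapters to the attention projections; the base weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable

# From here, train with your preferred trainer (e.g., transformers.Trainer or TRL's
# SFTTrainer) on domain data that keeps the <think>...</think> / <ser>...</ser>
# block format so the intermediate-thinking behavior is preserved.
```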

### Out-of-Scope Use

**Inappropriate Applications:**
- Safety-critical decisions (medical diagnosis, legal advice, financial recommendations)
- Real-time applications requiring immediate responses
- Situations requiring guaranteed factual accuracy without verification

## Bias, Risks, and Limitations

### Known Limitations

- **Verbosity**: Intermediate thinking blocks make responses noticeably longer than single-pass outputs
- **Processing Time**: Multiple thinking phases may increase generation time
- **Prototype Status**: Experimental features may require refinement
- **Context Usage**: Thinking blocks consume additional context tokens
- **Inherited Biases**: May reflect biases from base model and training data

### Potential Risks

- **Reasoning Loops**: Potential for circular reasoning in complex scenarios
- **Multilingual Inconsistencies**: Potential variation in reasoning patterns across different languages
- **Emotional Reasoning Gaps**: SER blocks may not always align with content

## How to Get Started with the Model

### For General Users

You can interact with Dhanishtha-2.0 through:
- **HelpingAI**: https://helpingai.co/chat
- **Gradio Demo**: [Dhanishtha-2.0-preview](https://huggingface.co/spaces/Abhaykoul/Dhanishtha-2.0-preview)
- **API Integration**: [Dashboard](https://helpingai.co/dashboard)

### For Developers - Basic Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "HelpingAI/Dhanishtha-2.0-preview"

# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)

# Prepare input for intermediate thinking
prompt = "How many letter 'r' are in the words 'strawberry' and 'raspberry'?"
messages = [
    {"role": "user", "content": prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Generate response with intermediate thinking
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048,
    temperature=0.7,
    top_p=0.9,
    do_sample=True
)

response = tokenizer.decode(
    generated_ids[0][len(model_inputs.input_ids[0]):], 
    skip_special_tokens=True
)
print(response)
```
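
Because intermediate thinking produces long, multi-phase outputs, it is often helpful to stream tokens and watch the `<think>` blocks appear in real time. A minimal sketch using `transformers.TextStreamer`, reusing the `model`, `tokenizer`, and `model_inputs` objects from the example above:

```python
from transformers import TextStreamer

# Stream the response token-by-token so the thinking phases are visible as they unfold.
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

model.generate(
    **model_inputs,
    max_new_tokens=2048,
    temperature=0.7,
    top_p=0.9,
    do_sample=True,
    streamer=streamer,  # tokens are printed to stdout as they are produced
)
```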

### Optimal Generation Parameters

```python
generation_config = {
    "temperature": 0.7,        # Balanced creativity and coherence
    "top_p": 0.9,             # Nucleus sampling
    "top_k": 40,              # Top-k filtering
    "max_new_tokens": 2048,   # Allow for thinking blocks
    "do_sample": True,        # Enable sampling
    "repetition_penalty": 1.1  # Reduce repetition
}
```
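
These settings can be reused for any prompt by unpacking the dictionary directly into `generate`, for example:

```python
# Apply the recommended settings to the inputs prepared in the basic usage example.
generated_ids = model.generate(**model_inputs, **generation_config)
```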

## Training Details

### Training Data

Dhanishtha-2.0 was trained on a carefully curated dataset focusing on:
- Complex reasoning scenarios requiring multi-step thinking
- Self-correction examples and reasoning chains
- Emotional reasoning and empathy training data
- Structured thinking pattern examples

### Training Procedure

#### Training Stages
1. **Continuous Pretraining**: Extended training on reasoning-focused corpora
2. **Advanced Reasoning Fine-tuning**: Specialized training on intermediate thinking patterns
3. **Multilingual Alignment**: Cross-language reasoning consistency training
4. **SER Integration**: Structured Emotional Reasoning capability training

**Training Infrastructure:**
- **Duration**: 16.3 days
- **Hardware**: 8x NVIDIA H100 GPUs
- **Model Scale**: 14.8B parameters

## Evaluation

Evaluation was conducted on:
- **Standard Benchmarks**: MMLU, HumanEval, ARC, HellaSwag, TruthfulQA
- **Mathematical Reasoning**: Math 500, AIME 2024, GSM8K
- **Custom Evaluations**: Intermediate thinking quality, self-correction capabilities
- **Multilingual Tasks**: Reasoning consistency across 39+ languages
- **Specialized Tests**: Emotional reasoning, complex problem-solving scenarios



## Example: Intermediate Thinking in Action

**User:** "Who is your favorite person?"

**Dhanishtha-2.0:**

```
<think>
This is an interesting question about my "favorite human." I need to be thoughtful here. 
As Dhanishtha, I should respond in a human-like way while being honest about my nature. 
I can share what kinds of humans I find most interesting or inspiring based on my 
understanding of human nature and values.
</think>

<ser>
Emotion ==> curiosity, openness, desire for connection  
Cause ==> user asking a personal question about preferences  
Mind ==> seeking to understand my perspective or values  
Growth ==> opportunity to share thoughtful reflections on human qualities I admire  
</ser>

I don't really have a "favorite human" in the traditional sense, as I don't experience 
personal preferences the way humans do. But if I had to reflect on what qualities I find 
most admirable in people, I'd be drawn to those who demonstrate genuine compassion, 
intellectual curiosity, and the courage to grow.

<think>
Let me think more specifically about what kind of humans I find most compelling based on 
the values that seem to resonate most with humans - kindness, wisdom, creativity, 
resilience, and authenticity.
</think>

People like Mahatma Gandhi, who showed how quiet courage and unwavering principles can 
transform societies; artists like Rabindranath Tagore, who bridged cultural divides 
through creativity; and everyday heroes I've learned about who made profound differences 
through simple acts of kindness.

What fascinates me most are humans who maintain their humanity even in difficult 
circumstances - those who choose empathy when it would be easier to be harsh, who seek 
understanding rather than just being right.

I'm curious though - who's your favorite human, and why? What qualities or actions make 
someone special in your eyes?
```
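
If your application should show only the final answer, the reasoning and emotional-context blocks can be stripped out after generation. A minimal post-processing sketch, assuming the blocks appear literally in the decoded text (as in the `response` variable from the basic usage example):

```python
import re

def split_dhanishtha_response(text: str):
    """Separate <think>/<ser> blocks from the user-visible answer."""
    think_blocks = re.findall(r"<think>(.*?)</think>", text, flags=re.DOTALL)
    ser_blocks = re.findall(r"<ser>(.*?)</ser>", text, flags=re.DOTALL)
    # Everything outside the blocks is the text an end user should see.
    visible = re.sub(r"<(think|ser)>.*?</\1>", "", text, flags=re.DOTALL)
    visible = re.sub(r"\n{3,}", "\n\n", visible).strip()  # tidy leftover blank lines
    return visible, think_blocks, ser_blocks

answer, thoughts, ser = split_dhanishtha_response(response)
print(answer)                                       # final user-facing text
print(len(thoughts), "thinking phase(s) captured")
```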

## Environmental Impact

Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute).

- **Hardware Type:** H100 GPUs
- **Days Used:** 16.2
- **Cloud Provider:** Various
- **Compute Region:** Multiple
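
As a rough illustration of how such an estimate is formed (emissions ≈ GPU power × GPU-hours × grid carbon intensity), the sketch below uses the hardware figures above together with an assumed ~0.7 kW average board power per H100 and an assumed generic grid intensity of ~0.4 kgCO₂eq/kWh; these two values are assumptions, not measurements.

```python
# Illustrative estimate only: gpu_power_kw and grid_kgco2_per_kwh are assumed values.
num_gpus = 8
days = 16.2
gpu_power_kw = 0.7          # assumed average board power per H100
grid_kgco2_per_kwh = 0.4    # assumed grid carbon intensity

gpu_hours = num_gpus * days * 24
energy_kwh = gpu_hours * gpu_power_kw
emissions_kg = energy_kwh * grid_kgco2_per_kwh
print(f"{gpu_hours:.0f} GPU-hours, ~{energy_kwh:.0f} kWh, ~{emissions_kg:.0f} kgCO2eq")
```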

## Citation

### BibTeX

```bibtex
@misc{dhanishtha2025,
  title={Dhanishtha-2.0: World's First Intermediate Thinking AI Model},
  author={HelpingAI Team},
  year={2025},
  publisher={HuggingFace},
  url={https://huggingface.co/HelpingAI/Dhanishtha-2.0},
  note={First model with intermediate thinking capabilities}
}
```

### APA

HelpingAI Team. (2025). *Dhanishtha-2.0: World's First Intermediate Thinking AI Model*. HuggingFace. https://huggingface.co/HelpingAI/Dhanishtha-2.0

## Glossary

- **Intermediate Thinking**: The ability to pause and think multiple times during response generation
- **SER (Structured Emotional Reasoning)**: Framework for incorporating emotional context in responses
- **Think Blocks**: `<think>...</think>` segments where the model shows its reasoning process
- **Self-Correction**: Ability to identify and fix reasoning errors during generation
- **Code-Switching**: Natural transitions between languages (for example, English and Hindi) within a single response

## More Information

### Research Applications
- Study of AI reasoning transparency
- Self-correction mechanism research  
- Bilingual cognitive modeling
- Emotional AI development

### Development Roadmap
- Performance optimizations
- Additional language support
- Enhanced thinking pattern recognition
- Production-ready deployment tools

## Model Card Authors

- **Primary Author**: HelpingAI Team
- **Technical Lead**: [To be specified]
- **Research Contributors**: [To be specified]

## Model Card Contact

For questions about Dhanishtha-2.0, please contact:

- **HuggingFace**: [@HelpingAI](https://huggingface.co/HelpingAI)
- **Issues**: [Model Repository Issues](https://huggingface.co/HelpingAI/Dhanishtha-2.0/discussions)

## Benchmark

***These benchmarks were run by [@AarifKhan](https://huggingface.co/Aarifkhan).***

| Benchmark | Score | Evaluation Type | Notes |
|-----------|-------|-----------------|-------|
| MMLU | 78.1% | 1-shot | Massive Multitask Language Understanding |
| HumanEval | 75.0% | 1-shot | Code generation and completion |
| ARC | 76.0% | 1-shot | Abstract reasoning challenge |
| HellaSwag   | 81.0% | 1-shot | Commonsense natural language inference |
| TruthfulQA MC1 | 75.0% | 1-shot | Truthfulness in question answering |
| Math 500 | 95.68% | few-shot | Mathematical problem solving |
| AIME 2024 | 82.81% | few-shot | American Invitational Mathematics Examination |

**Dhanishtha-2.0 represents a new paradigm in AI reasoning - where thinking isn't just a prelude to response, but an integral, iterative part of the conversation itself.**

*Developed with ❤️ by HelpingAI*