reasoning on / off ?
#5
by
TahirC
- opened
can we control the thinking to n tokens or turn off if needed ?
Not yet. We plan to release a 'Flash' version that matches the latency of the instruct model while retaining current capabilities.
TahirC
changed discussion status to
closed