Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
51
1
Big Deeper
BigDeeper
Follow
21world's profile picture
1 follower
ยท
0 following
AI & ML interests
Differentiable hashing, orthonormal polynomial language modeling, image compression into language representations.
Recent Activity
new
activity
29 days ago
black-forest-labs/FLUX.1-Kontext-dev:
What are my options to run it on multiple GPUs?
new
activity
2 months ago
unsloth/medgemma-27b-text-it:
Says image-text to text
new
activity
2 months ago
nvidia/parakeet-tdt-0.6b-v2:
Does this model identifies speaker?
View all activity
Organizations
None yet
BigDeeper
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
black-forest-labs/FLUX.1-Kontext-dev
29 days ago
What are my options to run it on multiple GPUs?
#32 opened 29 days ago by
BigDeeper
New activity in
unsloth/medgemma-27b-text-it
2 months ago
Says image-text to text
#2 opened 2 months ago by
BigDeeper
New activity in
nvidia/parakeet-tdt-0.6b-v2
2 months ago
Does this model identifies speaker?
๐
1
8
#16 opened 3 months ago by
SouravAhmed
Is the model capable of splitting different speakers?
๐
1
1
#29 opened 2 months ago by
BigDeeper
liked
a model
4 months ago
deepseek-ai/DeepSeek-V3
Text Generation
โข
685B
โข
Updated
Mar 27
โข
502k
โข
โข
3.93k
New activity in
ByteDance/LatentSync
5 months ago
Very large RAM foot print.
4
#1 opened 6 months ago by
BigDeeper
New activity in
brittlewis12/s1-32B-GGUF
6 months ago
THE q8_0 version appears to go on and on indefinitely.
6
#1 opened 6 months ago by
BigDeeper
New activity in
ndkhanh95/LatentSync
6 months ago
Having a problem. Unable to find a suitable output format for 'video_out.mp4
#1 opened 6 months ago by
BigDeeper
New activity in
chunyu-li/LatentSync
6 months ago
Any ideas how to mitigate this problem?
#3 opened 6 months ago by
BigDeeper
New activity in
Lightricks/LTX-Video
8 months ago
Longer video?
6
#25 opened 8 months ago by
BigDeeper
What minimal VRAM does it require?
12
#18 opened 8 months ago by
DrNicefellow
New activity in
Qwen/Qwen2.5-Coder-32B-Instruct
8 months ago
VSCODE + Cline + Ollama + Qwen2.5-Coder-32B-Instruct.Q8_0
3
#20 opened 8 months ago by
BigDeeper
New activity in
black-forest-labs/FLUX.1-dev
12 months ago
comfyui does not recognize model files in sft format
๐
๐
4
5
#18 opened 12 months ago by
peidong
New activity in
bigscience/bloomz-3b
about 1 year ago
Are there advantages or disadvantages in changing the format for translation?
3
#10 opened about 1 year ago by
BigDeeper
New activity in
QuantFactory/Meta-Llama-3-120B-Instruct-GGUF
about 1 year ago
What does 120B really mean?
3
#1 opened about 1 year ago by
BigDeeper
New activity in
meta-llama/Meta-Llama-3-70B
about 1 year ago
Does anyone know which specific Python library contains the tokenizer that was used to train Llama-3-70b?
๐
1
2
#11 opened about 1 year ago by
BigDeeper
15 TeraTokens = 190 Million books
2
#4 opened over 1 year ago by
Languido
New activity in
meta-llama/Meta-Llama-3-8B
about 1 year ago
I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'
4
#117 opened about 1 year ago by
aniiikket11
New activity in
dphn/dolphin-2.9-llama3-8b-gguf
about 1 year ago
Has anyone tried this gguf with agentic framework?
3
#6 opened about 1 year ago by
BigDeeper
New activity in
microsoft/Phi-3-mini-128k-instruct
over 1 year ago
gguf
30
#24 opened over 1 year ago by
LaferriereJC
Load more