Sindhu Hegde
sindhuhegde
1 follower · 1 following
https://sindhu-hegde.github.io
Sindhu-Hegde
AI & ML interests
Computer Vision, Multimodal Learning: Vision + Speech/Language, Deep Learning, Machine Learning
Recent Activity
reacted to DmitryRyumin's post with 🔥 6 days ago
New Research Alert - ICCV 2025 (Oral)!
Title: Understanding Co-speech Gestures in-the-wild
Description: JEGAL is a tri-modal model that learns from gestures, speech and text simultaneously, enabling devices to interpret co-speech gestures in the wild.
Authors: @sindhuhegde, K R Prajwal, Taein Kwon, and Andrew Zisserman
Conference: ICCV, 19 – 23 Oct, 2025 | Honolulu, Hawai'i, USA
Paper: https://huggingface.co/papers/2503.22668
Web Page: https://www.robots.ox.ac.uk/~vgg/research/jegal
Repository: https://github.com/Sindhu-Hegde/jegal
Video: https://www.youtube.com/watch?v=TYFOLKfM-rM
ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers
Added to the Human Modeling Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/human-modeling.md
More Papers: more cutting-edge research presented at other conferences in the https://huggingface.co/spaces/DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
Keywords: #CoSpeechGestures #GestureUnderstanding #TriModalRepresentation #MultimodalLearning #AI #ICCV2025 #ResearchHighlight
new activity 3 months ago in sindhuhegde/avs-spot: Update task categories to `video-text-to-text`
updated a dataset 3 months ago: sindhuhegde/avs-spot
Organizations
None yet
sindhuhegde's Spaces (2)
Configuration error
4
Gestsync Sync Correction
Crop and align video with audio
Runtime error
Gesture Retrieval Exemplar Svm