Sindhu Hegde's picture

7 3

Sindhu Hegde

sindhuhegde

·

https://sindhu-hegde.github.io

Sindhu-Hegde

AI & ML interests

Computer Vision, Multimodal Learning: Vision + Speech/Language, Deep Learning, Machine Learning

Recent Activity

reacted to DmitryRyumin's post with 🔥 about 23 hours ago

🚀👌🌟 New Research Alert - ICCV 2025 (Oral)! 🌟🤌🚀 📄 Title: Understanding Co-speech Gestures in-the-wild 🔝 📝 Description: JEGAL is a tri-modal model that learns from gestures, speech and text simultaneously, enabling devices to interpret co-speech gestures in the wild. 👥 Authors: @sindhuhegde, K R Prajwal, Taein Kwon, and Andrew Zisserman 📅 Conference: ICCV, 19 – 23 Oct, 2025 | Honolulu, Hawai'i, USA 🇺🇸 📄 Paper: https://huggingface.co/papers/2503.22668 🌐 Web Page: https://www.robots.ox.ac.uk/~vgg/research/jegal 📁 Repository: https://github.com/Sindhu-Hegde/jegal 📺 Video: https://www.youtube.com/watch?v=TYFOLKfM-rM 🚀 ICCV-2023-25-Papers: https://github.com/DmitryRyumin/ICCV-2023-25-Papers 🚀 Added to the Human Modeling Section: https://github.com/DmitryRyumin/ICCV-2023-25-Papers/blob/main/sections/2025/main/human-modeling.md 📚 More Papers: more cutting-edge research presented at other conferences in the https://huggingface.co/spaces/DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin 🔍 Keywords: #CoSpeechGestures #GestureUnderstanding #TriModalRepresentation #MultimodalLearning #AI #ICCV2025 #ResearchHighlight

new activity 2 months ago

sindhuhegde/avs-spot:Update task categories to `video-text-to-text`

updated a dataset 2 months ago

sindhuhegde/avs-spot

View all activity

Organizations

None yet

Papers 2

arxiv:2503.22668

arxiv:2310.05304

spaces 2

Configuration error

Gestsync Sync Correction

Crop and align video with audio

Gesture Retrieval Exemplar Svm

models 0

None public yet

datasets 2

sindhuhegde/avs-spot

Viewer • Updated Aug 23 • 500 • 132 • 2

sindhuhegde/multivsr

Viewer • Updated Apr 27 • 8.94M • 242 • 4