view article Article Universal Image Segmentation with Mask2Former and OneFormer By nielsr and 2 others β’ Jan 19, 2023 β’ 14
view article Article The State of Computer Vision at Hugging Face π€ By sayakpaul β’ Jan 30, 2023 β’ 8
view article Article π€ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware By smangrul and 1 other β’ Feb 10, 2023 β’ 94
view article Article Hugging Face and AWS partner to make AI more accessible By jeffboudier and 2 others β’ Feb 21, 2023 β’ 3
view article Article New ViT and ALIGN Models From Kakao Brain By adirik and 3 others β’ Mar 6, 2023 β’ 4
view article Article Creating Privacy Preserving AI with Substra By EazyAl and 3 others β’ Apr 12, 2023 β’ 2
view article Article Accelerating Hugging Face Transformers with AWS Inferentia2 By philschmid and 1 other β’ Apr 17, 2023 β’ 1
view article Article Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2 By regisss and 1 other β’ Jun 29, 2023 β’ 3
view article Article Practical 3D Asset Generation: A Step-by-Step Guide By dylanebert β’ Aug 1, 2023 β’ 9
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model By VictorSanh and 10 others β’ Aug 22, 2023 β’ 36
view article Article Object Detection Leaderboard By rafaelpadilla and 1 other β’ Sep 18, 2023 β’ 19
view article Article π€ PEFT welcomes new merging methods By smangrul and 1 other β’ Feb 19, 2024 β’ 22
view article Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset By HugoLaurencon and 2 others β’ Mar 15, 2024 β’ 11
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community By Leyo and 2 others β’ Apr 15, 2024 β’ 183
view article Article Docmatix - a huge dataset for Document Visual Question Answering By andito and 1 other β’ Jul 18, 2024 β’ 74
view article Article Visual Document Retrieval Goes Multilingual By marco and 1 other β’ Jan 10 β’ 75
view article Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other β’ 7 days ago β’ 34
view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others β’ 7 days ago β’ 29