AI & ML interests

Biological data generation and modeling

Recent Activity

sritter-ginkgobioworksΒ  updated a dataset 4 days ago
ginkgo-datapoints/GDPa1
maschhoff-ginkgoΒ  published a dataset about 1 month ago
ginkgo-datapoints/GDPx4
maschhoff-ginkgoΒ  updated a dataset about 1 month ago
ginkgo-datapoints/GDPx4
View all activity

Articles

cgeorgiawΒ 
posted an update 5 months ago
freddyaboultonΒ 
posted an update 7 months ago
cgeorgiawΒ 
posted an update 7 months ago
view post
Post
6096
πŸš€πŸš€πŸš€ The largest ever dataset of co-folded 3D protein-ligand structures just dropped on HF!!

Meet SAIR (Structurally Augmented ICβ‚…β‚€ Repository): 5M+ AI-generated complexes with experimentally measured drug potency data from SandboxAQ. πŸš€πŸš€πŸš€

Check it out and explore here: SandboxAQ/SAIR

  • 3 replies
Β·
cgeorgiawΒ 
posted an update 8 months ago
freddyaboultonΒ 
posted an update 10 months ago
cgeorgiawΒ 
posted an update 10 months ago
freddyaboultonΒ 
posted an update 10 months ago
view post
Post
784
Time is running out! ⏰

Less than 24 hours to participate in the MCP Hackathon and win thousands of dollars in prizes! Don't miss this opportunity to showcase your skills.

Visit Agents-MCP-Hackathon/AI-Marketing-Content-Creator to register!

cgeorgiawΒ 
posted an update 10 months ago
view post
Post
1634
Snooping on HF is the best because sometimes you just discover that someone (in this case, Earth Species Project) is about to drop terabytes of sick (high quality animal sounds) data...

EarthSpeciesProject/NatureLM-audio-training
freddyaboultonΒ 
posted an update 10 months ago
view post
Post
573
🚨 NotebookLM Dethroned?! 🚨

Meet Fluxions vui: The new open-source dialogue generation model.
🀯 100M Params, 40k hours audio!
πŸŽ™οΈ Multi-speaker audio
πŸ˜‚ Non-speech sounds (like [laughs]!)
πŸ“œ MIT License

Is this the future of content creation? Watch the video and decide for yourself!

https://huggingface.co/spaces/fluxions/vui-spacehttps://huggingface.co/fluxions/vui
  • 1 reply
Β·
cgeorgiawΒ 
posted an update 11 months ago
view post
Post
560
Just dropped two bigger physics datasets (both on photonics)!

NUMBA 1: SIB-CL
This dataset of Surrogate- and Invariance-Boosted Contrastive Learning (SIB-CL) datasets for two scientific problems:
- PhC2D: 2D photonic crystal density-of-states (DOS) and bandstructure data.
- TISE: 3D time-independent SchrΓΆdinger equation eigenvalue and eigenvector solutions.

NUMBA2: 2D Photonic Topology
Symmetry-driven analysis of 2D photonic crystals: 10k random unit cells across 11 symmetries, 2 polarizations, 5 contrasts. Includes time-reversal breaking cases for 4 symmetries at high contrast.

Check them out: cgeorgiaw/sib-cl & cgeorgiaw/2d-photonic-topology