Post
				
				
							413
					We create a dataset of 1 million, MIT-licensed synthetic humans, sampled from actual US demographics. You can use it to seed LLM synthetic data generation and create extremely diverse, statistically realistic outputs. 
Dataset: https://huggingface.co/datasets/skysight-inc/synthetic-humans-1m
Accompanying blog post with methodology: https://www.skysight.inc/blog/synthetic-humans
	
		
	Dataset: https://huggingface.co/datasets/skysight-inc/synthetic-humans-1m
Accompanying blog post with methodology: https://www.skysight.inc/blog/synthetic-humans


