---
title: Bioclip 2 Demo
emoji: 😻
colorFrom: pink
colorTo: gray
sdk: gradio
sdk_version: 5.33.0
app_file: app.py
pinned: false
license: mit
models:
- imageomics/bioclip-2
datasets:
- imageomics/TreeOfLife-200M
description: >- 
  This space provides an interactive demo for running BioCLIP-2 inference. Additionally, one can run inference on multiple images using the pybioclip package.
tags:
  - visualization
  - data
  - samples
  - data-visualization 
  - exploration
  - biology
  - vision
  - CV
  - imageomics
  - clip
  - species-classification
  - biological visual task
  - multimodal
  - animals
  - species
  - taxonomy
  - rare species
  - endangered species
  - evolutionary biology
  - knowledge-guided
  - zero-shot-image-classification
---

# BioCLIP-2 Demo

This app is modified from the original [BioCLIP Demo](https://huggingface.co/spaces/imageomics/bioclip-demo) to run inference with [BioCLIP 2](https://huggingface.co/imageomics/bioclip-2); it uses [pybioclip](https://github.com/Imageomics/pybioclip).

Because of the Space's persistent storage limits, embeddings are fetched from the [TreeOfLife-200M repo](https://huggingface.co/datasets/imageomics/TreeOfLife-200M). As in the original demo, the sample images are retrieved from an S3 bucket, using the metadata file described below.

Note that if this space is duplicated, the sample image portion **will not work**.

**bioclip-2/metadata.parquet:** metadata file for fetching [TreeOfLife-200M](https://huggingface.co/datasets/imageomics/TreeOfLife-200M) sample images (up to 3 available per taxon) from an S3 bucket.
- `uuid`: unique identifier for the image within the TreeOfLife-200M dataset.
- `eol_page_id`: identifier of the EOL page for the most specific taxon of the image (where available). An image's association with a particular page ID may change with updates to the EOL (or image provider's) hierarchy, but the EOL taxon page IDs themselves are stable. "https://eol.org/pages/" + `eol_page_id` links to the page.
- `gbif_id`: GBIF identifier for the occurrence from which the image is sourced.
- `gbif_taxon_id`: identifier used by GBIF for the most specific taxon of the image (where available). "https://gbif.org/species/" + `gbif_taxon_id` links to the page.
- `kingdom`: kingdom to which the subject of the image belongs (all `Animalia`).
- `phylum`: phylum to which the subject of the image belongs.
- `class`: class to which the subject of the image belongs.
- `order`: order to which the subject of the image belongs.
- `family`: family to which the subject of the image belongs.
- `genus`: genus to which the subject of the image belongs.
- `species`: species to which the subject of the image belongs.
- `file_path`: image filepath to fetch image from S3 bucket (`<folder>/<uuid>.jpg`, folders are first two characters of the `uuid`).
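
The URL and path conventions in the column list above can be sketched in Python as follows. This is a minimal illustration, not code from the app: the helper names (`eol_page_url`, `gbif_taxon_url`, `s3_file_path`) are hypothetical, and it assumes missing IDs show up as `None` or `NaN` (as they typically would after loading the parquet file with pandas).

```python
import math
from typing import Optional

EOL_PAGE_BASE = "https://eol.org/pages/"
GBIF_SPECIES_BASE = "https://gbif.org/species/"


def _is_missing(value) -> bool:
    """Treat None and float NaN as 'ID not available' (an assumption)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def eol_page_url(eol_page_id) -> Optional[str]:
    """Build the EOL taxon page link: base URL + eol_page_id."""
    if _is_missing(eol_page_id):
        return None
    return f"{EOL_PAGE_BASE}{int(eol_page_id)}"


def gbif_taxon_url(gbif_taxon_id) -> Optional[str]:
    """Build the GBIF species page link: base URL + gbif_taxon_id."""
    if _is_missing(gbif_taxon_id):
        return None
    return f"{GBIF_SPECIES_BASE}{int(gbif_taxon_id)}"


def s3_file_path(uuid: str) -> str:
    """Reconstruct file_path as <folder>/<uuid>.jpg, where the folder
    is the first two characters of the uuid."""
    return f"{uuid[:2]}/{uuid}.jpg"
```

For a row with a made-up `uuid` of `ab12cd34`, `s3_file_path` would give `ab/ab12cd34.jpg`; the bucket name and base URL are not documented here, so they are left out of the sketch.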