### Model Description

EIRA‑0.2 is a multimodal model designed to answer free‑form questions about medical images (e.g., radiographs, histology slides) in conjunction with accompanying text. Internally, it uses:

- A **text encoder/decoder** based on **meta‑llama/Llama‑2‑7b‑hf**, fine‑tuned for medical QA.
- A **vision encoder** based on **Salesforce/blip-image-captioning-base** that extracts descriptive features from medical imagery.
- A **fusion module** that cross‑attends between vision features and text embeddings to generate coherent, context‑aware answers.
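The fusion step above can be sketched as a small PyTorch module in which text hidden states attend over projected vision features. This is a minimal illustration, not EIRA‑0.2's actual wiring: the hidden sizes (4096 for Llama‑2‑7B, 768 for the BLIP base encoder) and the head count are assumptions.

```python
import torch
import torch.nn as nn

class CrossAttentionFusion(nn.Module):
    """Sketch of vision-text fusion: text embeddings are the queries,
    projected vision features are the keys/values."""

    def __init__(self, text_dim=4096, vision_dim=768, num_heads=8):
        super().__init__()
        # Project BLIP-sized vision features into the Llama hidden size.
        self.vision_proj = nn.Linear(vision_dim, text_dim)
        self.cross_attn = nn.MultiheadAttention(
            text_dim, num_heads, batch_first=True
        )
        self.norm = nn.LayerNorm(text_dim)

    def forward(self, text_emb, vision_feats):
        v = self.vision_proj(vision_feats)
        # Text tokens cross-attend to image features.
        attended, _ = self.cross_attn(query=text_emb, key=v, value=v)
        # Residual connection preserves the original text signal.
        return self.norm(text_emb + attended)

fusion = CrossAttentionFusion()
text = torch.randn(1, 16, 4096)    # (batch, text tokens, Llama hidden size)
vision = torch.randn(1, 197, 768)  # (batch, image patches, BLIP hidden size)
out = fusion(text, vision)
print(out.shape)  # torch.Size([1, 16, 4096])
```

The fused states keep the text sequence length and hidden size, so they can be fed back into the language model's decoding stack unchanged.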

- **Developed by:** BockBharath