Spaces:

brendon-ai
/

faq-huggingface-model

Sleeping

brendon-ai commited on Jul 4

Commit

2244fcd

verified ·

1 Parent(s): 665a356

Update src/RAGSample.py

Files changed (1) hide show

src/RAGSample.py CHANGED Viewed

@@ -378,7 +378,6 @@ def setup_rag_chain() -> Runnable:
         - For serious symptoms or concerns, always recommend consulting healthcare professionals
         - Keep responses concise (2-4 sentences maximum)
         - This information is for educational purposes only
         Question: {question}
         Documents: {documents}
 Answer:
@@ -389,13 +388,17 @@ Answer:
     # Initialize a local Hugging Face model
     hf_pipeline = pipeline(
             "text-generation",
-            model="distilgpt2",
             max_new_tokens=150,
             temperature=0.3,
             device_map="auto",
             return_full_text=False,
             truncation=True,
             do_sample=True,
         )
     # Wrap it in LangChain

         - For serious symptoms or concerns, always recommend consulting healthcare professionals
         - Keep responses concise (2-4 sentences maximum)
         - This information is for educational purposes only
         Question: {question}
         Documents: {documents}
 Answer:
     # Initialize a local Hugging Face model
     hf_pipeline = pipeline(
             "text-generation",
+            model="m42-health/Llama3-Med42-8B",
+            tokenizer="m42-health/Llama3-Med42-8B",
             max_new_tokens=150,
+            max_length=2048,      # Llama3 supports longer context
             temperature=0.3,
             device_map="auto",
             return_full_text=False,
             truncation=True,
             do_sample=True,
+            pad_token_id=128001,  # Llama3 pad token
+            eos_token_id=128009,  # Llama3 EOS token
         )
     # Wrap it in LangChain