LLMs and RAG Pipeline #47


Closed
Tracked by #43
Solobrad opened this issue Sep 30, 2024 · 34 comments
Comments

@Solobrad
Contributor

Solobrad commented Sep 30, 2024

Modern LLMs like Llama seem to outperform traditional RAG methods on long-context tasks, demonstrating improved context handling and understanding, which may lead to reconsidering the need for RAG in many scenarios.

However, I recently came across something called SRF-RAG, which offers several key benefits.

Retrieval: Retrieves relevant context from external sources.
Generation: Produces coherent responses based on retrieved context.
Instruction Tuning: Improves understanding of complex queries.
Hallucination Reduction: Minimizes incorrect or misleading information.
Multi-Hop Reasoning: Handles complex questions by synthesising information from multiple sources.

I think we could use LangChain to set up a pipeline that figures out whether an input needs a RAG-based approach or can be handled directly by the LLM.

Call for Contributions #43
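As a rough illustration of the routing idea above, here is a minimal plain-Python sketch. The keyword and length cues are purely hypothetical stand-ins; a real implementation might use a LangChain router chain or a dedicated LLM call to make the decision.

```python
# Hypothetical routing sketch (plain Python). A real implementation might use
# a LangChain router chain or an LLM call to make this decision; the keyword
# and length cues below are illustrative stand-ins only.

KNOWLEDGE_CUES = {"latest", "source", "document", "according", "cite", "when", "who"}

def needs_retrieval(query: str) -> bool:
    """Guess whether the query needs external context (i.e. the RAG path)."""
    words = [w.strip("?.,!").lower() for w in query.split()]
    # Long, specific questions or ones asking for sourced facts go to RAG.
    return len(words) > 12 or any(w in KNOWLEDGE_CUES for w in words)

def route(query: str) -> str:
    return "rag" if needs_retrieval(query) else "llm"

print(route("Hi, how are you?"))                                # → llm
print(route("According to the docs, who maintains the repo?"))  # → rag
```

Either branch then receives the query: the "llm" branch calls the model directly, while the "rag" branch retrieves context first.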

@ariG23498
Collaborator

Hey @Solobrad

Thanks for taking this bit up!

For the time being would you be interested in building a quick RAG pipeline with the Llama family of models? Once that is done, we could look into SRF-RAG as an enhancement.

The suggestion is based on the fact that this repository (huggingface-llama-recipes) is built with the idea of helping anyone get started quickly.

Please let me know how you feel about my suggestion. Also, feel free to ask any questions you have.

@Solobrad
Contributor Author

Hi @ariG23498

I'm in to collaborate with other teams on this repo. Thanks for the opportunity!

@ariG23498
Collaborator

That would be great!

But would you be open to implementing a very simple (here simple is the keyword) RAG pipeline in the first place?

If you are fine with that, I can redirect other contributors to this issue so that you can collaborate with them on this.

@Solobrad
Contributor Author

Solobrad commented Sep 30, 2024

Yup count me in

This was referenced Sep 30, 2024
@Purity-E

Hi @ariG23498 ,
Thanks for redirecting me to this issue. @Solobrad I'm looking forward to collaborating with you.

@Solobrad
Contributor Author

Same here, nice meeting you @Purity-E !

@Solobrad
Contributor Author

@Purity-E, we are required to start simple, so I've added a very basic pipeline code. If the PR is accepted, we can work on enhancements later.
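For readers following along, the retrieve-then-generate flow of a very basic pipeline can be sketched without any libraries. The word-overlap retriever and prompt assembly below are illustrative stand-ins only; the actual notebook uses real embeddings and a Llama model.

```python
# Library-free sketch of the retrieve-then-generate flow. The actual notebook
# uses a real embedding model and a Llama generator; here retrieval is simple
# word-overlap scoring and "generation" is just prompt assembly.

DOCS = [
    "Hugging Face hosts models, datasets, and demos for the ML community.",
    "RAG retrieves relevant context before the language model answers.",
    "Llama is a family of open large language models released by Meta.",
]

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Rank documents by word overlap with the query and return the top k."""
    q = set(query.lower().split())
    ranked = sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)
    return ranked[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    """Stuff the retrieved context into a prompt for the LLM."""
    context = "\n".join(retrieve(query, docs))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

print(build_prompt("What does RAG retrieve?", DOCS))
```

The assembled prompt would then be passed to the generator; swapping the toy retriever for an embedding-based vector store is the natural enhancement.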

@Solobrad Solobrad changed the title Proposal for Call for Contributions #43: LLMs and RAG LLMs and RAG Pipeline Sep 30, 2024
@Solobrad
Contributor Author

@ariG23498 I see a few issues similar to mine; should we bring them over here to brainstorm enhancements?

@Purity-E

@Purity-E, we are required to start simple so I've added a very basic pipeline code. If the PR is accepted, we can work on enhancements later

Cool. Thanks for the update.

@ariG23498
Collaborator

@ariG23498 I see a few issues same as mine, should we bring them over here? Brainstorming on the enhancements.

Feel free to. Having said that, we are not really looking for a very complicated project with RAG. It should be enough to get anyone started with RAG using Llama.

This was referenced Oct 1, 2024
@atharv-jiwane
Contributor

Hey @ariG23498, thanks for redirecting me here.
Hi @Solobrad, looking forward to working with you!

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

Hi @atharv-jiwane, welcome to the team! I like your idea; image retrieval can be more effective, especially with PDFs.

@atharv-jiwane
Contributor

Hey! This is my first time contributing to an open source project, so I am really excited! I saw the PR created for this issue and wanted to discuss how we are going to build on the initial commit. I also saw @ariG23498's comments on the PR and wanted to take that up. Let me know how you want to divide/distribute the work.

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

Sure man, which part would you like to work on? I'm thinking of adding you both to my forked repo as collaborators so we can divide up the work and get started. Let me know if that works for you. @Purity-E @atharv-jiwane

@Purity-E

Purity-E commented Oct 1, 2024

@Solobrad sure that's okay

@atharv-jiwane
Contributor

@Solobrad Yup sounds good! I could take up the embedding part

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

Cool, we'll be working on the "llama-rag" branch then.

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

I'll check on the dataset.

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

I've added a transcript dataset @atharv-jiwane, it's clean and pretty straightforward. You can try embedding it. Thanks

@atharv-jiwane
Contributor

I have tried embedding the dataset. I am not sure I committed the changes properly; @Solobrad, could you please guide me?

@Solobrad
Contributor Author

Solobrad commented Oct 1, 2024

Hey @atharv-jiwane, I saw an error about LLaMa access; try filling in the access form at https://huggingface.co/meta-llama/Llama-3.1-8B. Even though LLaMa is an openly released LLM, we normally have to submit an access application before we can use it from Hugging Face or Kaggle. Does this answer your question?

I changed the code a little because the variable name you used for the SentenceTransformer was overwriting the previous LLaMa model. Go ahead and check it out. Hope this helps.

@atharv-jiwane
Contributor

Hey @Solobrad, I've reviewed your changes, and I have filled in the access form for using LLaMa. Thank you for the information on that. I think it takes some time for the request to be reviewed.

Meanwhile, I think the only changes I have made are in the embeddings section right after the dataset is imported, so could you please commit an error-free version of the code to the "llama-rag" branch?

@atharv-jiwane
Contributor

atharv-jiwane commented Oct 1, 2024

Hey @Solobrad , I have added an LLM pipeline in the latest commit and fixed the earlier auth issues with LLaMa models. I tried to run the query but it took too long to generate a response. Could you please guide me as to where I am going wrong?

Also, the earlier version of the embeddings that I wrote could instead just be done when we create the vector store, right?

@Solobrad
Contributor Author

Solobrad commented Oct 2, 2024

Yup @atharv-jiwane , just create the vector store and let it handle embedding the documents. You don't need to separately encode them. If this was what you were asking.

I'll try checking on the prolonged response time.
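That point can be sketched with a minimal in-memory store: the store owns the embedding step, so callers add and query with raw text and never encode separately. The toy bag-of-words `embed()` below is a hypothetical stand-in for a real SentenceTransformer model.

```python
import math
from collections import Counter

# Sketch of the point above: the vector store owns the embedding step, so
# callers add raw text and search with raw text -- no separate encoding pass.
# embed() is a toy bag-of-words vectorizer standing in for a real model.

def embed(text: str) -> Counter:
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class VectorStore:
    def __init__(self) -> None:
        self.items: list[tuple[str, Counter]] = []

    def add(self, text: str) -> None:
        self.items.append((text, embed(text)))  # embedding happens here

    def search(self, query: str, k: int = 1) -> list[str]:
        qv = embed(query)
        ranked = sorted(self.items, key=lambda it: cosine(qv, it[1]), reverse=True)
        return [text for text, _ in ranked[:k]]

store = VectorStore()
store.add("Llama models require an access request on Hugging Face.")
store.add("The dataset is a clean transcript corpus.")
print(store.search("How do I get Llama access?"))
# → ['Llama models require an access request on Hugging Face.']
```

Real vector stores (e.g. the one used with LangChain in the notebook) follow the same contract: documents go in as text, the embedding model is plugged in once at construction time.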

@atharv-jiwane
Contributor

Cool, so @Solobrad, let's do away with the separate encodings? Also, can we add GPU support? I am running this locally on a MacBook Air M2, so I think GPU support would be nice.

Also, pertaining to the response time: when I first passed the query ("What is Hugging Face.") to the LLM, an error was generated saying the generation exceeded the max_new_tokens limit of 20. This might also be causing an issue.

@Solobrad
Contributor Author

Solobrad commented Oct 3, 2024

Hi, I solved the max token problem, and I attribute the "long" response time to running the model locally. I've tried using the API directly and also a smaller model.

@Purity-E @atharv-jiwane, I've pushed the latest runnable code, go on and have a try.

@atharv-jiwane
Contributor

@Solobrad Thanks for the update! I'll have a go soon; I'm a bit busy at the moment.

@Solobrad
Contributor Author

Solobrad commented Oct 4, 2024

@sinatayebati will be joining us.

@Solobrad
Contributor Author

Solobrad commented Oct 5, 2024

Hey everyone, I suggest adding some compelling markdown so users can easily read what's going on (as mentioned before), making it read like a simple demo or tutorial. What's your take on this? @sinatayebati @atharv-jiwane @Purity-E

PS: I added the LLaMa back

@sinatayebati
Contributor

@Solobrad Hey Nicholas. Thanks for the latest commits. In my opinion, this latest notebook should be very close to what the HF team has in mind. I also just pushed two minor updates:

  • added a line at the beginning of the notebook to pip install the required libraries
  • updated the README with a section pointing to this notebook.

@Solobrad
Contributor Author

Solobrad commented Oct 5, 2024

Awesome, thanks!

@atharv-jiwane
Contributor

Hey @Solobrad! I think the latest commit looks good. We should consult the maintainers and ask for their opinion on this.

@Solobrad
Contributor Author

Solobrad commented Oct 7, 2024

I've updated the code according to the latest requirements, guys @Purity-E @atharv-jiwane. Feel free to add any markdown or so. You should use Google Colab if you want to run the code.

ariG23498 pushed a commit that referenced this issue Oct 7, 2024
* Adding a simple LLaMa-RAG pipeline

* Adding datasets from Hugging Face

* Update llama_rag_pipeline.ipynb

* changing model names and removing unused components

* Added llm pipeline and fixed prev LLaMa issue

* Access LLaMa by API

* left out files

* removing fles

* working version of simple RAG LLaMa

* Cleaning up comments and unused components

* Modifying the prompt

* Commenting out chat template code

* adding chat template

* removing files

* simple RAG

* adding LLaMa

* changing system prompt

* pip install reuqired libraries + readme update

* Adding markdowns

* removing outputs

* updated pip install + resolving conflict in readme

* bringing back the readme note after conflict reolsved

---------

Co-authored-by: atharv-jiwane <[email protected]>
Co-authored-by: sinatayebati <[email protected]>
@ariG23498
Collaborator

Closing this issue as the PR has been merged! Thanks for the great contribution.
