Any additional suggestion about performance? #6
Comments
What performance issues are you having?
I have an Azure DTU-based database with 200 DTUs. This is my typical scenario. The query takes: CPU time = 2235 ms, elapsed time = 1133 ms.
So 1133 ms is unacceptable performance? This is probably what I am getting, but in my applications it has not been an issue. I always have 'other processes' going on, like my calls to the LLM, that have their own delays before a result is returned to the end user.
I would like at least double the performance. My concern is related to storage and the number of users connected to the database. 25,000 articles is a good amount of data for my use case, but I can't rule out that the number will grow, and I'm assuming that performance is inversely proportional to the number of articles.
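To make the scaling assumption above concrete, here is a minimal sketch of a linear extrapolation. It assumes that the query scans every stored vector, so elapsed time grows in proportion to the row count, and that the 1133 ms figure was measured against roughly 25,000 articles, which the thread does not state explicitly. The function name `estimate_elapsed_ms` is hypothetical, and real timings would also depend on the DTU tier, caching, and concurrent users.

```python
def estimate_elapsed_ms(baseline_ms: float, baseline_rows: int, target_rows: int) -> float:
    """Linearly extrapolate query time, assuming a full scan over all rows.

    This is an illustrative model only, not a measurement.
    """
    if baseline_rows <= 0:
        raise ValueError("baseline_rows must be positive")
    return baseline_ms * target_rows / baseline_rows


if __name__ == "__main__":
    # Assumed baseline: 1133 ms elapsed for about 25,000 articles.
    for rows in (25_000, 50_000, 100_000):
        estimate = estimate_elapsed_ms(1133, 25_000, rows)
        print(f"{rows} rows: about {estimate:.0f} ms")
```

Under this model, doubling the number of articles doubles the elapsed time, so getting "at least double the performance" at the current size would only offset one doubling of the data.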
I am considering expanding the use of SQL Server for my production application, which is a RAG platform with about 80,000 pages of data. Our application's data needs are growing, and we're exploring options to increase our storage capacity significantly. Given that SQL Server is our favorite DB, I'm interested in exploring it as the layer that stores the vector embeddings of our data. Did you have success in optimizing performance? If you decided not to use SQL Server for storing vectors, I would love to learn why. Thank you.
Yes, I'm still using SQL Server, and I have done a lot of testing and made many improvements.
Do you have any additional suggestions on how to optimize performance apart from what you wrote in the article?