-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MeiliSearch: Answers aren't found reliably #29
Comments
@heubeck Thanks for the feedback. Let me check first. Can you pull the latest answer:all-in-one image again and then resave the |
Thx, @LinkinStars, re-tested with the latest answer:all-in-one image, and also created a new meilisearch index. The issues persist, and
seems not to be limited to answers, also searching for the first word of a question doesn't find it. |
@heubeck Got it. |
hi, would you try to search directly in meilisearch's own admin page, see if it can be searched? |
thx @sivdead - didn't know about this UI ;) It shows the issue very well: |
maybe the search plugin should remove all html tags,use only pure words to index, or just use markdown? @LinkinStars does elasticsearch plugin have the same problem? |
There are some discussions and issues around html content in the meilisearch github project. are there libraries out there that reliable remove all formatting and styling elements from html content? |
Sorry for responding to this issue so late. We have been discussing a suitable solution to this problem. Here is the reason for this issue. Answer itself will give the content of the Q&A to the plugin to process, now the content that Answer gives to the plugin is parsed HTML format content. Therefore, after the split words are used to build the inverted index, the search engine will treat the tag as a keyword that can be searched for such as However, the problem is that Answer currently has only two types of data, one is markdown text and the other is html. If it is passed as markdown text, then it will result in a search for "#" which will return all the results. So, we need some other solutions to filter out HTML tags in Answer in a reasonable way, and to keep the blocks of code in markdown that the user would normally type in. This is a bit complicated to implement, and we are still discussing the solution. |
Thx for the feedback. |
found this for html: https://github.com/microcosm-cc/bluemonday |
Hey team,
thank you for providing us with the MeiliSearch plugin.
It's working well when searching for text or title of a question, but there are flaws in finding text in answers, haven't identified the pattern yet.
For instance, the answer in this question:

cannot be found:

where others can be:

Furthermore, it seems, that, when searching for the first word of an answer text, it can never be found.
No idea, how to provide you with more insights, the MeiliSearch log seems unspectacular (http 202 when creating/updating posts, http 200 on searches).
Running the latest
answer:all-in-one
image, and thegetmeili/meilisearch:v1.3.0
image.The text was updated successfully, but these errors were encountered: