-
Notifications
You must be signed in to change notification settings - Fork 144
GazetteerViewGenerator loads no gazetteers; deprecate in favor of SimpleGazetteerAnnotator? #146
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hey @mssammon, |
@shyamupa good to know, thanks. I think that ideally the option to use filters should be implemented in the new annotator -- I'm presuming this is non-trivial additional work but doable. Sound OK? |
The SimpleGazetteerAnnotator is highly optimized. It can do phrases of any length without impacting performance. The GazViewGen is based on the older methodology which is less efficient, but requires less memory. It is a simple hash set of all the phrased from the gaz, but for multi-token hits it is not very efficient. There are lots of cool features in GazViewGen. I suggest, if they are useful features, we should move functionality over to the SimpleGazetteerAnnotator, and deprecate the older GazViewGenerator. |
@shyamupa please track any usages of the old gazetteer view generator within illinois-cogcomp-nlp. (I assume none, but just in case...). What is the cost of migrating the GazViewGen features to the SimpleGazetteerGenerator? |
@mssammon So the class is not used outside of |
Should be taken care of by #244. Will close this issue once merged. |
GazetteerViewGenerator and it is trying to load gazetteer files from resources/gazetteers/gazetteers, but this doesn't exist. Instead, I think clients should use SimpleGazetteerAnnotator, which loads resources from illinois-common-resources (note that these are organized differently from the way expected by GazetteerViewGenerator).
The text was updated successfully, but these errors were encountered: