using spacy for Hindi #10
Comments
As far as I can understand, this is the way to do it right now:

    from spacy.lang.hi.lex_attrs import norm
    from spacy.lang.hi.examples import sentences

    def stemming(texts):
        # Apply the Hindi norm function to each word
        texts_out = []
        for sent in texts:
            texts_out.append(norm(sent))
        return texts_out

    # Split the first example sentence on spaces and normalize each word
    print(stemming(sentences[0].split(' ')))

Am I correct?
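Hi @rohanrajpal, sorry I couldn't get to you sooner.

    from spacy.lang.hi import Hindi

    sentence = "पाठशाला मे अभी जलपान की छुट्टी हुई थी। "
    nlp = Hindi()
    doc = nlp(sentence)
    # Print the raw text, the normalized form, and the original form of each token
    for token in doc:
        print(token.text, token.norm_, token.orth_)

This prints the text, norm, and orth form of each token.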
Having said that, the stemmer for Hindi is still in development. To use it reliably, you will need to improve it and open PRs.
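For anyone who wants to experiment in the meantime, here is a minimal sketch of rule-based suffix stripping, the general idea behind lightweight stemmers of this kind. The `SUFFIXES` list, the `strip_suffix` name, and the `min_stem_len` parameter are all hypothetical and chosen only for illustration; this is not spaCy's rule set and it makes no claim about how spaCy implements `norm`.

```python
# Toy suffix-stripping stemmer for Hindi (illustrative only).
# SUFFIXES is a small hypothetical list, NOT spaCy's actual rule set.
SUFFIXES = ["ों", "ें", "ी", "े", "ा"]  # longer suffixes are tried first


def strip_suffix(word, min_stem_len=2):
    """Remove the first matching suffix, keeping at least min_stem_len characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    # No suffix matched, or the remaining stem would be too short
    return word


words = ["किताबों", "किताबें", "की"]
stems = [strip_suffix(w) for w in words]
print(stems)
```

The length guard is what keeps short function words such as "की" intact. A real stemmer would need a much larger, linguistically motivated suffix list, which is where improvements and PRs would come in.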
Thanks for the detailed reply!
Hey man, sorry to open an issue here, but I saw your commit on the spaCy repo.
I was trying to use spaCy to do some simple stemming on Hindi text. Could you please share some examples? I can't find anything on the internet.