known words
currently learning
terms to unify
regex to split segments
Highlight top 70 % of unknown words
Text Input