Simple is powerful
Eric Lam | Voidful
UDIC LAB Member
Github : https://github.com/voidful
Email : [email protected]
Medium : https://firstname.lastname@example.org
LinkedIn : https://www.linkedin.com/in/voidful/
Twitter : https://twitter.com/voidful_stack
Facebook : https://www.facebook.com/voidful.nlp/
Natural Language Processing, Machine Learning, Crawler, Web Framework, Data Mining
Front-end(React), Adobe illustrator Design, Backend, Android Apps
🤖📇 Transformers kit - NLP library for different downstream tasks, built on huggingface project
🍳 NLPrep - download and pre-processing data for nlp tasks
🏃 hosting nlp models for demo purpose
Talk to a poster, it can answer related question using machine reading comprehension & Information retrieval.
Extract knowledge form medical record which have different expressions due to doctor’s expression.
Lack of cantonese corpus is most of problem.Now trying to solve it in transfer learning ways.
keep updating with the newest dataset and model
Trial of bert fineturing on sentence generating in different approach : generate one by one, generate one time ,generate from LSTM.
Open source Android app,it can let you hash your password before typing.
Python library that help to do text mining and preprocessing, with unit test and detail document
A different approach that can extract new phrase in short text.It use conditional probability different to PMI and Entropy. Compare to others, it have less limit on size of input corpus and less computation.
Using Fasttext train multi-classifier, select retrain sample form voting, entropy and clustering.So that we can use less labeling effort to get a high accuracy.
Mining from Wiki Dump data, getting plain text, synonym from redirect, translation from language link and relationship from category.
Collect corpus for nlp task, base on scrapy, crawling all text in 19 well-know website.