TY - GEN
T1 - Jumping Finite Automata for Tweet Comprehension
AU - Obare, Stephen
AU - Ade-Ibijola, Abejide
AU - Okeyo, George
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - Every day, over one billion social media text messages are generated worldwide, which provides abundant information that can lead to improvements in lives of people through evidence-based decision making. Twitter is rich in such data but there are a number of technical challenges in comprehending tweets including ambiguity of the language used in tweets which is exacerbated in under resourced languages. This paper presents an approach based on Jumping Finite Automata for automatic comprehension of tweets. We construct a WordNet for the language of Kenya (WoLK) based on analysis of tweet structure, formalize the space of tweet variation and abstract the space on a Finite Automata. In addition, we present a software tool called Automata-Aided Tweet Comprehension (ATC) tool that takes raw tweets as input, preprocesses, recognise the syntax and extracts semantic information to 86% success rate.
AB - Every day, over one billion social media text messages are generated worldwide, which provides abundant information that can lead to improvements in lives of people through evidence-based decision making. Twitter is rich in such data but there are a number of technical challenges in comprehending tweets including ambiguity of the language used in tweets which is exacerbated in under resourced languages. This paper presents an approach based on Jumping Finite Automata for automatic comprehension of tweets. We construct a WordNet for the language of Kenya (WoLK) based on analysis of tweet structure, formalize the space of tweet variation and abstract the space on a Finite Automata. In addition, we present a software tool called Automata-Aided Tweet Comprehension (ATC) tool that takes raw tweets as input, preprocesses, recognise the syntax and extracts semantic information to 86% success rate.
KW - Jumping finite automata
KW - front-end compiler analysis
KW - preprocessing
KW - tweet comprehension
UR - http://www.scopus.com/inward/record.url?scp=85081954042&partnerID=8YFLogxK
U2 - 10.1109/IMITEC45504.2019.9015850
DO - 10.1109/IMITEC45504.2019.9015850
M3 - Conference contribution
AN - SCOPUS:85081954042
T3 - Proceedings - 2019 International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2019
BT - Proceedings - 2019 International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2019
Y2 - 21 November 2019 through 22 November 2019
ER -