pith. sign in

arxiv: 1802.09197 · v1 · pith:M2TLQNLQnew · submitted 2018-02-26 · 🧬 q-bio.QM · cs.LG· stat.ML

AI4AI: Quantitative Methods for Classifying Host Species from Avian Influenza DNA Sequence

classification 🧬 q-bio.QM cs.LGstat.ML
keywords influenzaclassificationhostlearningspeciesaccuracyavianbreakout
0
0 comments X
read the original abstract

Avian Influenza breakouts cause millions of dollars in damage each year globally, especially in Asian countries such as China and South Korea. The impact magnitude of a breakout directly correlates to time required to fully understand the influenza virus, particularly the interspecies pathogenicity. The procedure requires laboratory tests that require resources typically lacking in a breakout emergency. In this study, we propose new quantitative methods utilizing machine learning and deep learning to correctly classify host species given raw DNA sequence data of the influenza virus, and provide probabilities for each classification. The best deep learning models achieve top-1 classification accuracy of 47%, and top-3 classification accuracy of 82%, on a dataset of 11 host species classes.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.