Wednesday, April 15, 2015

The confusing vocabulary of public health and bioinformatics

In the very beginning of our Listeria project, we basically had an idea that we could use evolution to help in epidemiological investigations.  The idea was out there in the world, and so I can't claim credit, but no one yet had the resources to sequence all Listeria monocytogenes genomes in a given area before HHS backed the project.  I would love to write more about this project actually but I digress.

The point here is that we had a vocabulary problem.  In the beginning in the CDC lab (I can't say what all was happening in the other agencies), we had some of the best epidemiologists and some of the best Listeria microbiology experts in the room, and also one bioinformatician (me).   Thus, three different disciplines, one important project, and one huge vocabulary problem.  I tried collecting all the terms that everyone had to learn in the room but that started turning into epi/micro/bioinformatics notes and became too huge.

Instead, I present here a list of confounding vocabulary terms that we all came into the room thinking we knew but instead had different meanings for different people.  I also present my own suggested disambiguation for these terms.


termBioinformaticsMicrobiologyEpidemiologySuggested disambiguation
ClusterA group on a dendrogram
OR: a set of computer nodes that work together in an environment to facilitate parallelized analysis
A group of related illnessesUse a prefix: phylo-cluster, epi-cluster, computer cluster
TracebackThe act of finding the last common ancestor for a group of taxa on a treeThe act of finding the common source of a group of casesI feel like this is just a buzzword and can be replaced. Bioinformatics: state that you are finding the last common ancestor. Public health: state that you are finding the common source.
RelatedWhen two genomes or genes are genetically similarWhen two organisms are similar using some typing scheme (e.g., serogrouping, PFGE, etc)When two cases of infected individuals are similar (e.g., by the food they ate, by their geography, etc)If it's unclear, I prefer to use "phylo-related" or "epi-related" or something similar
Coding (via homologous)Computer program Where a gene gets turned into a functional unitA way to categorize public health events such as outbreaksContext. Use it in complete sentences!
MappingAssigning sequence reads to where they match on a reference genomegeographic information systemAgain, context
(phonetic ambiguity) GUI or gooey A graphical user interfaceSomething a little bit grossContext is probably your friend here
Are there any other terms to bring up?  Please list them here in the comments!

No comments:

Post a Comment