Karin Verspoor (FAIDH)

Karin Verspoor (FAIDH)

Melbourne, Victoria, Australia
4K followers 500 connections

About

I am a senior academic leader, with a strong desire to create inclusive, cohesive, and productive collaborative research and educational environments. I am also passionate about supporting #WomenInSTEM and diversity and inclusion initiatives.

I am a computer scientist by training, with an emphasis on artificial intelligence and cognitive science, particularly focused on language representation and processing in the biomedical domain. My research focuses on the use of computational techniques, particularly natural language processing, machine learning, text mining and knowledge integration, primarily in the context of biomedical data analysis.

I am a Fellow of the Australasian Institute of Digital Health. I have been applying my AI skills in the biomedical domain since 2003, addressing problems ranging from information extraction of biological concepts and events, to the use of text mining to support interpretation of biological data, to detection and prediction of disease from clinical data.

Activity

Join now to see all activity

Experience

  • RMIT University Graphic

    RMIT University

    Melbourne, Victoria, Australia

  • -

    Australia

  • -

    Melbourne, Victoria, Australia

  • -

  • -

    Melbourne, Australia

Publications

  • AI: Ready or not?

    Research News Live

    Even without generally intelligent artificial systems, we have powerful tools that give us access to insights of unprecedented scale and complexity into patterns of human behaviour, writes Karin Verspoor.

    See publication
  • The women rebels, geniuses and pioneers who inspired us: Grace Hopper

    University of Melbourne Pursuit

    Rear Admiral Dr Grace Murray Hopper was a significant figure in the history of computing. Her work revolutionised how we program today.

    From early childhood, Hopper had a strong interest in how things worked, gaining a reputation for dismantling her family’s alarm clocks.

    Her talent for mathematics led her to pursue research at Yale, where she gained her PhD in 1934. She also had a distinguished naval career during World War II, where she was one of the first programmers of the…

    Rear Admiral Dr Grace Murray Hopper was a significant figure in the history of computing. Her work revolutionised how we program today.

    From early childhood, Hopper had a strong interest in how things worked, gaining a reputation for dismantling her family’s alarm clocks.

    Her talent for mathematics led her to pursue research at Yale, where she gained her PhD in 1934. She also had a distinguished naval career during World War II, where she was one of the first programmers of the Navy’s IBM Mark I computer.

    She famously coined the term “debugging the machine” after a moth found its way into her computer.

    Hopper strongly believed that programs should be written in a language that was close to English, an idea that was rejected by the computing establishment at the time. Yet she and her staff proved them wrong, successfully making the UNIVAC computer “understand” 20 statements in English. Hopper’s work led to the development of the English-based programming language COBOL (Common Business Oriented Language).
    Grace Hopper helped to move us from giving computers instructions via cryptic machine code to a human readable “higher level” programming that is closer to natural language instructions. Her work was vital to the field of computing, allowing us to express the logic of a program without having to translate it into numbers.

    See publication
  • C'mon girls, let’s program a better tech industry

    The Conversation

    Media piece on women in IT.

    See publication
  • Standardized Mutual Information for Clustering Comparisons: A Step Further in Adjustment for Chance

    Proceedings of the 31st International Conference on Machine Learning (ICML 2014), JMLR, pages 1143-1151, June 21-26, Beijing, China, 2014.

  • Enhancing diagnostics for invasive Aspergillosis using machine learning

    Proceedings of the Abstracts of the Scientific Stream at Big Data 2014 Melbourne, Australia, April 3-4, 2014.

  • Large-scale biomedical concept recognition: An evaluation of current au- tomatic annotators and their parameter

    BMC Bioinformatics

    Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem.

    Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated…

    Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem.

    Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented.

    Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.

    Other authors
    See publication
  • Diving deep into data to crack the gene code on disease

    The Conversation

    Article looking at the need to explore supplementary information associated with publications, to find information on genetic variation.

    See publication
  • Impact of Corpus Diversity and Complexity on NER Performance

    Proceedings of the Australasian Language Technology Association Workshop 2013

  • A Posteriori Ontology Engineering for Data-Driven Science

    Chapman and Hall/CRC Press

    In: Data-Intensive Science, Eds. T. Critchlow and K. Kleese van Dam

    Other authors
    See publication
  • Text Mining Improves Prediction of Protein Functional Sites

    PLoS One

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are…

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

    Other authors
    • Judith D Cohn
    • Ravikumar Komandur
    • Michael Wall
    See publication
  • Knowledge Integration in Open Worlds: Utilizing the Mathematics of Hierarchical Structure

    Proc. IEEE Int. Conf. Semantic Computing (ICSC 07), IEEE Computer Society, pp. 105-112

    Other authors
  • Deconstruction, Reconstruction, and Ontogenesis for Large, Monolithic, Legacy Ontologies in Semantic Web Service Applications

    Los Alamos Technical Report 06-5859

    Other authors
  • Distributed Representations of Bio-Ontologies for Semantic Web Services

    Joint BioLINK and 9th Bio-Ontologies Meeting (JBB 06), ISMB 06

    Other authors

Patents

  • System and method for knowledge based matching of users in a network

    Issued US US7933856

    A knowledge-based system and methods to matchmaking and social network extension are disclosed. The system is configured to allow users to specify knowledge profiles, which are collections of concepts that indicate a certain topic or area of interest. The system utilizes the knowledge model as the semantic space within which to compare similarities in user interests. The knowledge model is hierarchical so that indications of interest in specific concepts automatically imply interest in more…

    A knowledge-based system and methods to matchmaking and social network extension are disclosed. The system is configured to allow users to specify knowledge profiles, which are collections of concepts that indicate a certain topic or area of interest. The system utilizes the knowledge model as the semantic space within which to compare similarities in user interests. The knowledge model is hierarchical so that indications of interest in specific concepts automatically imply interest in more general concept. Similarity measures between profiles may then be calculated based on suitable distance formulas within this space.

Languages

  • Spanish

    -

  • Dutch

    -

  • French

    -

Organizations

  • Australasian Language Technology Association (ALTA)

    Secretary

    - Present

    http://www.alta.asn.au

  • Australasian Language Technology Association (ALTA)

    President

    -

    http://www.alta.asn.au/

Recommendations received

More activity by Karin

View Karin’s full profile

  • See who you know in common
  • Get introduced
  • Contact Karin directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses