PhD in Informatics Seminar

Evolving meaning for supervised learning in complex biomedical domains using knowledge graphs

Transmissão através de Videoconferência

Por Rita Sousa.

In recent years, the explosion in complexity and heterogeneity of data has motivated a new paradigm, where millions of semantically-described entities are available in knowledge graphs (KGs). Different KGs have been exploited in a wide variety of data mining and machine learning tasks, namely the prediction of specific relations between entities that correspond to KG instances but whose relationship is not encoded in the graph. This scenario has various bioinformatics applications, such as predicting protein-protein interactions, drug-drug interactions or gene-disease associations.

Many of the existing KG-based approaches for machine learning use KGs for generating static semantic representations, which are then used as features. These semantic representations can be considered static since they consider the full graph, blind to the fact that unnecessary information for representations can introduce noise. In applications where the target is encoded in the KG, this problem is mitigated. However, when the classification targets are not a part of the KG, representations cannot be trained on the targets. The problem is exacerbated in complex domains, such as the biomedical, where KGs represent multiple views (or semantic aspects) over the underlying data, some of which may be less relevant to train the model. This brings up the challenge of tailoring the semantic representation of the KGs entities to a specific goal.

This Ph.D. project aims to address this challenge by developing novel machine learning-based approaches to learn suitable semantic representations of data objects extracted from KGs to support specific supervised learning tasks in bioinformatics applications. The novel approaches are anchored in a framework that integrates semantic representation approaches and learning approaches, and allows a comparative evaluation using benchmarks. This framework was successfully applied to protein-protein interaction prediction with significant improvements over manually defined static semantic representations, and to learn similarity functions adapted to different biological perspectives.

More information: https://moodle.ciencias.ulisboa.pt/course/view.php?id=2964#section-2


Zoom

12h00
Ginásio "inundado" de tecnologia

Um programa único na Europa, com o objetivo de capacitar para a integração crítica, segura e eficaz de ferramentas digitais na intervenção clínica - candidaturas até 30 de janeiro.

Imagem abstrata

Neste curso, será promovida uma abordagem multidisciplinar, apresentando as descobertas mais recentes sobre o tema e desafiando a forma tradicional de considerar as associações simbióticas como exceções e não como a regra - candidaturas até 09 de janeiro.

A conferência visa reunir os principais especialistas no domínio da Imagiologia Médica por Micro-ondas (MMWI) e incluirá palestras, apresentações e pósteres de resumos revistos por pares e artigos de conferências, bem como workshops em áreas satélite de investigação com interesse para a investigação em MMWI.

Pessoas a analisarem dados

Candidaturas até 13 de fevereiro.

Um curso prático, limitado a um pequeno número de participantes, destinado a quem procura formação básica em teoria e estatística macroecológica e deseja familiarizar-se com algumas das potenciais utilizações de vários métodos avançado - candidaturas até 13 de fevereiro.

Páginas