Data Science Seminars@Ciências

Biclustering in Biomedical Data Analysis: Algorithms and Applications

Room 6.3.38, FCUL, Lisbon

Biclustering, the discovery of sets of objects with coherent values or patterns on subsets of features, has been shown to be key to unravelling and characterizing informative regions (biclusters) within matrix, time series and network data, in a wide range of applications in biomedical and social data analysis. This is particularly true in biomedical problems, where groups of genes or patients tend to be meaningfully related only on a subset of the sampled or monitored conditions. The challenging combinatorial nature of the biclustering problem has led to the development of several approaches, with variations in the allowed type, number, positioning and quality of biclusters. The state of the art relies on efficient string processing and mining techniques in the case of biclustering temporal data, and on pattern mining algorithms in the general case of biclustering matrix and network data.
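To make the notion of a "coherent" bicluster concrete, here is a minimal sketch that is not taken from the talk. It assumes one common coherence measure, the mean squared residue, which is close to zero when a submatrix follows an additive (shifting) pattern. The toy matrix, the function name and the chosen row and column indices are all hypothetical and serve only as illustration.

```python
import numpy as np


def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the submatrix given by rows x cols.

    Illustrative coherence score (an assumption, not the talk's method):
    values near 0 mean the submatrix follows an additive pattern.
    """
    sub = matrix[np.ix_(rows, cols)]
    row_means = sub.mean(axis=1, keepdims=True)
    col_means = sub.mean(axis=0, keepdims=True)
    overall_mean = sub.mean()
    residue = sub - row_means - col_means + overall_mean
    return float((residue ** 2).mean())


# Hypothetical toy data: 4 genes (rows) x 5 conditions (columns).
data = np.array([
    [1.0, 9.0, 2.0, 3.0, 7.0],
    [2.0, 4.0, 3.0, 4.0, 1.0],
    [5.0, 0.0, 6.0, 7.0, 8.0],
    [8.0, 3.0, 1.0, 0.0, 2.0],
])

# Genes 0-2 behave coherently only on conditions 0, 2 and 3.
bicluster_score = mean_squared_residue(data, rows=[0, 1, 2], cols=[0, 2, 3])
# The full matrix mixes unrelated rows and columns, so it scores much worse.
whole_score = mean_squared_residue(data, rows=[0, 1, 2, 3], cols=[0, 1, 2, 3, 4])

print(f"bicluster: {bicluster_score:.3f}, whole matrix: {whole_score:.3f}")
```

The three selected genes are related only on a subset of the conditions, which is the situation described above. A traditional clustering over all columns would have to compare them on every condition at once.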

This talk introduces the biclustering problem, comparing it to the traditional clustering problem, provides an overview of the state of the art in biclustering matrix and network data, and then focuses on the problem of biclustering temporal data, tackling in particular the biclustering of gene expression time series obtained from transcriptomics. Ongoing work on new triclustering algorithms to simultaneously analyse multiple gene expression time series (three-way time series) and multiple multivariate time series collected at clinical follow-up is also discussed, together with their applications in biomedical problems such as the identification of disease progression patterns in the NEUROCLINOMICS2 project (PTDC/EEI-SII/1937/2014).

Short Bio: SARA C. MADEIRA has been an Associate Professor at the Department of Informatics of the Faculty of Sciences, University of Lisbon (FCUL), since mid-February 2017, where she teaches graduate courses on data mining, machine learning and foundations of data science and an undergraduate course on intelligent systems. She is also a senior researcher at LASIGE, where she is a member of the Data and Systems Intelligence and the Health and Biomedical Informatics research lines. Her research interests include data mining, machine learning, bioinformatics and medical informatics. In this context, she was the PI of "NEUROCLINOMICS - Understanding NEUROdegenerative diseases through CLINical and OMICS data" (PTDC/EIA-EIA/111239/2009), a research project embracing the challenges of studying complex diseases and developing efficient and effective mining algorithms for biomedical data, using Amyotrophic Lateral Sclerosis and Alzheimer's disease as case studies, which was followed by the ongoing project "NEUROCLINOMICS2 - Unravelling Prognostic Markers in NEUROdegenerative diseases through CLINical and OMICS data integration" (PTDC/EEI-SII/1937/2014). Her survey on "Biclustering Algorithms for Biological Data Analysis" was considered an ESI Hot Paper in Computer Science in November 2006. Biclustering algorithms and their applications in biomedical data analysis are still her main research topics.

14h00
Department of Informatics
This course aims to provide students with the statistical knowledge and tools to manipulate, analyse and visualise biological data with R. It also includes an introduction to modeling, simulations and Bayesian statistics.

Verão na ULisboa

Applications open from 7 April!

Computability in Europe (CiE) is an interdisciplinary series of international conferences organised by the Association Computability in Europe (ACiE).

This course aims to provide an updated view of the potential of museum collections for biodiversity research. More specifically, it aims to present case studies on the value of museums and the use of collections and specimens in the 21st century, using new technologies and analytical methods.

Groundwater

Course accredited for teachers' career progression in the scientific-pedagogical dimension for groups 420 and 520, with applications until 2 June.

Internship programme

Ready to explore research up close? Application deadline: 10 May.

The 10th edition of Ser Cientista takes place from 21 to 25 July - come and do research with us!

An international symposium that convenes researchers specializing in various disciplines focused on the terrestrial and marine flora and vegetation of the Macaronesian region (Azores, Madeira, Selvagens, Canary Islands, and Cabo Verde).

The course aims to give trainees the knowledge needed to join multidisciplinary professional teams in the medico-legal and forensic fields, in medico-legal and forensic laboratories or services - applications until 27 July.

Participants in this course will acquire the essential knowledge needed to join multidisciplinary professional teams in the field of clinical analysis/clinical pathology, in private, public, hospital or state laboratories - applications until 27 July.

The course, with applications until 20 July, invites basic and secondary education teachers to explore geology through the rocks that outcrop in the vicinity of their school.

The course aims to train participants to apply the ecological quality indices used to assess environmental quality in transitional systems, under the Water Framework Directive (WFD) - applications until 31 August.

The conference aims to bring together key experts in the Medical Microwave Imaging (MMWI) field and will include invited talks, presentations and posters of peer-reviewed abstracts and conference papers, and workshops in satellite areas of research that are of interest to MMWI research.