CEAUL Seminars

Room 6.4.31, FCUL, Lisbon

Big Data Analytics: Applications and Challenges
Sílvia Rebouças
Faculdade de Economia, Administração, Atuária e Contabilidade
Universidade Federal do Ceará, Brasil

Advances in information technology have led organizations to store large amounts of data. Analyzing this data with data mining techniques, also called big data analytics, provides important support for decision-making. This seminar presents two applications: one for structured data and another for unstructured data in text format. The first aims to classify the beneficiaries of a health insurance operator in Brazil according to their financial sustainability, based on their sociodemographic characteristics and their healthcare cost history. Beneficiaries with a loss ratio greater than 0.75 were considered unsustainable. The sample consisted of 38875 beneficiaries who were active between 2011 and 2013. The techniques used were logistic regression, which showed the best performance (with an accuracy rate of 68.43%), and classification trees. Age and type of plan were the most important variables in the classification among those related to the beneficiaries' profile. Among healthcare costs, the variables that stood out were annual spending on consultations and on dental insurance. In the second application, the goal is to develop a tool to evaluate an organization's stakeholder engagement, as disclosed in sustainability reports, using a text mining approach. To achieve this goal, reports of Brazilian companies published during 2016 under the G4 guidelines of the Global Reporting Initiative were used. The results showed that the most frequently mentioned stakeholders were employees, clients, market and suppliers, and that the major concern they raised relates mainly to the environment. Two clusters with different patterns of stakeholder engagement were found, but firm characteristics did not differ significantly between them. The proposed method facilitates pattern recognition in texts, eliminating the need for time-consuming techniques, such as content analysis, that are usually used in the analysis of reports.
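The 0.75 loss-ratio cutoff used in the first application can be illustrated with a minimal Python sketch. The abstract does not define the loss ratio, so the sketch assumes the usual insurance definition (healthcare costs divided by premiums paid); the `Beneficiary` class and its field names are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass

# Cutoff stated in the abstract: a loss ratio above 0.75 is "unsustainable".
LOSS_RATIO_CUTOFF = 0.75


@dataclass
class Beneficiary:
    """Hypothetical record; the field names are illustrative assumptions."""
    beneficiary_id: str
    healthcare_cost: float  # total cost of care over the period
    premiums_paid: float    # total premiums paid over the same period


def loss_ratio(b: Beneficiary) -> float:
    """Assumed definition: healthcare cost divided by premiums paid."""
    if b.premiums_paid <= 0:
        raise ValueError("premiums_paid must be positive")
    return b.healthcare_cost / b.premiums_paid


def is_unsustainable(b: Beneficiary, cutoff: float = LOSS_RATIO_CUTOFF) -> bool:
    """Label a beneficiary as unsustainable when the loss ratio exceeds the cutoff."""
    return loss_ratio(b) > cutoff


if __name__ == "__main__":
    sample = [
        Beneficiary("A", healthcare_cost=800.0, premiums_paid=1000.0),
        Beneficiary("B", healthcare_cost=300.0, premiums_paid=1000.0),
    ]
    for b in sample:
        print(b.beneficiary_id, loss_ratio(b), is_unsustainable(b))
```

A binary label of this kind would be the response variable that classifiers such as logistic regression or classification trees are then trained to predict.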


Robust Inference for ROC Regression
Vanda Inácio de Carvalho
School of Mathematics, University of Edinburgh, UK

The receiver operating characteristic (ROC) curve is the most popular tool for evaluating the diagnostic accuracy of continuous biomarkers. Often, covariate information that affects the biomarker performance is also available, and several regression methods have been proposed to incorporate covariates in the ROC framework. In this work, we propose robust inference methods for ROC regression, which can be used to safeguard against the presence of outlying biomarker values. Simulation results suggest that the methods perform well in recovering the true conditional ROC curve and the corresponding area under the curve under a variety of data contamination scenarios. The methods are illustrated using data on the age-specific accuracy of glucose as a biomarker of diabetes.
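For readers unfamiliar with the ROC curve and the area under it (AUC), the following minimal Python sketch computes the plain empirical versions for a single biomarker. It is not the robust ROC regression method proposed in the talk: it ignores covariates and makes no attempt to handle outliers. The function names and the toy values are illustrative assumptions.

```python
def empirical_roc(diseased, healthy):
    """Return (FPR, TPR) points, classifying a subject as positive
    when the biomarker value is at or above the threshold."""
    thresholds = sorted(set(diseased) | set(healthy), reverse=True)
    points = [(0.0, 0.0)]  # threshold above every observed value
    for c in thresholds:
        tpr = sum(d >= c for d in diseased) / len(diseased)
        fpr = sum(h >= c for h in healthy) / len(healthy)
        points.append((fpr, tpr))
    return points


def empirical_auc(diseased, healthy):
    """Empirical AUC: the proportion of (diseased, healthy) pairs in which
    the diseased subject has the higher value, with ties counted as 1/2."""
    wins = 0.0
    for d in diseased:
        for h in healthy:
            if d > h:
                wins += 1.0
            elif d == h:
                wins += 0.5
    return wins / (len(diseased) * len(healthy))


if __name__ == "__main__":
    # Toy biomarker values, invented for illustration only.
    diseased = [3, 4, 5]
    healthy = [1, 2, 3]
    print(empirical_roc(diseased, healthy))
    print(empirical_auc(diseased, healthy))
```

A conditional ROC curve, as in ROC regression, would let these quantities vary with a covariate such as age instead of pooling all subjects.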

14h00-16h30
CEAUL - Centro de Estatística e Aplicações da Universidade de Lisboa
