CEAUL Seminars

Room 6.4.31, FCUL, Lisbon

Big Data Analytics: Applications and Challenges
Sílvia Rebouças
Faculdade de Economia, Administração, Atuária e Contabilidade
Universidade Federal do Ceará, Brazil

Advances in information technologies have led organizations to store large amounts of data. The analysis of these data through data mining techniques, also called big data analytics, provides important support for decision-making. This seminar presents two applications: one for structured data and another for unstructured data in text format. The first aims to classify the beneficiaries of a health insurance operator in Brazil according to their financial sustainability, based on their sociodemographic characteristics and their healthcare cost history. Beneficiaries with a loss ratio greater than 0.75 were considered unsustainable. The sample consisted of 38875 beneficiaries who were active between 2011 and 2013. The techniques used were logistic regression, which presented the best performance (with an accuracy rate of 68.43%), and classification trees. Age and type of plan were the most important variables related to the profile of the beneficiaries in the classification. Among healthcare costs, annual spending on consultations and on dental insurance stood out. In the second application, the goal is to develop a tool to evaluate an organization's stakeholder engagement, as disclosed in sustainability reports, using a text mining approach. To this end, reports published during 2016 by Brazilian companies under the G4 guidelines of the Global Reporting Initiative were used. The results showed that the most mentioned stakeholders were employees, clients, the market and suppliers, and that the major concern they raised relates mainly to the environment. Two clusters with different patterns of stakeholder engagement were found, but firm characteristics did not differ significantly between them. The proposed method facilitates pattern recognition in texts, eliminating the need for time-consuming techniques, such as content analysis, that are usually used to analyze reports.
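
As a rough illustration of the labeling rule in the first application, the sketch below marks a beneficiary as unsustainable when the loss ratio exceeds 0.75. The abstract does not define the loss ratio; here it is assumed to be healthcare costs divided by premiums paid, and the function names and the three example beneficiaries are hypothetical.

```python
def loss_ratio(costs: float, premiums: float) -> float:
    """Loss ratio, ASSUMED here to be healthcare costs divided by premiums paid."""
    if premiums <= 0:
        raise ValueError("premiums must be positive")
    return costs / premiums


def is_unsustainable(costs: float, premiums: float, threshold: float = 0.75) -> bool:
    """Apply the threshold from the abstract: loss ratio strictly greater than 0.75."""
    return loss_ratio(costs, premiums) > threshold


# Hypothetical beneficiaries as (healthcare costs, premiums paid); not real data.
beneficiaries = [(300.0, 1000.0), (750.0, 1000.0), (900.0, 1000.0)]
labels = [is_unsustainable(costs, premiums) for costs, premiums in beneficiaries]
```

Labels of this kind would serve as the binary response for a classifier such as the logistic regression or classification trees mentioned above; a loss ratio of exactly 0.75 is not flagged, since the rule is a strict inequality.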


Robust Inference for ROC Regression
Vanda Inácio de Carvalho
School of Mathematics, University of Edinburgh, UK

The receiver operating characteristic (ROC) curve is the most popular tool for evaluating the diagnostic accuracy of continuous biomarkers. Often, covariate information that affects the biomarker performance is also available, and several regression methods have been proposed to incorporate covariates in the ROC framework. In this work, we propose robust inference methods for ROC regression, which can be used to safeguard against the presence of outlying biomarker values. Simulation results suggest that the methods perform well in recovering the true conditional ROC curve and the corresponding area under the curve under a variety of data contamination scenarios. The methods are illustrated using data on the age-specific accuracy of glucose as a biomarker of diabetes.
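
For readers unfamiliar with the area under the ROC curve, the sketch below computes its standard empirical estimate: the proportion of (diseased, healthy) pairs in which the diseased subject has the higher biomarker value, with ties counted as one half. It ignores covariates and is not the robust method proposed in the talk; the function name and the glucose-like values are hypothetical.

```python
def empirical_auc(diseased: list[float], healthy: list[float]) -> float:
    """Empirical AUC: share of (diseased, healthy) pairs ranked correctly, ties count 1/2."""
    if not diseased or not healthy:
        raise ValueError("both groups must be non-empty")
    score = 0.0
    for d in diseased:
        for h in healthy:
            if d > h:
                score += 1.0
            elif d == h:
                score += 0.5
    return score / (len(diseased) * len(healthy))


# Hypothetical biomarker values for illustration only; not data from the study.
healthy = [4.8, 5.1, 5.4, 5.9]
diseased = [5.6, 6.3, 7.0, 11.2]
auc = empirical_auc(diseased, healthy)  # 15 of 16 pairs are ranked correctly
```

An AUC of 0.5 corresponds to a biomarker that does no better than chance, and 1.0 to perfect separation of the two groups. A conditional (covariate-specific) version, as in ROC regression, would let this quantity vary with a covariate such as age.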

14h00-16h30
CEAUL - Centro de Estatística e Aplicações da Universidade de Lisboa