To keep up with and go beyond state-of-the-art, knowledge workers must extract and interpret valuable and relevant knowledge from the huge amount of documents available in multiple repositories. In life science, they also need to mine the big data sets they collect from the lab experiments. In this context, I am interested in semantic infrastructures supporting biocuration for genomics and biomedical research, as well as machine-learning-based classification systems capable of detecting relevant documents in highly imbalanced corpora. In parallel with literature mining, I design data mining and analysis tasks based on relational and statistical approaches. Lien vers le projet