Gonçalo Faria

bayesian deep learning, deep generative modelling, nlp, causal discovery

CV

Hey, I'm Gonçalo.

I am currently an PhD student at University of Washington, under the guidance of Professor Noah Smith.

My research interests are robustness to distribution shifts, o.o.d generalization, and reliability, particularly for complex reasoning tasks.

I studied at the University of Minho for my undergrad and at Instituto Superior Técnico for my master's. My master's dissertation, advised by André Martins and Mário Figueiredo, focused on Differentiable Causal Discovery.

Feel free to reach out! I'm always happy to chat about research, life, or anything else.




Selected Publications
NEURIPS 2024
Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins
Abstract : An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware and minimum Bayes risk decoding). However, relying on a single translation with high estimated quality increases the chances of "gaming the metric''. In this paper, we address the problem of sampling a set of high-quality and diverse translations. We provide a simple and effective way to avoid over-reliance on noisy quality estimates by using them as the energy function of a Gibbs distribution. Instead of looking for a mode in the distribution, we generate multiple samples from high-density areas through the Metropolis-Hastings algorithm, a simple Markov chain Monte Carlo approach. The results show that our proposed method leads to high-quality and diverse outputs across multiple language pairs (English↔{German, Russian}) with two strong decoder-only LLMs (Alma-7b, Tower-7b).
CLEAR 2022
Gonçalo R. A. Faria, André F. T. Martins, Mário A. T. Figueiredo
Abstract : Recent work has shown promising results in causal discovery by leveraging interventional data with gradient-based methods, even when the intervened variables are unknown. However, previous work assumes that the correspondence between samples and interventions is known, which is often unrealistic. We envision a scenario with an extensive dataset sampled from multiple intervention distributions and one observation distribution, but where we do not know which distribution originated each sample and how the intervention affected the system, \textit{i.e.}, interventions are entirely latent. We propose a method based on neural networks and variational inference that addresses this scenario by framing it as learning a shared causal graph among an infinite mixture (under a Dirichlet process prior) of intervention structural causal models. Experiments with synthetic and real data show that our approach and its semi-supervised variant are able to discover causal relations in this challenging scenario.
Publications
EMNLP 2024
Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins
NEURIPS 2024
Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins
CLEAR 2022
Gonçalo R. A. Faria, André F. T. Martins, Mário A. T. Figueiredo

Also on Google Scholar