Skip to main content

Showing 1–11 of 11 results for author: Ascensão, J T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.15677  [pdf, other

    cs.LG cs.CR

    Adversarial training for tabular data with attack propagation

    Authors: Tiago Leon Melo, João Bravo, Marco O. P. Sampaio, Paolo Romano, Hugo Ferreira, João Tiago Ascensão, Pedro Bizarro

    Abstract: Adversarial attacks are a major concern in security-centered applications, where malicious actors continuously try to mislead Machine Learning (ML) models into wrongly classifying fraudulent activity as legitimate, whereas system maintainers try to stop them. Adversarially training ML models that are robust against such attacks can prevent business losses and reduce the work load of system maintai… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  2. arXiv:2307.13787  [pdf, other

    cs.LG cs.CR

    The GANfather: Controllable generation of malicious activity to improve defence systems

    Authors: Ricardo Ribeiro Pereira, Jacopo Bono, João Tiago Ascensão, David Aparício, Pedro Ribeiro, Pedro Bizarro

    Abstract: Machine learning methods to aid defence systems in detecting malicious activity typically rely on labelled data. In some domains, such labelled data is unavailable or incomplete. In practice this can lead to low detection rates and high false positive rates, which characterise for example anti-money laundering systems. In fact, it is estimated that 1.7--4 trillion euros are laundered annually and… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  3. arXiv:2207.08640  [pdf, other

    cs.LG

    Lightweight Automated Feature Monitoring for Data Streams

    Authors: João Conde, Ricardo Moreira, João Torres, Pedro Cardoso, Hugo R. C. Ferreira, Marco O. P. Sampaio, João Tiago Ascensão, Pedro Bizarro

    Abstract: Monitoring the behavior of automated real-time stream processing systems has become one of the most relevant problems in real world applications. Such systems have grown in complexity relying heavily on high dimensional input data, and data hungry Machine Learning (ML) algorithms. We propose a flexible system, Feature Monitoring (FM), that detects data drifts in such data sets, with a small and co… ▽ More

    Submitted 19 July, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 10 pages, 5 figures. AutoML, KDD22, August 14-17, 2022, Washington, DC, US

  4. arXiv:2112.07508  [pdf, ps, other

    cs.LG

    Anti-Money Laundering Alert Optimization Using Machine Learning with Graphs

    Authors: Ahmad Naser Eddin, Jacopo Bono, David Aparício, David Polido, João Tiago Ascensão, Pedro Bizarro, Pedro Ribeiro

    Abstract: Money laundering is a global problem that concerns legitimizing proceeds from serious felonies (1.7-4 trillion euros annually) such as drug dealing, human trafficking, or corruption. The anti-money laundering systems deployed by financial institutions typically comprise rules aligned with regulatory frameworks. Human investigators review the alerts and report suspicious cases. Such systems suffer… ▽ More

    Submitted 17 June, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 8 pages, 5 figures

    MSC Class: I.2.1; J.4

  5. arXiv:2108.09200  [pdf, other

    cs.SI

    GUDIE: a flexible, user-defined method to extract subgraphs of interest from large graphs

    Authors: Maria Inês Silva, David Aparício, Beatriz Malveiro, João Tiago Ascensão, Pedro Bizarro

    Abstract: Large, dense, small-world networks often emerge from social phenomena, including financial networks, social media, or epidemiology. As networks grow in importance, it is often necessary to partition them into meaningful units of analysis. In this work, we propose GUDIE, a message-passing algorithm that extracts relevant context around seed nodes based on user-defined criteria. We design GUDIE for… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: 16 pages, 8 figures, accepted at GEM2021

  6. arXiv:2108.04494  [pdf, other

    cs.SI

    Finding NeMo: Fishing in banking networks using network motifs

    Authors: Xavier Fontes, David Aparício, Maria Inês Silva, Beatriz Malveiro, João Tiago Ascensão, Pedro Bizarro

    Abstract: Banking fraud causes billion-dollar losses for banks worldwide. In fraud detection, graphs help understand complex transaction patterns and discovering new fraud schemes. This work explores graph patterns in a real-world transaction dataset by extracting and analyzing its network motifs. Since banking graphs are heterogeneous, we focus on heterogeneous network motifs. Additionally, we propose a no… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: 6 pages, 6 figures, accepted at SEAData 2021

  7. arXiv:2107.07724  [pdf, other

    cs.LG stat.ML

    Active learning for imbalanced data under cold start

    Authors: Ricardo Barata, Miguel Leite, Ricardo Pacheco, Marco O. P. Sampaio, João Tiago Ascensão, Pedro Bizarro

    Abstract: Modern systems that rely on Machine Learning (ML) for predictive modelling, may suffer from the cold-start problem: supervised models work well but, initially, there are no labels, which are costly or slow to obtain. This problem is even worse in imbalanced data scenarios, where labels of the positive class take longer to accumulate. We propose an Active Learning (AL) system for datasets with orde… ▽ More

    Submitted 22 October, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

    Comments: 9 pages, 6 figures, 2 tables

    Journal ref: ACM International Conference on AI in Finance, Nov 2021

  8. arXiv:2102.05373  [pdf, other

    cs.LG cs.SI

    GuiltyWalker: Distance to illicit nodes in the Bitcoin network

    Authors: Catarina Oliveira, João Torres, Maria Inês Silva, David Aparício, João Tiago Ascensão, Pedro Bizarro

    Abstract: Money laundering is a global phenomenon with wide-reaching social and economic consequences. Cryptocurrencies are particularly susceptible due to the lack of control by authorities and their anonymity. Thus, it is important to develop new techniques to detect and prevent illicit cryptocurrency transactions. In our work, we propose new features based on the structure of the graph and past labels to… ▽ More

    Submitted 21 July, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 5 pages, 3 figures

  9. arXiv:2005.14635  [pdf, other

    cs.LG stat.ML

    Machine learning methods to detect money laundering in the Bitcoin blockchain in the presence of label scarcity

    Authors: Joana Lorenz, Maria Inês Silva, David Aparício, João Tiago Ascensão, Pedro Bizarro

    Abstract: Every year, criminals launder billions of dollars acquired from serious felonies (e.g., terrorism, drug smuggling, or human trafficking) harming countless people and economies. Cryptocurrencies, in particular, have developed as a haven for money laundering activity. Machine Learning can be used to detect these illicit patterns. However, labels are so scarce that traditional supervised algorithms a… ▽ More

    Submitted 5 October, 2021; v1 submitted 29 May, 2020; originally announced May 2020.

    Comments: 8 pages, 7 figures

  10. arXiv:2002.06075  [pdf, other

    cs.LG cs.AI cs.DB stat.ML

    ARMS: Automated rules management system for fraud detection

    Authors: David Aparício, Ricardo Barata, João Bravo, João Tiago Ascensão, Pedro Bizarro

    Abstract: Fraud detection is essential in financial services, with the potential of greatly reducing criminal activities and saving considerable resources for businesses and customers. We address online fraud detection, which consists of classifying incoming transactions as either legitimate or fraudulent in real-time. Modern fraud detection systems consist of a machine learning model and rules defined by h… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 11 pages, 12 figures, submitted to KDD '20 Applied Data Science Track

  11. arXiv:2002.05988  [pdf, other

    cs.LG cs.CR stat.ML

    Interleaved Sequence RNNs for Fraud Detection

    Authors: Bernardo Branco, Pedro Abreu, Ana Sofia Gomes, Mariana S. C. Almeida, João Tiago Ascensão, Pedro Bizarro

    Abstract: Payment card fraud causes multibillion dollar losses for banks and merchants worldwide, often fueling complex criminal activities. To address this, many real-time fraud detection systems use tree-based models, demanding complex feature engineering systems to efficiently enrich transactions with historical data while complying with millisecond-level latencies. In this work, we do not require thos… ▽ More

    Submitted 17 June, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: 9 pages, 4 figures, to appear in SIGKDD'20 Industry Track