Open Source Tools

Non-Performing Loan Ontology

Non-Performing Loan Ontology

The NPL Ontology (NPLO) is a new ontology describing datasets of Non-Perfoming Loan Portfolios.

Reading Time: 4 min.

NPLO Visualization

The Non-Perfoming Loan Ontology

The Non-Performing Loan Ontology is a framework that aims to represent and categorize knowledge about non-performing loans using semantic web information technologies. Codenamed NPLO, it codifies the relationship between the various components of a Non-Performing Loan portfolio dataset.(NB: Non-performing loans are bank loans that are 90 days or more past their repayment date or that are unlikely to be repaid, for example if the borrower is facing financial difficulties).

What do people talk about at FOSDEM 2021

What do people talk about at FOSDEM 2021

FOSDEM is the Free and Open Source Software Developers European Meeting.

Reading Time: 3 min.

FOSDEM 2021

Introduction

What is FOSDEM?

FOSDEM is a non-commercial, volunteer-organized event centered on free and open-source software development (with a geographic focus on the European open source ecosystems / projects). FOSDEM is aimed at developers and anyone interested in the free and open-source software movement. It aims to enable developers to meet and to promote the awareness and use of free and open-source software.

Risk Function Ontology

Risk Function Ontology

The Risk Function Ontology (RFO) is a new ontology describing risk management roles (posts) and functions.

Reading Time: 3 min.

RFO Visualization

The Risk Function Ontology

The Risk Function Ontology is a framework that aims to represent and categorize knowledge about risk management functions using semantic web information technologies. Codenamed RFO codifies the relationship between the various components of a risk management organization. Individuals, teams or even whole departments tasked with risk management exist in some shape or form in most organizations. The ontology allows the definition of risk management roles in more precise terms, which in turn can be used in a variety of contexts: towards better structured actual job descriptions, more accurate description of internal processes and easier inspection of alignement and consistency with risk taxonomies. See also live version and the white paper OpenRiskWP04_061415.

Logarithmic Sankey Visualization of Credit Migrations

Logarithmic Sankey Visualization of Credit Migrations

Sankey diagrams are very useful for the visualization of flows, especially when there is a conserved quantity. They can be tricky when some of the flows are much smaller than others. In the latest release of transitionMatrix we include an example of a log-scale version of Sankey

Reading Time: 5 min.

Using Sankey Diagrams

Sankey Diagrams are a type of flow diagram composed of interconnected arrows. The width of the arrows is proportional to the flow rate. Sankey diagrams are often used in physical sciences (physics, chemistry, biology) and engineering but also in economics. They can be used to represent the relative role and significance of various inputs and outputs in a given process.

openNPL 0.2 REST API implementation

openNPL 0.2 REST API implementation

The 0.2 release of openNPL exposes a RESTful API that provides easy standardized online access to NPL credit portfolio data conforming to the EBA NPL templates

Reading Time: 4 min.

openNPL 0.2 release

The open source openNPL platform supports the management of standardized credit portfolio data for non-performing loans. In this respect it implements the detailed European Banking Authority NPL loan templates. openNPL aims to be at the same time easy to integrate in human workflows (using a familiar web interface) and integrate into automated (computer driven) workflows.

openNPL now Available in Dockerized Form

openNPL now Available in Dockerized Form

Open Source, cloud based management of Non-Performing Loan data following the European Banking Authority's templates with just a few keystrokes!

Reading Time: 1 min.

openNPL now Available in Dockerized Form

Following up on the first release of openNPL the platform is now available to install using Docker. Running openNPL via docker is the installation option that simplifies the manual process (but a working docker installation is required!).

openNPL plus Docker

Docker Hub

You can pull the latest openNPL image from Docker Hub (This method is recommended if you do not want to mess with the source distribution).

21 Ways to Visualize a Timeseries

21 Ways to Visualize a Timeseries

We explore a variety of distinct ways to visualize the same simple dataset. The post is an excursion into the fundamentals of visualization - a partial deconstruction of the process that highlights some common techniques and associated issues.

Reading Time: 1 min.

21 Visualizations of Timeseries

Course Objective

This course is a deep-dive into the structure of visualizations, in particular visualizations of timeseries data. The course is now live at the Academy.

Pre-requisites

Knowledge of basic visualization techniques and mathematical notation of functions and maps. Familiarity with data series and their usage in data science.

openNPL: Open Source NPL Platform - First Release

openNPL: Open Source NPL Platform - First Release

We introduce an open source platform that allows the easy management of non-performing loan data

Reading Time: 4 min.

Non-Performing Loans

The covid-19 crisis will certainly impact the concentration of Non-Performing Loans but given the special nature of this economic crisis compared (in particular) with the 2008 financial crisis it is unclear how precisely things will evolve.

In a previous post and white paper (OpenRiskWP07_022616) we discussed the importance of advancing open and transparent methodologies for managing the risks associated with such credit portfolios. Effective management of NPL is also a top regulatory priority. Following calls from the EU Commission and the EU Council to develop data templates to reduce information asymmetries between potential buyers and sellers of NPL, the European Banking Authority (EBA) has developed such standardised data templates.

What do people talk about at FOSDEM 2020

What do people talk about at FOSDEM 2020

FOSDEM means Free and Open Source Software Developers European Meeting

Reading Time: 4 min.

What do people talk about at FOSDEM 2020

FOSDEM 2020

Introduction

FOSDEM is a non-commercial, volunteer-organized European event centered on free and open-source software development. It is aimed at developers and anyone interested in the free and open-source software movement. It aims to enable developers to meet and to promote the awareness and use of free and open-source software. FOSDEM is held annually since 2001, usually during the first weekend of February, at the Université Libre de Bruxelles Solbosch campus in the southeast of Brussels, Belgium. The history of FOSDEM is neatly available at Wikipedia, while the current conference (2020) website is available here.

Risk Model Ontology

Risk Model Ontology

Reading Time: 2 min.

Semantic Web Technologies

DOAM Graph

The Risk Model Ontology is a framework that aims to represent and categorize knowledge about risk models using semantic web information technologies.

In principle any semantic technology can be the starting point for a risk model ontology. The Open Risk Manual adopts the W3C’s Web Ontology Language (OWL). OWL is a Semantic Web language designed to represent rich and complex knowledge about things, groups of things, and relations between things. OWL is a computational logic-based language such that knowledge expressed in OWL can be exploited by computer programs, e.g., to verify the consistency of that knowledge or to make implicit knowledge explicit. OWL documents, known as ontologies, can be published in the World Wide Web and may refer to or be referred from other OWL ontologies. OWL is part of the W3C’s Semantic Web technology stack, which includes RDF, RDFS, SPARQL, etc

Federated Credit Risk Models

Federated Credit Risk Models

Reading Time: 4 min.

The motivation for federated credit risk models

Representation of federated credit risk model estimation

Federated learning is a machine learning technique that is receiving increased attention in diverse data driven application domains that have data privacy concerns. The essence of the concept is to train algorithms across decentralized servers, each holding their own local data samples, hence without the need to exchange potentially sensitive information. The construction of a common model is achieved through the exchange of derived data (gradients, parameters, weights etc). This design stands in contrast to traditional model estimation where all data reside (or are brought into one computational environment).

Overview of the Julia-Python-R Universe

Overview of the Julia-Python-R Universe

We introduce a side-by-side review of the main open source ecosystems supporting the Data Science domain: Julia, Python, R, the trio sometimes abbreviated as Jupyter

Reading Time: 3 min.

Overview of the Julia-Python-R Universe

Jupyter

A new Open Risk Manual entry offers a side-by-side review of the main open source ecosystems supporting the Data Science domain: Julia, Python, R, sometimes abbreviated as Jupyter.

Motivation

A large component of Quantitative Risk Management relies on data processing and quantitative tools (aka Data Science ). In recent years open source software targeting Data Science finds increased adoption in diverse applications. The overview of the Julia-Python-R Universe article is a side by side comparison of a wide range of aspects of Python, Julia and R language ecosystems.

Open Source Securitisation

Open Source Securitisation

Reading Time: 5 min.

Open Source Securitisation

Motivation

After the Great Financial Crisis securitisation has become the poster child of a financial product exhibiting complexity and opaqueness. The issues and lessons learned post-crisis were many, involving all aspects of the securitisation process, from the nature and quality of the underlying assets, the incentives of the various agents involved and the ability of investors to analyze the products they invested in. While the most egregious complications involved various types of re-securitisation and/or the interplay of structured credit derivatives undoubtedly even vanilla securitisation structure has a considerable amount of business logic.

Python versus R Language: A side by side comparison for quantitative risk modeling

Python versus R Language: A side by side comparison for quantitative risk modeling

Reading Time: 3 min.

Python versus R Language

Python versus R

Motivation for the comparison

A large component of risk management relies on data processing and quantitative tools. In turn, such information processing pipelines and numerical algorithms must be implemented in computer systems.

Computing systems come in an extraordinary large variety but in recent years open source software finds increased adoption for diverse applications (machine learning, data science, artificial intelligence). In particular cloud computing environments are primarily based on open source projects at the systems level. This facilitates (but does not require) the use of open source computational tools such as python or R.

Stressing Transition Matrices

Stressing Transition Matrices

Reading Time: 1 min.

Release of version 0.4.1 of the transitionMatrix package focuses on stressing transition matrices

Stressed Density

Further building the open source OpenCPM toolkit this realease of transitionMatrix features:

  1. Feature: Added functionality for conditioning multi-period transition matrices
  2. Training: Example calculation and visualization of conditional matrices
  3. Datasets: State space description and CGS mappings for top-6 credit rating agencies

Conditional Transition Probabilities

The calculation of conditional transition probabilities given an empirical transition matrix is a highly non-trivial task involving many modelling assumptions. This version of the transitionMatrix includes a canonical implementation that assumes a Gaussian single factor process as the driver of the joint rating dynamics. The technical documentation is available under in Open Risk Manual under the transition matrix category.

Release 0.4 of transitionMatrix adds Aalen-Johansen estimators

Release 0.4 of transitionMatrix adds Aalen-Johansen estimators

Reading Time: 0 min.

Release of version 0.4 of the transitionMatrix package

Release 0.4

Further building the open source OpenCPM toolkit this realease of transitionMatrix features:

  1. Feature: Added Aalen-Johansen Duration Estimator
  2. Documentation: Major overhaul of documentation, now targeting ReadTheDocs distribution
  3. Training: Streamlining of all examples
  4. Installation: Pypi and wheel installation options
  5. Datasets: Synthetic Datasets in long format

Enjoy!

Version 0.4 of the Concentration Library adds geographic / industrial concentration functionality

Version 0.4 of the Concentration Library adds geographic / industrial concentration functionality

Reading Time: 1 min.

Release of version 0.4 of the Concentration Library adds Geographic / Industrial concentration indexes

Portfolio Map View

Further building out the OpenCPM set of tools, we release version 0.4 of the Concentration Library, a python library for the computation of various concentration, diversification and inequality indices.

The below list provides documentation URL’s for each one of the implemented classic indexes (the Hoover index is a new addition in this release)

NACE Economic Activity Pictograms

NACE Economic Activity Pictograms

Reading Time: 2 min.

Representing economic activity using pictograms

Visualization can produce significant new insights when applied to quantitative data. It is currently undergoing a renaissance that mirrors other developments in computing and data science. Sophisticated open source libraries such as d3.js or matplotlib, to name but a couple, are enabling an ever wider range of users to distill valuable information from the avalanche of data being produced.

Version 0.2 of the Open Risk API incorporates the standardized EBA portfolio data templates

Version 0.2 of the Open Risk API incorporates the standardized EBA portfolio data templates

Reading Time: 2 min.

Extending the Open Risk API to include the EBA Portfolio Data Templates

The Open Risk API provides a mechanism to integrate arbitrary collections of risk data and risk modelling resources in the context of assessing and managing financial risk. It is based on two key technologies of the modern Web, RESTful architectures and Semantic Data.

Release of version 0.3 of the Concentration Library

Release of version 0.3 of the Concentration Library

Reading Time: 0 min.

Release of version 0.3 of the ConcentrationMetrics Library

HHI vs Gini

Further building out the OpenCPM set of tools, we release version 0.3 of the ConcentrationMetrics Library. This python library for the computation of various concentration, diversification and inequality indices.

The below list provides documentation URL’s for each one of the implemented indexes