Skip to content
View mamonu's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Block or report mamonu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
mamonu/README.md

Hi there!! ๐Ÿ‘‹



๐Ÿ‘‹ I am Theodore

๐Ÿ“– I am particularly interested in: python spark scala dbt iceberg

๐Ÿ”ญ Iโ€™m currently working as a data engineer in the Ministry of Justice , building and maintaining a SCD2 solution on production pipelines using Amazon Athena / Apache Iceberg / dbt

๐ŸŒฑ Iโ€™m currently learning or getting better at:

  • how to use Apache Iceberg effectively iceberg

  • Scala scala



I can be reached on:


Twitter Badge linkedin


Open Source Programming Projects I have been helping maintain or maintaining myself


Name Frameworks Description
splink python spark Data Linkage at scale package
splink_graph python spark Graph Theoretical metrics at scale
splink_scalaudfs scala spark Scala Linkage UserDefinedFunctions for optimal performance

Some code stats





Yes. I am a cat person



Dont mind me I am just pressing keys randomly until something good happens on the screen

hey

Pinned Loading

  1. moj-analytical-services/splink moj-analytical-services/splink Public

    Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

    Python 1.4k 150

  2. moj-analytical-services/splink_scalaudfs moj-analytical-services/splink_scalaudfs Public archive

    Data linking functions in Scala, to be used in a Pyspark environment.

    Scala 4 7

  3. moj-analytical-services/splink_graph moj-analytical-services/splink_graph Public archive

    pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)

    HTML 10 3

  4. AdventOfCode2022InScala AdventOfCode2022InScala Public

    Scala

  5. MOJAthenaUDFs MOJAthenaUDFs Public

    record linkage UDFs for AWS Athena via AWS lambda interface

    Java 2

  6. Serum-to-E352-WT-converter Serum-to-E352-WT-converter Public

    A Serum to E352-Wavetable converter

    Python