Explore projects
This project explores and implements relational data models for efficiently storing and querying large-scale genetic Variant Call Format (VCF) data. Using PostgreSQL, we present and evaluate eight distinct schema designs, showing that a traditional relational database can offer competitive performance, scalability, and full database management system (DBMS) features for complex genomic workloads, complementing specialized tools such as BCFtools and TileDB-VCF.
This project aims to create valid STAC elements from existing raster datasets.
Code examples and resources for using Kafka with Docker, Java, and Python
Java/Python code examples and PyFlink Docker image used in Flink lectures
End-to-end RSS news classification app using Spark MLlib, Kafka, Flask/RxJS
Parallelized computation, publication, and analysis of climate indices from climate projection time series.