System Analyst (Data Integration)

The System Analyst (Data Integration) will focus on integrating HKGI’s data lakehouse platform with internal systems and external partners.  The incumbent will be responsible for designing, implementing, and maintaining efficient data flows between HKGI’s data lakehouse and other systems while ensuring data quality, security, and compliance.  The incumbent will assume the following responsibilities:

 

Key Responsibilities

  • Design, develop, and maintain APIs and data transfer mechanisms (e.g., RESTful APIs, AWS S3 Upload) for seamless data exchange with external systems
  • Design and implement strategies for data replication, failover, and disaster recovery to ensure high availability and data durability, leveraging clustering technologies (e.g., Kubernetes) and load balancing
  • Establish and enforce data security policies, including access controls, encryption (data at rest and in transit), and audit logging, to protect sensitive genomic and clinical data
  • Analyse system performance, identify bottlenecks, and implement optimisations to enhance data throughput, response times, and overall system efficiency
  • Implement and monitor data validation processes, lineage tracking, and quality control checks to ensure the accuracy, consistency, and reliability of genomics data throughout its lifecycle
  • Collaborate with bioinformaticians, software engineers, and stakeholders to design and implement data integration solutions for various genomics data types (e.g., FASTQ, BAM, VCF, short-reads, long-reads) and clinical data (FHIR, HL7)
  • Perform any other duties assigned by senior officers