About
π Hi, Iβm Szymon

Bioinformatics Expert | 9+ Years | Founder of ACTN3 Bioinformatics
I transform complex genomic data into actionable insights through reproducible pipelines,
cutting-edge analysis, and AI-powered workflows.
β‘ Named after the ACTN3 βSpeed Geneβ - symbolizing fast, efficient, high-performance bioinformatics
πΌ Professional Journey
π¬ Bioinformatician | Roche (via Avenga)
May 2017 - Present | Remote
Leading bioinformatics analysis for oncology and immunology research projects at one of the worldβs largest pharmaceutical companies.
Key Achievements:
βΈ Co-authored 6 publications in top-tier journals (iScience, Clinical Cancer Research, Cancer Immunology Research, EMBO Molecular Medicine)
βΈ Analyzed >800 patient samples across multiple clinical trials (OAK, POPLAR, gp28363)
βΈ Processed ~160,000 sgRNAs in genome-wide CRISPR screens
βΈ Developed 113-gene classifier with 87-91% cross-cohort validation
βΈ Built ML-ready datasets from scRNA-seq CRISPR perturbation screens
Technical Focus:
βΈ NGS analysis (RNA-seq, scRNA-seq, ChIP-seq, scTCR/scBCR-seq, WGS/WES)
βΈ CRISPR/ORF screen analysis
βΈ Machine learning & predictive modeling
βΈ Multi-omics data integration (TCGA, GEO, clinical trials)
βΈ Pipeline development (Snakemake, Docker, HPC)
βΈ R package development for internal workflows
π» LIMS Developer & Bioinformatician | Ardigen
September 2015 - April 2017 | KrakΓ³w, Poland
βΈ Designed & maintained LIMS software (StarLIMS, JScript.NET, T-SQL)
βΈ Developed R package supporting drug discovery workflows
βΈ Integrated bioinformatics tools with laboratory information systems
π§ͺ LIMS Developer & ISO Coordinator | Selvita
August 2014 - August 2015 | KrakΓ³w, Poland
βΈ Led StarLIMS implementation in collaboration with Abbott
βΈ Coordinated QA for ISO compliance during system deployment
βΈ Bridge between IT, lab operations, and quality assurance
π Visiting Student | University of Cambridge
May 2013 - September 2013 | Anne McLaren Laboratory
Project: Computational analysis of long non-coding RNAs during neural differentiation
Supervisor: Dr. Mark Kotter
Methods: R-based microarray analysis in stem cell biology
π οΈ Technical Skills
Core Bioinformatics
NGS Technologies:
β RNA-seq: limma-voom, edgeR, DESeq2, HTSeqGenie, GSNAP
β scRNA-seq: Seurat, Scanpy, UMAP, cell type annotation
β CRISPR screens: crisprVerse, MAGeCK, TMM normalization
β ChIP-seq, ATAC-seq, CUT&RUN: peak calling, motif enrichment
β scTCR/scBCR-seq: immune repertoire profiling
β WGS/WES: GATK, variant calling, COSMIC signatures
β Spatial transcriptomics: 10X Visium analysis
β Long-read sequencing: PacBio, Nanopore
β Methylation: WGBS, Infinium arrays
Analysis Methods:
β Pathway analysis: fgsea, MSigDB, IPA, Metacore GeneGO
β Differential expression: limma, edgeR, DESeq2
β Functional genomics: CRISPR/ORF screen hit calling
Machine Learning & Statistics
Unsupervised:
β Consensus NMF, hierarchical clustering, PCA/UMAP
Supervised:
β PAM, caret, feature engineering, cross-validation design
Clinical:
β Survival analysis: Cox models, Kaplan-Meier, survfit
β Statistical tests: ANOVA, t-tests, Mann-Whitney U, FDR correction
Software Development
R Package Development:
β Package structure: devtools, usethis, roxygen2
β Documentation: pkgdown websites, vignettes
β Testing: testthat, unit tests, CI/CD
β CRAN/Bioconductor submission standards
Workflow Engineering:
β Snakemake, Nextflow, Bash scripting
Reproducibility:
β RMarkdown, Quarto, literate programming
Infrastructure:
β Version control: Git, GitHub/GitLab, CI/CD
β Compute: HPC clusters, Docker, parallel processing
β Data wrangling: dplyr, data.table, FASTQ/BAM/VCF parsing
Visualization & Deployment
Static Graphics:
β ggplot2, ComplexHeatmap, volcano plots, survival curves
Interactive:
β Shiny apps, plotly, custom dashboards
Documentation:
β Quarto websites, dynamic RMarkdown, GitHub Pages
Data Sources & Integration
Public Databases:
β TCGA, GEO, EGA, SRA, cBioPortal, FoundationOne
Clinical Data:
β OAK/POPLAR trials, patient stratification
Platforms:
β Illumina (HiSeq/NovaSeq), 10X Genomics, PacBio, Affymetrix
π Languages
βΈ π¬π§ English: Fluent - Scientific communication, publications, international teams
βΈ π΅π± Polish: Native
π‘ What Drives Me
π Professional Interests (2025 & Beyond)
βΈ π€ AI in Bioinformatics: LLM-assisted analysis, automated QC, AI-powered variant calling
⸠𧬠Multi-Modal Omics: Spatial transcriptomics, single-cell + proteomics + epigenomics integration
βΈ π§ͺ Non-coding RNA: lncRNA, miRNA, circRNA in cancer & development; integration and correlation analysis between ncRNA and mRNA; identifying ncRNA regulators of protein-coding genes; regulatory network reconstruction
βΈ π¦ Software Development: R package development (Bioconductor standards)
βΈ βοΈ Workflow Engineering: Positron IDE + Quarto, Nextflow DSL2, cloud-native pipelines
βΈ π― CRISPR & Functional Genomics: Perturb-seq, base/prime editor screens, screen validation workflows
βΈ π₯ Precision Medicine: Real-time clinical decision support, pharmacogenomics, liquid biopsy, patient-specific modeling
π Personal Passions
βΈ π Continuous learning & skill development
βΈ π Licensed athletics referee
βΈ πͺ Healthy lifestyle & professional athletics training
βΈ π§ Work-life balance advocacy
π€ Letβs Work Together
Iβm passionate about turning complex genomic data into discoveries that matter.
Whether youβre a biotech startup, academic lab, pharma company, or CRO, letβs discuss how I can help accelerate your research with cutting-edge bioinformatics solutions.
Connect with me: