- Use seqbank to query paths for data
select server_experiment.id, extra_details, results_path, server_celltype.type from server_project
join server_experiment on server_project.id=server_experiment.project_id
join server_experimentanalysis on server_experiment.id=server_experimentanalysis.experiment_id
join server_celltype on server_experiment.cell_type_id=server_celltype.id
where server_project.name="vTR_HPV"
- No Transient (according to Phil [because of EBV paper])
- Only use stable
- vTR_control (HCT116-CL0168) FLpIn293
/integrative_analysis/ChIP/ATAC
TODO Nextflow [0/3]
-
Setup project directory
- What's the deal with the existing HPV directory?
- Nix container setup
- GitHub
TODO Early analysis
- Collect MultiQC outputs (should be in results directory from seqbank)
- wurl can be used to view multiqc reports
- Data quality "Pass" (exact)
- mixtape deseq pipeline (20% is used in the EBV paper)
- differential between all four strains (HPV strains) and high risk and low risk
- Look for BAM files in ATAC
- narrowPeak files (idr_conservative (convervation across replicates))
Ask about transient tests
- Only HPV 16
Make spreadsheet with urls to every multiQC report
- Use SQL query, then post process with wurl