W14: Intro to Modern Statistics with R

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

Through this seminar, attendees will walk away knowing when and how to run modern versions of traditional statistical analysis. These tests and the underlying bioinformatical lesson about resampling will be of use to most scientific disciplines. The course makes no assumptions about familiarity with traditional statistics – we will simply go through relatable experimental examples […]

W11: Metagenomics Analysis with Python and R

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This workshop provides an introduction to the microbiome analyses from the raw sequence data generated from the next-generation sequencing platforms. We will cover how to perform the 16S rRNA-based analysis using an open-source bioinformatics pipeline QIIME. We will also cover some downstream analyses of the microbiome data beyond QIIME, including statistical analyses and functional analyses.

W8: Variant Calling with GATK

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This workshop uses materials developed by the Broad Institute to teach Variant Discovery with GATK.  Attendees with no prior experience in variant calling are recommended to review all of the materials below before coming to the workshop. This early preparation will allow a focus on the specific issues of running GATK on the UCLA hoffman2 […]

W9: Intro to Python

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This workshop will cover the basic concepts of Python programming. The course is supplemented with many hands-on exercises with emphasis given towards computational biology use cases.

W1: Unix command line I

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

Unix is a command-line-based platform that is a highly powerful and flexible tool for data management and analysis. First, this workshop introduces the basic concepts of UNIX operating system and shell scripting. We will explore essential hands-on skills to confidently use the command line interface on either a local (laptop) or a remote (hoffman2 cluster) […]

W2: Using NGS Analysis Tools

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

High-throughput sequencing technology involves a number of concepts and techniques that shape a project before application-specific processes are utilized. First, this workshop introduces the more “universal” aspects of high-throughput sequence analysis—from experimental design to sequencing and alignment methods. Next, this workshop covers common file formats for sequence data and limitations of sequencing technologies. We will […]

W18: Advanced Python

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This workshop will cover some more advanced topics in python including an overview of object-oriented python (this will not be an in-depth course on object-oriented programming), use of the numpy and pandas libraries (python libraries for efficient handling of large numeric and heterogenous datasets, and matplotlib for plotting results. At the end of this workshop, […]

W1b: Intro to Unix command line II

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This workshop (UNIX Command Line II) continues Workshop W1: UNIX Command Line I and uses the Hoffman2 campus computing cluster. The focus is on features that make dealing with large files or large numbers of files or repetitive tasks easier. These include shell variables, substitutions, redirections, pipes, loops, conditionals, subshells, shell functions, and shell scripts. […]

W3: Intro to R and Data Visualization

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

R (www.r-project.org) is a free software environment for statistical computing and graphics. First, this workshop introduces basic concepts, syntax, and usage in R programming, statistical analysis, and visualization techniques. We will conduct hands-on tutorials throughout the session, giving attendees a chance to see R in action. This course is a pre-requisite for several other Collaboratory […]

W37: Applications of Large Language Models

529 Boyer Hall 611 Charles E Young Dr E,, Los Angeles, CA, United States

This 3-day interactive workshop introduces the overarching principles guiding generative modeling and specifically Large-Scale Language Models (LLM), their application in Python for inference, and specific use-cases in Genomics. Experience with Python is necessary, and basic knowledge about ML workflows is preferred. At the end of this workshop, you WILL be comfortable with loading, inferencing and […]