CLCG Tutorials

This document is a working draft of several outlines of tutorials for both staff and students of the CLCG (Coordinated Laboratory for Computational Genetics and Genomics).


I. UNIX Basics ( Tutorial) ( PPT#1) ( PPT#2)
A. Filesystem structure
B. Accounts
C. Processes/Daemons
D. Environment Variables
E. Shells
F. Permissions (chmod, uname)
G. X-windows

II. UNIX Commands ( Unix Commands) ( VI ref) ( VI ref#2) ( PPT-CMMB)
A. process monitoring and control (ps -aux, ps -efl, top, kill)
B. network routing (route, netstat, ifconfig, ping, traceroute)
C. file finding (find, locate, grep, ls, tail, more, less)
D. file manipulation (rm, rmdir, mkdir)
E. file archiving (tar, gzip, zip, uncompress, bzip2)
F. using man !!!
G. Basic shell scripting (foreach)
H. Editors (vi, pico, nedit, emacs)

III. UNIX Security ( Tutorial )
A. Types of issues that can be problems (local, remote, DoS)
B. Processes that don't need to be run on every computer.
C. Log files
D. Permissions (SUID/SGID)


IIIA. UNIX Administration ( Tutorial )
A. Account Management
B. Filesystems
C. Daemons
D. Backups
E. RPM
F. File permissions

IV. Perl Basics ( Tutorial) ( PPT-scheetz)
A. Data Types
B. File I/O (print, printf, <>, open, close)
C. Back-ticks (executing commands on the system)
D. Pattern Matching, Substitution and Replacement
E. Functions


V. Advanced Perl
A. Modules and Classes (how they are different)
B. Important built-in functions not previously covered (mkdir, reverse)
C. Arrays of hashes, hashes of arrays, hashes of hashes
D. BioPerl (parsing of blast reports)

VI. Hardware Basics (PC)
A. Components
B. Devices (HD, sound, modem, NICs, video, ...)
C. CPU
D. Memory (RAM and cache)
E. Busses (PCI, ISA, IDE, SCSI - variants)
F. I/O (Serial, parallel, USB, Firewire/IEEE1394)

VII. Databases ( Tutorial-Web - Updated 4/15/02) ( PPT-CMMB) (External-Web-Intro) (External-Web-Intro #2) (SQL for Web Nerds)
 
A. Introduction - What are they?
B. Definitions
Server, Database, Table, Field, Types, NULL, indices, identity, foreign keys, primary keys
C. SQL summary
select, insert, update, delete, joins, sub-queries, like, create table, create index, drop table
D. Brief summary of how to connect from various languages
1. Java
2. Perl
3. ISQL


VIII. Advanced Databases (Sybase)
A. Creating a server
B. Creating a database
C. Useful commands
1. sp_help
2. sp_helpdb
3. other sp_help*
D. Transaction log
1. "... is full". Now what?
E. Backing up the database.


IX. Molecular Biology Basics ( Tutorial) ( PPT-CMMB#1) ( PPT-CMMB#2) (WEB-nonUI)

A. Intro/Definitions
B. Transcription
C. Translation
D. Pedigrees (needs to be relocated)



X. Gene Discovery ( Tutorial )
A. cDNA libraries
B. EST sequencing
C. normalization
D. Subtraction
E. Clustering



XI. Disease-gene Candidates (Tutorial - Updated 11/23/02)
A. What is a "disease"-gene?
B. Methods for locating/identifying disease genes.
1. Candidate
2. Positional
3. Positional-candidate
C. Information that can be useful
* Sequence, Protein domains, Functional classification, Cellular Localization, Expression, Similarity to other known genes (poss. in other organisms), Protein interactions, <just about everything>
D. Evaluating candidates
1. Looking for mutations that are correlated to the disease-state
* SSCP, Sequencing, Southern, Northern
2. Verifying candidates
* mouse-knockout, rescue in a model organism
3. Functional classification


XII. Mapping ( Tutorial) ( PPT-CMMB)
A. Types of Mapping
1. Genetic
2. Physical
B. Genetic Mapping

C. Somatic Cell Mapping

D. Radiation Hybrid Mapping

E. Processing/Details
     A. Selection of Candidates
     B. Primer Orders
     C. Testing (and outcomes)
     D. Mapping
     E. Scoring
     F. Placement
     G. Database Loading
     H. Database Interface
     I. BLASTing
     J. comparative map construction
     K. Database Loading
     L. Database Interface



XIII. Expression Studies ( Tutorial) ( PPT-CMMB )
A. Summary
B. Northern Blots, Western, Immuno
C. Microarray
D. SAGE
E. Details
A. selection of probe set (summarize recent discussions)
B. rearray, sequence verification, DNA preps
C. creating the microarray slides
D. selection of target
E. target preparations
F. hybridization
G. scanning
H.image parocessing/normalize
I. data extraction
J. data analysis
K. database schema
L. database interface/tools


XIV. Full-Length Sequencing
I. GOALS

II. High-level view

III. Details of processing
    A. select set of clones likely to be FL
       * 5pC
       * sequence similarity (BLAST-based) (should go against dbEST tool?)
       * tail/signal status
    B. verify non-redundancy
    C. filesystem structure (MGC)
    D. phrap assembly
    E. consed
    F. Finishing
    G. summary/interfaces

IV. Contact
    * TomC
    * ChadR
    * TomB
    * Jesse (phrap, consed)
    * Todd
    * Einat


XV. Genotyping

I. GOALS/BACKGROUND/PROJECTS (Genotyping and Screening)
    A. Background on genotyping
    B. Background on gene structure, screening, and SSCP

II. High-level
    * GT -> Linkage -> candidates -> screening

III. Details of Processing
    A. genotyping/GenoScape/GenoMap
    B. export of data
    C. Linkage analysis
    D. Identification of candidates
    E. Automated annotation
       * BLAST
       * domains
       * correlation to exon
       * secondary structure
       * others ??
    F. prioritization of candidates
    G. genomic structure
    H. primer design
    I. screening
    J. tools and database (under design)

IV. CONTACTS
    * Terry
    * Todd
    * TomC
    * Jesse (genomic structure, domains)



Last edited: November 19, 2002 by TS