CoCalc -- mda14jx_Final Project.ipynb

Project: Jingyi Xie - Autumn2016/BMS353

Path: Autumn2016 / Week6 / mda14jx_Final Project.ipynb

Views: ⁴⁰³¹

Kernel: R (R-Project)

Step 1: Load packages with data from Bioconductor, library(affy) - mas5, rma, library(puma)

Step 2: Load and read data, create affybatch. Annotate with pData.

Step 3: Analysis of gene expression data with different methods and normalisation techniques.

Step 4: Diagnostics of the data with plotting techniques

Step 5: Differential Expression Analysis

For puma, combine the data using an bayesian Hierarchical model
Check the dimension and the pData() for the eset of the combined values. Calculate the FC and plot the data with a MA plot using the command ma.plot()
MAPlot
use of limma for DE analysis. Remember the three core steps of limma

Step 6: Visualisation of Data with PCA

perform PCA in R using the command prcomp()
It needs the traspose command t() since the input for the prcomp() wants the genes in the columns
For probabilistic PCA you can use pumaPCA()

Step 7: Hierarchical clustering of DE (Differentially Expressed) genes

To perform this we need to activate a library called gplots. We will use the command heatmap.2().
We do clustering a the selected genes from our DE analysis this is to search for patterns in of differentially regulatend pathways.

Step 8: Functional/Pathway analysis of DE targets using PANTHER or DAVID

In [ ]: