| Size: 5061 Comment:  |  ← Revision 28 as of 2009-07-07 00:48:05  ⇥ Size: 5054 Comment: converted to 1.6 markup | 
| Deletions are marked like this. | Additions are marked like this. | 
| Line 14: | Line 14: | 
| * [:DanieleMerico/HowtoDirectory/AffyCelCalSig: Importing Affymetrix CEL files, calculating MAS5 calls and signals][[BR]] CEL files are the almost-raw files generated after chip image processing by Affymetrix software; [[BR]] | * [[DanieleMerico/HowtoDirectory/AffyCelCalSig| Importing Affymetrix CEL files, calculating MAS5 calls and signals]]<<BR>> CEL files are the almost-raw files generated after chip image processing by Affymetrix software; <<BR>> | 
| Line 17: | Line 17: | 
| * [:DanieleMerico/HowtoDirectory/ExprSet: Importing Affymetrix CEL files, bothering about the R exprSet object, calculating MAS5 calls and signals][[BR]] if the experimental design is quite complex, or you are using a function requiring an expression set (exprSet),[[BR]] | * [[DanieleMerico/HowtoDirectory/ExprSet| Importing Affymetrix CEL files, bothering about the R exprSet object, calculating MAS5 calls and signals]]<<BR>> if the experimental design is quite complex, or you are using a function requiring an expression set (exprSet),<<BR>> | 
| Line 23: | Line 23: | 
| * [:DanieleMerico/HowtoDirectory/PCA_stats_princomp: using] the ''stats package'' function '''princomp'''  (covariance matrix) * [:DanieleMerico/HowtoDirectory/PCA_ade4_dudipca: using] the ''ade4 package'' function '''dudi.pca''' (covariance matrix) | * [[DanieleMerico/HowtoDirectory/PCA_stats_princomp| using the stats-package function princomp (covariance matrix)]] * [[DanieleMerico/HowtoDirectory/PCA_ade4_dudipca| using the ade4-package function dudi.pca (covariance matrix)]] | 
| Line 27: | Line 27: | 
| * '''2-class methods''' [[BR]] | * '''2-class methods''' <<BR>> | 
| Line 30: | Line 30: | 
| * [:DanieleMerico/HowtoDirectory/PLGEM: PLGEM][[BR]] | * [[DanieleMerico/HowtoDirectory/PLGEM| PLGEM]]<<BR>> | 
| Line 36: | Line 36: | 
| * proteomics: successfully applied to tandem mass-spec proteomics data, where the signal was generated as abundancy-normalized peptide counts (NSAF)[[BR]] | * proteomics: successfully applied to tandem mass-spec proteomics data, where the signal was generated as abundancy-normalized peptide counts (NSAF)<<BR>> | 
| Line 40: | Line 40: | 
| * SAM[[BR]] | * SAM<<BR>> | 
| Line 45: | Line 45: | 
| * proteomics: unknown[[BR]] | * proteomics: unknown<<BR>> | 
| Line 52: | Line 52: | 
| * [:DanieleMerico/HowtoDirectory/Distances: A few tips on distances] (especially for binary strings) | * [[DanieleMerico/HowtoDirectory/Distances| A few tips on distances]] (especially for binary strings) | 
| Line 56: | Line 56: | 
| * [:DanieleMerico/HowtoDirectory/Boxplots: Hacks for boxplots tuning] * [:DanieleMerico/HowtoDirectory/Identify: interacting with a scatter plot: the < identify > function] * [:DanieleMerico/HowtoDirectory/Legend: drawing a legend in a plot, the < legend > function] | * [[DanieleMerico/HowtoDirectory/Boxplots| Hacks for boxplots tuning]] * [[DanieleMerico/HowtoDirectory/Identify| interacting with a scatter plot: the < identify > function]] * [[DanieleMerico/HowtoDirectory/Legend| drawing a legend in a plot, the < legend > function]] | 
| Line 61: | Line 61: | 
| * [http://research.stowers-institute.org/efg/R/Graphics/Basics/mar-oma/index.htm A graphical description of the main graphical parameters for R graphs][[BR]] and [http://research.stowers-institute.org/efg/R/ a broader how-to for R graphics] | * [[http://research.stowers-institute.org/efg/R/Graphics/Basics/mar-oma/index.htm|A graphical description of the main graphical parameters for R graphs]]<<BR>> and [[http://research.stowers-institute.org/efg/R/|a broader how-to for R graphics]] | 
| Line 65: | Line 65: | 
| * Where is R installed in the Mac?[[BR]] | * Where is R installed in the Mac?<<BR>> | 
| Line 73: | Line 73: | 
| * [:DanieleMerico/HowtoDirectory/EclipseRplugin: how to install the R plugin for Eclipse] * [:DanieleMerico/HowtoDirectory/EclipseSubversion: how to install the Subversion plugin for Eclipse] | * [[DanieleMerico/HowtoDirectory/EclipseRplugin| how to install the R plugin for Eclipse]] * [[DanieleMerico/HowtoDirectory/EclipseSubversion| how to install the Subversion plugin for Eclipse]] | 
Daniele Merico - HowTo Directory
Affymetrix Microarray Analysis
Importing raw data and generating standard gene expression metrics (signals, calls, etc...)
- Importing Affymetrix CEL files, calculating MAS5 calls and signals 
 CEL files are the almost-raw files generated after chip image processing by Affymetrix software;
 the "fun" usually starts from the CEL files onwards; here's is the simplest things you can do with CEL files.
- Importing Affymetrix CEL files, bothering about the R exprSet object, calculating MAS5 calls and signals 
 if the experimental design is quite complex, or you are using a function requiring an expression set (exprSet),
 then, sorry, but you probably need to read this part instead of the previous one.
Data exploration by dimensionality reduction techniques
- How to perform on a data matrix (e.g. expression matrix)
Computing Differential Expression
- 2-class methods 
 these methods require a dicotomic classification of the samples (e.g. case vs control), and reproducibility of samples belonging to the same class- PLGEM 
 Features:- statistic used: corrected signal-to-noise, every gene treated as an independent entity; signal-to-noise is corrected according to an error model for the global estimation of varibility; - error model requires: linear relation between signal mean and standard deviation
 
- significance: estimated by randomly permuting the data (by column), and computing the statistic;
- recommended when: the number of replicates is uneven between case and control, with one of the two having very few, or just one replicate;
- proteomics: successfully applied to tandem mass-spec proteomics data, where the signal was generated as abundancy-normalized peptide counts (NSAF) 
 
 - Pubmed.ID: 15606915 (main)
- Pubmed.ID: 18029349 (proteomic application)
 
- statistic used: corrected signal-to-noise, every gene treated as an independent entity; signal-to-noise is corrected according to an error model for the global estimation of varibility; 
- SAM 
 Features:- statistic used: corrected signal-to-noise, every gene treated as an independent entity;
- significance: estimated by randomly permuting the data (by column), and computing the statistic;
- recommended when: the number of replicates is 3 or more, and even between case and control;
- proteomics: unknown 
 
 - Pubmed.ID: 11309499 (main)
 
 
General Computational Techniques
Computational Techniques for multi-dimensional data:
- A few tips on distances (especially for binary strings) 
Tuning Visualization in R
My stuff:
For a more general reference:
- A graphical description of the main graphical parameters for R graphs 
 and a broader how-to for R graphics
System: R & the Mac
- Where is R installed in the Mac? 
 As a former Windows user, I spent an hour trying to answer the following question: what is the f. location of R executables on the Mac? (i.e. where the hell are R files installed?) (where, of course, "f." stands for funny). The answer is quite straightforward if, instead of wasting time looking for them all round your Mac, you just read the R Mac OS X FAQ, under the chapter "uninstalling R". In my system (Mac OS X 10.5.1), the funny location of R files is:- Rgui:
- other R files: /library/frameworks/R.framework - for arcane reasons, the R plugin for Eclipse requires as folder of R executables: /Library/Frameworks/R.framework/Versions/.../Resources (where "..." is the version currently under use) 
 
 
The Eclipse Plug-in for Mac
- Eclipse can be used as a programming environment for R, and it can be also connected to Subversion (thus catching two pigeons with one bite)
