# -----------------------------------------------------------------------------
# SOURCE TEXT FOR SLIDES.SH
# ADAPT THE VARIABLES BELOW AS APPROPRIATE
# !!!!! PLEASE DON'T PUT ANY BLANK LINES IN THE SOURCE !!!!!
# ----- DIRECTORIES AND FILES -------------------------------------------------
destination_dir=.             # you can also use the command line argument -d
images_dir=../images          # you can also use the command line argument -i
prefix=slide                  # of the HTML file
suffix=html                   # or "htm"
index_name=index              # or "homepage"
# ----- MISCELLANEOUS ---------------------------------------------------------
table_height=80               # height (in percentage) of central table / page
table_width_std=95            # width (in percentage) central table / std page
table_width_toc=60            # width (in percentage) central table / page for
                              # table of contents
font_face="Arial,Helvetica,sans-serif" # of the document's characters
table_of_contents=yes         # you can also use the cmd line argument -t (yes)
max_line_contents=12          # max lines by page for table of contents
table_of_contents_text="Table of contents"
table_of_contents_cont="(continued)"
background_color="#ffffff"
#background_img="my_background.gif"
auto_pull_wait=0              # a value > 0 makes an auto slideshow with "value"
                              # seconds wait
separator="§"                 # divides title and blue complementary text ^q^" on my emacs
# ----- NAVIGATION LINE -------------------------------------------------------
color="#ffffcc"
start_img_presence=no         # yes enables the "Start" button (link to first slide)
start_img_alt="Start"
previous_img_alt="Previous"
next_img_alt="Next"
# ----- HTML HEADER and ELEMENT for PAGE 1 ------------------------------------
author="Humberto Ortiz Zuazaga" # META name="author" and element page 1
keywords="bioinformatics, microarray, cluster" # META name="keywords"
charset="iso-8859-1"
# ----- HTML FOOTER -----------------------------------------------------------
ruler_width=580
tel_text="Telephone"
fax_text="Fax"
email_text="E-mail"
organization="UPR High Performance Computing facility"
adr1="PO Box 23334"
adr2="San Juan PR 00931-3334"
email="humberto@hpcf.upr.edu"
tel="(787) 758-3054"
fax="(787) 758-3058"
logo_img=linux-hpcf-tiny.jpg
logo_alt="HPCf GNU/Linux"
logo_height=47
logo_width=100
# -----------------------------------------------------------------------------
# DON'T MODIFY THE "--" SEPARATOR !
# THE FIRST LINE AFTER IS THE PRESENTATION TITLE. DON'T PUT A COMMENT !
--
Clustering Microarray Expression Data
:: ----- this is a comment ----------------------------------------------------
Bioinformatics§The application of computer science to the study of biological problems.
Clustering§Dividing the elements of a set into related subsets based on a distance metric among elements.
:: ---
Question§What other biological problem groups elements based on their "distance"?
:: ---
Reviews
Patrik D'Haeseleer
!hhttp://www.cs.unm.edu/~patrik/
Doctoral Thesis!
Bioinformatics article PMID: 11099257
!hhttp://www.cs.unm.edu/~patrik/networks/Bioinforev.pdf
:: ---
Distance Metrics I
Euclidean distance
Mutual information
Correlation coefficient
:: ---
Distance Metrics II§Euclidean distance: distance according to Euclid's formula for geometric distance.
Distance between points in n-dimensional space
!i3d.png
:: ---
Common clustering techniques
Hierarchical - Eisen et al
K Means, Fuzzy K Means
Self Organizing Maps (SOM) - GENECLUSTER
Support Vector Machines (SVM)
Clique graphs - Amir Ben-Dor
:: ---
Exercise§Use a phylogeny program (like neighbor in phylip) to group the sample data.
Phylip homepage
!hhttp://evolution.genetics.washington.edu/phylip.html
The sample
!lexpr.txt
!lexpr2phylip.py
The phylip sample
!linfile
!loutfile
What elements are most related?
What groupings of elements stand out?
:: ---
CLUSTER
Eisen et al PNAS
!hhttp://rana.lbl.gov/papers/Eisen_PNAS_1998.pdf
!hhttp://rana.lbl.gov/
Free software and manuals (registration required)
Question: what clustering technique and distance function?
:: ---
GENECLUSTER
Tamayo et al PNAS
!hhttp://www.pnas.org/cgi/content/abstract/96/6/2907
MIT Molecular Pattern Recognition site
!hhttp://www-genome.wi.mit.edu/MPR/
Free software and manuals (registration required).
Tutorial at
!hhttp://www-genome.wi.mit.edu/MPR/GeneCluster/GeneCluster2.html
Question: what clustering technique and distance function?
:: ---
GENECLUSTER clustering technique
Self Organizing Maps
!hhttp://www-genome.wi.mit.edu/MPR/SOM.html