Prof. Yudi Pawitan

Prof. Yudi Pawitan
Department of Medical Epidemiology and Biostatistics
PO Box 281 Karolinska Institutet
17177 Stockholm, Sweden

Phone: 46-8-5248 3983 
Fax    : 46-8-314 975
Email: yudi.pawitan@ki.se

Research interest

  • Statistical genetics, microarrays, family data
  • Biostatistics
  • Likelihood inference

Downloadables:

Software:

·         cnvpack_0.4.2.zip and cnvpack_0.4.2.tar.gz (Oct 09) Windows binary and unix source for finding common cnv regions.

·         Slr_0.1.6.zip (Apr 09) Windows binary for performing smoothed logistic regression for CGH data. Unix source slr_0.1.6.tar.gz.

·         Mwt_0.2.6.zip (Oct09) Windows binary for Moderated Welch Test for microarray data (Demissie et al Bioinformatics 2008). Unix source: mwt_0.2.6.tar.gz.

·         FLUSH.LVS.bundle_1.3.1.zip (Sep 09) Windows-binary-installation R package to compute LVS normalization (Calza et al, BMC Bioinformatics 2008) and FLUSH filtering (Calza et al, Nucl Acid Research 2007). FLUSH has been revised to allow various background corrections. The data example in RData format: FLUSH.RData. The Unix source: FLUSH.LVS.bundle_1.3.1.tar.gz. To see the work flow: type vignette(‘FLUSH’) in R.

·         smoothseg_0.0.2.zip (Nov 2009) Windows-binary-installation R package to compute smooth-segmentation of array CGH data, including the estimation of FDR for comparative studies. The Unix version: smoothseg_0.0.2.tar.gz.

·         OCplus.zip: (version 1.3.5, 5-March 2006) Windows-installed R package to compute theoretical and estimated operating characteristics of microarray data such as FDR, sensitivity etc plus sample size requirements, fitting mixture model, and computing local fdr. To use it,

1.      you first need to download and install R, then install the package from inside R (use the Packages menu).

2.      Then type: library(OCplus) to start it.

3.      Type library(help=OCplus) to see the list of available functions, and type, for example ?TOC for help.

4.      The details are given in Pawitan et al ‘FDR, sensitivity and sample size for microarray studies’ in Bioinformatics 2005. This package is now in Bioconductor.

·         OCplus.tar.gz: (version 1.3.5, 5-March 2006) gzipped file for Unix users.

·         ProSpect (Ver 0.3.5 – Sept 08) and rsmooth: zip files of Windows R packages for processing of SELDI protein spectra. You need to install both in R for Windows; additionally you also need to install ‘quadprog’ package. After running library(ProSpect), type ?ProSpect.README for a short description and an example of a complete run. NOTE: names are case sensitive.

·         ProSpect_0.3.5.tar.gz and rsmooth_1.0.tar.gz: Unix version of ProSpect

·         FLUSH.zip (version 1.1.0, Aug 2007) Windows-installed R package for gene filtering. Needs packages affy, affyPLM and quantreg. Run demo(FLUSH.tour) to start.

·         FLUSH_1.1.0.tar.gz (version 1.1.0, Aug 2007) gzipped version for Unix users

·         ELF-example.zip (June 2006): R codes and dataset to run the estimated latent FDR procedure.

Data:

·         The ProSpect package contains a subset of the spike-in data used in Tan et al (Bioinformatics 2006). This is the complete set:

1.      SpikeInFull.zip: preprocessed by Ciphergen for background correction and total ion normalization (what we used in the paper),

2.      spikein_xml.zip raw uncorrected data

3.      blank(corrected)_csv.zip and blank_xml.zip: corrected and raw blank scan data

Related to In all likelihood: Statistical modelling and inference using likelihood.  Oxford University Press, June 2001. You can download: 


Recent Publications

 

Books

Pawitan Y: In All Likelihood: Statistical modeling and inference using likelihood. 525 pages. Oxford University Press. 2001.

Lee Y, Nelder J and Pawitan Y: Generalized linear models with random effects. 396 pages, Chapman and Hall, July 2006.

 

Recent Articles

  1. Lichtenstein P, Yip BH, Björk C, Pawitan Y, Cannon TD, Sullivan PF, Hultman CM. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet. 2009 Jan 17;373(9659):234-9.
  2. Hong MG, Pawitan Y, Magnusson PK, Prince JA. Strategies and issues in the detection of pathway enrichment in genome-wide association studies. Hum Genet. 2009 Aug; 126(2):289-301. Epub 2009 May 1.
  3. Tan CS, Salim A, Ploner A, Lehtiö J, Chia KS, Pawitan Y. Correlating gene and protein expression data using Correlated  Factor Analysis.  BMC Bioinformatics. 2009 Sep 1; 10: 272
  4. Weichselbaum RR, Ishwaran H, Yoon T, Nuyten DS, Baker SW, Khodarev N, Su AW, Shaikh AY, Roach P, Kreike B, Roizman B, Bergh J, Pawitan Y, van de Vijver MJ, Minn AJ. An interferon-related gene signature for DNA damage resistance is a predictive marker for chemotherapy and radiation for breast cancer. Proc Natl Acad Sci U S A. 2008 Nov 25;105(47):18490-5. Epub 2008 Nov 10.
  5. Demissie M; Mascialino B; Calza S; Pawitan Y. Unequal group variances in microarray data analyses. Bioinformatics. 2008 May 1;24(9):1168-74. Epub 2008 Mar 14.
  6. Calza S, Valentini D, Pawitan Y. Normalization of oligonucleotide arrays based on the least-variant set of genes. BMC Bioinformatics. 2008 Mar 5;9(1):140 [Epub ahead of print]
  7. Calza S, Raffelsberger W, Ploner A, Sahel J, Leveillard T, Pawitan Y. Filtering genes to improve sensitivity in oligonucleotide microarray data analysis. Nucleic Acids Research. 2007 Aug 15; [Epub ahead of print]
  8. Huang J, Gusnanto A, O'Sullivan K, Staaf J, Borg A, Pawitan Y. Robust smooth segmentation approach for array CGH data analysis. Bioinformatics. 2007 Sep 15;23(18):2463-9. Epub 2007 Jul 27.
  9. Moger TA, Pawitan Y, Borgan O. Case-cohort methods for survival data on families from routine registers. Stat Med. 2008 Mar 30;27(7):1062-74
  10. Yip BH, Bjork C, Lichtenstein P, Hultman CM, Pawitan Y. Covariance component models for multivariate binary traits in family data analysis. Stat Med. 2008 Mar 30;27(7):1086-1095
  11. Gusnanto A, Calza S, Pawitan Y. Identification of differentially expressed genes and false discovery rate in microarray studies. Current Opinion in Lipidology. 2007 Apr;18(2):187-93.
  12. Salim A, Pawitan Y. Model-Based Maximum Covariance Analysis for Irregularly Observed Climatological Data. Journal of Agricultural, Biological & Environmental Statistics 12: 1-24, 2007.
  13. Ha ID, Lee Y, Pawitan Y. Genetic Mixed Linear Models for Twin Survival Data. Behavior Genetics. 2007Jul;37(4):621-30. Epub 2007 Mar 31.
  14.  Perelman E, Ploner A, Calza S, Pawitan Y. Detecting differential expression in microarray data: comparison of optimal procedures. BMC Bioinformatics. 2007 Jan 26; 8:28.
  15. Pawitan Y, Calza S and Ploner A. Estimation of false discovery proportion under general dependence. Bioinformatics 22: 3025 – 3031, 2006
  16. Finding regions of significance in SELDI measurements for identifying protein biomarkers. Bioinformatics (2006): Advance Access, 27 March 2006.
  17. Multidimensional local false discovery rate for microarray studies. Bioinformatics 22: 556-565, 2006.
  18. An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects and patient survival. Proceedings of the National Academy of Science (PNAS) 2005
  19. Multi-component variance estimation from binary traits in family based-studies. Genetic Epidemiology 2005.
  20. Gene expression profiling spares early breast cancer patients from adjuvant therapy. Breast cancer research 2005
  21. Bias in the estimation of false discovery rate in microarray studies. Bioinformatics 2005.
  22. Robust ascertainment-adjusted parameter estimation. Genetic Epidemiology  2005.
  23. Using correlations to evaluate low-level analysis procedures for high-density oligonucleotide microarray data. BMC Bioifnormatics 2005.     
  24. FDR, sensitivity and sample size for microarray studies. Bioinformatics 2005.
  25. NonGaussian smoothing of short transmission scans for PET whole body studies. IEEE Transaction in Medical Imaging. 2005.
  26. Maximal covariance analysis of two spatio-temporal processes. JRSS(C): Applied Statistics 2005.
  27. Modelling infectious disease transmission with complex exposure pattern and sparse outcome data. Statistics in Medicine. 2004.
  28. Estimation of genetic and environmental factors for binary traits using family data. Statistics in Medicine. 2004.
  29. Gene expression profiling for prognosis using Cox regression. Statistics in Medicine. 2004.
  30. Analysis and prediction of BSE in Ireland. Preventive Veterinary Medicine. 2004.
  31. Maternal and paternal contributions in the risk of preeclampsia. American Journal of Medical Genetics 2004.
  32. Improved grading of breast adenocarcinomas based on genomic instability. Cancer Research 2004.
  33. Risk and protective factors for Parkinson's disease: a study in Swedish twins. Annals of Neorology 2004.
  34. Profound alterations in breast cancer incidence may reflect changes into a westernized lifestyle. International Journal of Cancer 2004.
  35. Variable selection in random calibration of near-infrared instruments: ridge regression and partial least squares regression settings. Journal of Chemometrics. 2003.
  36. Extensions of Bartlett-Lewis model for rainfall processes. Statistical Modelling. 2003.
  37. Constrained clustering of irregularly sampled spatial data. Journal of Statistical Computation and Simulation. 2003.

List of older publications

Likelihood Modelling and Inference

  1. In All Likelihood: modelling and inference using the likelihood. 2001. Oxford University Press. 
  2. Estimating variance components in generalized linear mixed models using quasi-likelihood. Journal of Statistical Computation and Simulation, 2000.
  3. Computing empirical likelihood from the bootstrap. Statistics and Probability Letters, 2000 
  4. Reminder of the fallibility of Wald statistic: likelihood explanation. American Statistician, 2000

Time series analysis

  1. Quasi-likelihood estimation of non-invertible moving average processes. Scandinavian Journal of Statistics, 2000 
  2. Consistent estimation of noncausal nonGaussian autoregressive processes. Journal of Time Series Analysis, 1999.
  3. Whittle likelihood. Encyclopaedia of Statistical Science, 1999.
  4. Change point problems. Encyclopaedia of Biostatistics, 1999
  5. Seasonal time series. Encyclopaedia of Biostatistics, 1999
  6. Coherence between time series. Encyclopaedia of Biostatistics, 1999
  7. Automatic estimation of coherence of bivariate time series. Biometrika, 1996
  8. Penalized Whittle likelihood estimate of spectral density functions. Journal of American Statistical Association, 1994
  9. Efficient bias corrected nonparametric spectral estimation. Biometrika, 1991
  10. Spectral estimation and deconvolution for a linear time series model. Journal of Time Series Analysis, 1989
  11. Modelling mortality fluctuations in Los Angeles as functions of pollution and weather effects. Environmental Research, 1988

Statistical methods in medical imaging

  1. Mixed inverse problems arising in the estimation of PET calibration factors.Journal of the Royal Statistical Society, Series C, 1998
  2. PET system calibration and attenuation correction. IEEE Transaction on Nuclear Science, 1997. 
  3. Bandwidth selection for indirect density estimation. Journal of American Statistical Association, 1996. 
  4. Multivariate density estimation by tomography. Journal of the Royal Statistical Society, Series B, 1993. 
  5. Data dependent bandwidth selection for emission computed tomography. IEEE Transactions on medical Imaging, 1993. 
  6. Reducing negativity artifacts in emission tomography. IEEE Transactions on Medical Imaging, 1993. 
  7. Discussion of ``From image deblurring to optimal investment: maximum likelihood solutions for positive linear inverse problems'' by Y. Vardi and D. Lee. Journal of the Royal Statistical Society, Series B, 1993

Biostatistics: methods and applications

  1. Association between ease of suppression of ventricular arrhythmia and survival. Circulation, 1995. Note: Comment in: Circulation 91(1): 245-7, 1995.
  2. Modelling disease markers in acquired immunodeficiency syndrome. Journal of American Statistical Association, 1993. 
  3. Identification of secondary peak in myocardial infarction onset 11 and 12 hours after awakening. Journal of American College of Cardiology, 1993. 
  4. Methods for assessing quality of life in the Cardiac Arrhythmia Suppression Trial. Quality of Life Research, 1992. 
  5. Effects of advancing age on the efficacy and side effects of antiarrhythmic drugs. Journal of the American Geriatric Society, 1992. 
  6. Modeling a marker of disease progression and onset of disease. AIDS Epidemiology: Methodological Issues, 1992. 
  7. Congestive heart failure with preserved left ventricular function. Journal of American College of Cardiology, 1991. 
  8. Events in Cardiac Arrhythmia Suppression Trial: Analysis of the placebo group. Journal of American College of Cardiology, 1991. 
  9. Prevalence, characteristics and significance of ventricular arrhythmia in the Cardiac Arrhythmia Suppression Trial. American Journal of Cardiology, 1991. 
  10. Increased risk of deaths and cardiac arrests from encainide and flecainide in patients after non-Q-wave myocardial infarction. American Journal of Cardiology, 1991. 
  11. Statistical interim monitoring of the Cardiac Arrhythmia Suppression Trial. Statistics in Medicine 1990. 
  12. Effect of encainide and flecainide on mortality in a randomized trial of arrhythmia suppression after myocardial infarction. New England Journal of Medicine, 1989. 

General

  1. Selecting random numbers for the lotto. Journal of Statistical Education, 1999.
  2. Two-sided P-values from discrete asymmetric distributions. Statistician: Journal of the Royal Statistical Society, Series D, 1997

Powered by counter.bloke.com