Curriculum Vitae

Yihui Xie

Contact Information

14807 Pratt CT Apt 301, Omaha NE 68116

Email: xie@yihui.name; Homepage: http://yihui.name/

Research Interest

Statistical computing and graphics / data visualization

Education Background

  • 2009 ~ 2013, PhD in Statistics, Department of Statistics, Iowa State University
  • 2006 ~ 2009, Master of Statistics, School of Statistics, Renmin University of China
  • 2002 ~ 2006, Bachelor of Economics, School of Statistics, Renmin University of China

Honors & Awards

  • Snedecor Award, 2012 (the most outstanding PhD candidate in the Department of Statistics, Iowa State University)
  • Honorable Mention Prize, Applications of R in Business Competition, 2012
  • Vince Sposito Statistical Computing Award, 2011 (Department of Statistics, Iowa State University)
  • John M. Chambers Statistical Software Award, ASA, 2009 (for the R package animation)
  • Excellent graduate scholarship, Renmin University of China, 2007
  • Successful Participant, Mathematical Contest in Modeling (America), 2006 (paper; certificate)
  • Excellent graduate of Renmin University of China, 2006
  • “Excellent Worker in Student Media” of Renmin University of China, 2006 (for hosting the website of statistics discussions in http://cos.name)
  • Scholarship for outstanding learning, Renmin University of China, 2005

Publications

In Chinese

  1. Yanyun Zhao, Nailin Feng, Yuncheng Zhao, Yihui Xie, Ning Xiao, 2008, Research on the National Quality Competitiveness of China, Statistical Research, 25(3), 12-19
  2. Yihui Xie, 2008. On the Applications of Statistical Graphics in Data Analysis, Statistics Review, Bo Zhang (ed)
  3. Yihui Xie, 2008. Bootstrap Methods: Developments and Frontiers, Statistics & Information Forum, 89, 90-96.
  4. Yihui Xie, 2007. The Application of the Classification and Regression Tree Based on the Package rpart in R, Statistics & Information Forum, 86, 67-70.
  5. Yihui Xie, 2007. Series in International Competitiveness: How Weak is the Financial System of China?, China Statistics, 7, 35.
  6. Yanyun Zhao, Yihui Xie, 2006. Unbalanced growth in top 100 industrial enterprises in China: market expanding outgoes business managing, Talents, 9, 84.
  7. Yanyun Zhao, Yihui Xie, 2005. Who is the champion of growth among top 1000 industrial enterprises in China: analysis on the capability of market expanding and aggregate efficiency, Talents, 9, 72-73.

In addition, I have published several articles in Chinese in the Capital of Statistics.

In English

  1. Yihui Xie (2015). Dynamic Documents with R and knitr. 2nd edition. Chapman and Hall/CRC. ISBN 978-1498716963
  2. Yihui Xie, Heike Hofmann, and Xiaoyue Cheng (2014). Reactive Programming for Interactive Graphics. Statistical Science, 29(2), 201-213. (PDF)
  3. Yihui Xie (2014). knitr: A Comprehensive Tool for Reproducible Research in R. In Victoria Stodden, Friedrich Leisch and Roger D. Peng, editors, Implementing Reproducible Computational Research. Chapman and Hall/CRC. ISBN 978-1466561595
  4. Yihui Xie (2013). Dynamic Documents with R and knitr. Chapman and Hall/CRC. ISBN 978-1482203530
  5. Yihui Xie (2013). animation: An R Package for Creating Animations and Demonstrating Statistical Methods. Journal of Statistical Software, 53(1), 1-27. URL http://www.jstatsoft.org/v53/i01/
  6. Yihui Xie and Xiaoyue Cheng, 2008. animation: A Package for Statistical Animations, R News, 8(2). URL: http://www.r-project.org/doc/Rnews/Rnews_2008-2.pdf
  7. Yihui Xie, 2007. Visualization of Data and Statistical Models Using R, Proceedings of the 3rd Conference on Australia-China Free Trade Agreement (ACFTA): Econometric Modeling and Policy in Trade, Investment and Economic Integration (paper)

Professional Experience

I’m currently a software engineer at RStudio, Inc.

Internship

  1. Summer 2012, AT&T Labs Research, Florham, NJ (mentor: Simon Urbanek)
  2. Summer 2013, Fred Hutchinson Cancer Research Center (mentor: Raphael Gottardo)

Web Construction and Management

Sept 2003 - Present

I have built and maintained web pages for:

  • The School of Statistics, Renmin University of China (Sept 2003 - Jun 2009)
  • The Center for Applied Statistics, Renmin University of China (Apr 2004 - Jun 2009)
  • The Center for Risk Management and Actuarial Studies (Jun 2004 - Apr 2006); Data Mining Center of Renmin University of China (Jul 2004 - Apr 2006), etc
  • The “Great Expectation Society”, a student society in Renmin University (Mar 2005 - Jul 2006)

I have built and been hosting this web forum:

I have been working in these web studios:

  • The English web studio of Renmin University of China (for the construction of English website of the university) (Sept 2006 - Jun 2009)
  • The web club of the School of Statistics (as the founder & leader) (Apr 2006 - 2008)

My own projects:

Conferences Organization

Feb 2005 - Present

I organized

  • the 1st Chinese R conference (in Beijing), Dec 13~14, 2008

and I helped organize these conferences:

  • the 3rd Chinese R conference (Beijing), Jun 12~13, 2010
  • the 2nd Chinese R conference (Beijing & Shanghai), December 2009
  • Second Conference on Micro Evidence on Innovation in Developing Economies, April 22 ~ 23, 2008 (an international conference; I was also the Chair for the last session on April 22)
  • 2nd Conference on Australia-China Free Trade Agreement (ACFTA): Econometric Modeling and Policy in Trade, Investment and Economic Integration, April 17th ~ 18th, 2006
  • 15th International Conference on Input-Output Techniques, June 27th ~ July 1st, 2005
  • Data Mining and Business Intelligence Conference 2006
  • International Statistics Forum 2006

Statistical Consulting

May 2005 - Present

I offer statistical consulting by email and I have been answering questions in the Capital of Statistics almost every day. I’m particularly interested in these topics:

  • data analysis and visualization
  • theoretical problems without measure theory
  • linear models, generalized linear models and nonlinear models
  • R packages that I maintain (other questions on R should go to R-help or StackOverflow)
  • statistics in other disciplines (e.g. biology) in which I can also learn something from you
  • real world data mining
  • simulation
  • teaching

I will ignore any questions on Structural Equation Modeling (SEM) or factor analysis, as I do not believe in their usefulness at all. I know little about time series and do not like econometrics.

Other Work Experience

Sep 2004 - present

Duties in Associations & Centers:

  • Program chair, Section on Statistical Graphics, American Statistical Association (2015)
  • Data Sharing and Reproducibility Committee, American Statistical Association (2013)
  • Column editor of the ICSA bulletin for its R column (2012 - 2013)
  • Webmaster of the Center for Applied Statistics, Renmin University of China (2004 - 2009)
  • Vice-chairman of the “Great Expectation Society” (a student association, Apr 2005 - Jul 2006)

Projects Participated:

  • Google Summer of Code 2012: Dynamic report generation in the web with R (mentor)
  • Translation of R for beginners by Emmanuel Paradis (cooperated with some colleagues to finish the translation) (Nov 2005 - Feb 2006)
  • Performance Evaluation of Taxation Management of the China Customs (designed several R programs for sampling databases) (Sept 2006 - Jun 2009)
  • Important Projects in Social Science of the Ministry of Education: Research on the Industry Competitiveness of China (Apr 2007 - May 2007)
  • A research project on statistical quality control co-operated with China National Research Institute of Food and Fermentation Industries, National Standardization Centre of Food & Fermentation Industry, P.R.C.; R (D)COM was used in developing programs (May 2007 - Jun 2009)
  • Cooperative project with the Office of 1% Population Sampling Survey of the State Department: Research on the National Population Competitiveness of China (Jun 2007 - Nov 2007, report finished)

Presentations & Lectures

  • Yihui Xie, 2015, Reproducible Reports with Big Data, Strata, New York City (slides)
  • Yihui Xie, 2015, Escape the Land of LaTeX/Word for Statistical Reporting: The Ecosystem of R Markdown
    • March, Wright State University, Dayton, Ohio (slides)
    • July, Central Iowa R User Group
    • September, ASA webinar (slides)
    • October, RStudio webinar (slides; video)
  • Yihui Xie, 2015, Boosting R Skills and Automating Statistical Reports, New England Statistical Symposium, UConn
  • Yihui Xie, 2015, HTML Widgets, LA R User Group (slides; video)
  • Yihui Xie & Dirk Eddelbuettel, 2014, An R Workshop at the Warren Center of UPenn, Philadelphia, PA
  • JJ Allaire & Yihui Xie, 2014, Reproducible R Reports with Packrat and Rmarkdown, R Day at Strata New York
  • Yihui Xie, 2014, Reproducible Research Is Dead, Long Live Reproducible Research!, a talk given at JSM 2014, Boston, MA (slides)
  • Yihui Xie, 2014, Knitr Ninja, a talk given at UseR! 2014, UCLA (slides)
  • Yihui Xie, 2014, Dynamic Documents with R and knitr, a tutorial given at UseR! 2014, UCLA (slides; examples)
  • Yihui Xie, 2014, The Next Generation of Dynamic Documents with R Markdown, Great Plains R-Users Group Conference, Sioux Center, Iowa (slides)
  • Yihui Xie, 2014, How I Managed to Find Time to Play Badminton Three Times A Week, a talk in the Department of Animal Science, ISU (slides)
  • Yihui Xie, 2014, New developments in knitr and R Markdown v2, LA R user group (slides)
  • Yihui Xie, 2014, Dynamic and Interactive Statistical Reporting, a talk in the Department of Statistics, UCLA (slides)
  • Yihui Xie, 2014, How to Save More Time from Your Homework and Projects, a guest lecture for Stat 416 at Iowa State University (slides)
  • Yihui Xie, 2014, An Introduction to knitr, a remote talk for the Orange County R User Group (slides)
  • Yihui Xie, 2014, R Markdown v2, Seattle R User Meetup, Seattle, WA (slides)
  • Yihui Xie, 2013, You Can Save More Time from Homework/Projects/Data Analysis for Beer/Fun/Whatever, Iowa Chapter of ASA poster session with mixer (poster)
  • Yihui Xie, 2013, knitr for ninja, a distance talk at the New York R Users Group (slides)
  • Yihui Xie, 2013, A tour through knitr, Raphael Gottardo’s research lab, Fred Hutchinson Cancer Research Center, Seattle, WA (slides; same talk given to the stats group)
  • Yihui Xie, 2013, An Introduction to knitr, Bioconductor 2013, Seattle, WA (slides, R Markdown)
  • Yihui Xie, 2013, The Development of R in China, and One Sentence Summaries of My R Projects, Revolution Analytics, Seattle, WA (slides)
  • Yihui Xie, 2013, Doing Reproducible Research Unconsciously: Higher Standard But Less Work, The 36th annual Midwest Biopharmaceutical Statistics Workshop, Muncie, IN (slides)
  • Yihui Xie, 2013, Lessons Learned in Developing R Packages, The 6th Chinese R Conference, Beijing (slides)
  • Yihui Xie, 2013, knitr: A General-purpose Tool for Dynamic Report Generation in R, ENAR 2013, Orlando, FL (slides)
  • Yihui Xie, 2013, Reproducible Research on the Web: From Homework, Blogging to Open Journals, SIAM CSE13, Boston, MA (slides)
  • Yihui Xie, 2012, knitr: Starting From Reproducible Homework, Workshop on Reproducibility in Computational and Experimental Mathematics, ICERM, Brown University, Providence, RI (slides)
  • Yihui Xie, 2012, Can You Reproduce Your Homework?, Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD (slides)
  • Yihui Xie, 2012, I web, therefore I am, ISU Computing Working Group (slides)
  • Yihui Xie, 2012, Dynamic Report Generation and Interactive Graphics on the Web, End of Summer talk at AT&T Labs as a summer intern, Florham Park, NJ
  • Yihui Xie, 2012, cranvas: A New Package for Interactive Statistical Graphics in R, Joint Statistical Meetings at San Diego, CA
  • Yihui Xie, 2012, cranvas: Interactive statistical graphics in R based on Qt, useR! 2012, Nashville, TN (R code)
  • Yihui Xie and JJ Allaire, 2012, New Tools for Reproducible Research with R (slides)
    • 05/30 Iowa State University
    • 05/30 Minneapolis UseR Group
    • 06/05 New York UseR Group
    • 06/14 invited talk at the 8th International useR! Conference
  • Yihui Xie, 2012, Go social coding! Make open source more open!, lightning talk at useR! 2012 (slides)
  • Yihui Xie, 2012, cranvas: Building from Plumbing and Painting, Interface 2012, Houston TX (slides)
  • Yihui Xie, 2012, Generating beautiful statistical reports quickly and faithfully, ISU Graphics Working Group (slides)
  • Yihui Xie, 2011, How I See Bootstrap, Tests and KDE, Stat546 presentation at ISU (slides)
  • Yihui Xie, 2011, An Introduction to R, tutorial workshops in the Econ department of ISU (slides)
  • Yihui Xie, 2011, Lessons Learned in Developing R Packages, ISU Computational Statistics working group (slides)
  • Yihui Xie, 2011, cranvas: Interactive Statistical Graphics in R via Qt, AT&T Research Labs, New Jersey (slides)
  • Yihui Xie, 2011, Interactive Parallel Coordinates Plot Based on Qt, Joint Statistical Meetings at Miami Beach, Florida (slides)
  • Yihui Xie, 2011, New Features and Enhancements of the animation Package, ISU Graphics Working Group (slides)
  • Yihui Xie, 2010, LaTeX and Sweave without Tears, ISU Graphics Working Group (slides)
  • Yihui Xie, 2010, Creating Animations with R, the useR! 2010 conference at NIST, Gaithersburg, MD (slides; R code)
  • Yihui Xie, 2010, Creating GUI with the gWidgets package, the 3rd Chinese R conference, Beijing (slides in Chinese; R code)
  • Yihui Xie, 2010, Modern Statistical Graphics, tutorial lecture on the 3rd Chinese R conference, Beijing (slides in Chinese)
  • Yihui Xie, 2009, Creating R Packages, Working Group in Statistical Computing and Computationally-intensive Statistical Methods, Department of Statistics, ISU (slides; C code)
  • Yihui Xie, 2009, R in Stat500: Demonstration, Computing, and Graphics (with An Introduction to the R Package ISU.Stat500), RTG Seminar, Department of Statistics, ISU (slides)
  • Yihui Xie, 2009, Turning Statistical Ideas into Animations, VIGRE Graphics Seminar, Department of Statistics, ISU (slides; R code)
  • Yihui Xie, 2009, Reproducible Research and An Introduction to LaTeX, R and Sweave, a report for PhD students in the School of Statistics, RUC (slides in Chinese)
  • Yihui Xie, 2008, (1) The History of R Language and Current Developments (slides) (2) animation: An R Package for Statistical Animations with Applications in Teaching and Data Analysis (slides) (3) Explore Irregular Data with R (slides, R code), the 1st Chinese R Conference, Beijing, 2008.12.13~14 (introduction to the conference, R code)
  • Yihui Xie, 2008, Tutorial Lectures on R Basics, a four-day tutorial lecture series given at the Institute of Botany, Chinese Academy of Sciences (slides in Chinese: 1, 2, 3, 4)
  • Yihui Xie, 2008, See? (escaping the math world via animations…), a talk in the School of Statistics, Renmin University of China (slides)
  • Yihui Xie, 2008, Statistical Animations Using R, useR! 2008 in Dortmund (abstract; slides; slides in PDF)
  • Yihui Xie, 2008, Computer Animations for Demonstrating Statistical Procedures: Methods and Implementations, a talk at the workshop “Statistical Graphics: Data and Information Visualization in Today’s Multimedia Society (Data Viz VI)” in Jacobs University, Bremen, Germany (slides)
  • Yihui Xie, 2008, The Influence of Population Quality Competitiveness to Regional Innovation: the China Case, a presentation at the 2nd MEIDE conference (slides)
  • Yihui Xie, 2008, A Training Course on EViews and Examples in Macroeconomic Statistics, a lecture in the National Development and Reform Commission (NDRC) of People’s Republic of China (slides in Chinese)
  • Yihui Xie, 2007, Animation in Statistics: Dynamic Graphics for Statistical Models and Practical Applications, an invited talk in the 2nd postgraduate forum on statistics for universities in Beijing (slides)
  • Yihui Xie, 2007, Classification Tree Ensembles with Applications to the Prognostic Analysis of Cerebral Infarction, a report in “The Seminar for the Statistical Methods in Clinical Research” held in Renmin University of China (slides in Chinese)
  • Yihui Xie, 2007, Jokes in Statistics, an invited lecture in the Capital University of Economics and Business (slides)
  • Yihui Xie, 2007, A Leisure Look on Some Tools for Statistics, an invited lecture in the Capital University of Economics and Business (slides in Chinese; R code)
  • Yihui Xie, 2007, Bootstrap Methods: A Review, a report in the class of “Frontiers in Statistics” (slides)
  • Yihui Xie, 2007, An Introduction to Support Vector Machine and Implementation in R, an article for a presentation of the class “Data Mining and Machine Learning” (slides)
  • Yihui Xie, 2007, A Framework for Statistical Programming and An Introduction to Basic Statistical Analysis Using R, a lecture given to the members of the “Capital of Statistics” (slides in Chinese)
  • Yihui Xie, 2007, A Brief Introduction to Java, a lecture in the School of Statistics (slides)
  • Yihui Xie, 2007, A Guide to Stata, a lecture in People’s Bank of China (slides in Chinese)
  • Yihui Xie, 2007, Visualization of Data and Statistical Models Using R, 3rd Conference on Australia-China Free Trade Agreement (ACFTA): Econometric Modeling and Policy in Trade, Investment and Economic Integration (paper; slides)
  • Yihui Xie, 2006, Thoughts of Statistics, an invited talk in the 1st postgraduate forum on statistics for universities in Beijing (slides)
  • Yihui Xie, 2006, Typesetting and LaTeX, a lecture in the School of Statistics (slides)
  • Yihui Xie, 2006, Statistical Programming & R Basics, a lecture in the School of Statistics (slides)
  • Yihui Xie, 2006, An Introduction to Structural Equation Modeling, A small seminar on SEM in Renmin University (slides)
  • Yanyun Zhao, Guirong Li, Xuanxuan Chen, Yihui Xie, 2006, The Industrial Structure and Foreign Investment Introduction Policy in Shanghai and Its Adjacency, 2nd Conference on Australia-China Free Trade Agreement (ACFTA): Econometric Modeling and Policy in Trade, Investment and Economic Integration (slides)

Computer Skills

I like playing with computer tools for building web pages and statistical computation & graphics. I improve the efficiency of my work with the aid of programming (i.e. work hard to be lazy).

  • Statistical Packages: R (OpenBUGS, S-Plus, SPSS, Eviews, SAS, MatLab, STATA, AMOS and LISREL)
  • Languages: R, HTML, JavaScript, PHP, SQL (Visual Basic, VBA, VBScript, ASP, Java, C)
  • Applications: LaTeX, LyX, databases (e.g. MySQL), SVN and GIT
  • Operating Systems: 97.5% GNU Linux (Ubuntu); 2.5% Windows (mostly for purposes of testing software)

Listed above are the tools I’m very familiar with; the crossed-out ones were lost to my brain stack overflow, since I do garbage collection in my brain once in a while. Occasionally I also use other tools like D3. I’m slightly interested in learning Python and Julia. If I have more time to kill, I will definitely learn http://processingjs.org.

Software Packages

I support open source software and below is a list of my contributions to the open source community:

  • R package knitr as an alternative to Sweave for dynamic report generation
  • R package animation which provides several animation demonstrations for statistical theories and data analysis (Nov 2007 - Present)
  • R package testit for testing R packages
  • R packages formatR, Rd2roxygen, R2SWF, MSG, iBUGS, fun, …
  • R package cranvas as the next generation of GGobi for interactive statistical graphics
  • the Sweave and knitr modules in LyX
  • Wordpress plugin UTF-8 DB Converter (June 2009)
  • Dokuwiki plugin animation to insert animations into wiki pages; written in PHP and JavaScript (Jun 2008)