<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>R Data-Science on Chris McKelt - Remembering Thoughts</title><link>https://blog.smarttechventures.au/tags/r-data-science/</link><description>Recent content in R Data-Science on Chris McKelt - Remembering Thoughts</description><generator>Hugo -- 0.147.2</generator><language>en</language><lastBuildDate>Sun, 03 Dec 2017 00:00:00 +0800</lastBuildDate><atom:link href="https://blog.smarttechventures.au/tags/r-data-science/index.xml" rel="self" type="application/rss+xml"/><item><title>Word prediction with Natural Language Processing</title><link>https://blog.smarttechventures.au/articles/posts/word-prediction-with-natural-language-processing/</link><pubDate>Sun, 03 Dec 2017 00:00:00 +0800</pubDate><guid>https://blog.smarttechventures.au/articles/posts/word-prediction-with-natural-language-processing/</guid><description>&lt;h3>&lt;/h3>
&lt;p>&lt;a href="https://www.rpubs.com/chris_mckelt/capstone-presentation" target="_blank">https://www.rpubs.com/chris_mckelt/capstone-presentation&lt;/a>&lt;/p></description></item><item><title>R–Data Exploration</title><link>https://blog.smarttechventures.au/articles/posts/rdata-exploration/</link><pubDate>Sat, 23 Jul 2016 00:00:00 +0800</pubDate><guid>https://blog.smarttechventures.au/articles/posts/rdata-exploration/</guid><description>&lt;h1 id="pca">PCA&lt;/h1>
&lt;p>&lt;strong>Principal Components Analysis&lt;/strong> (PCA) allows us to study and explore a set of quantitative variables measured on a set of objects&lt;/p>
&lt;h6 id="core-idea">Core Idea&lt;/h6>
&lt;p> &lt;/p>
&lt;p>With PCA we seek to reduce the dimensionality (reduce the number of variables) of a data set while retaining as much as possible of the variation present in the data&lt;/p>
&lt;p>Before performing a PCA(or any other multivariate method) we should start with some preliminary explorations&lt;/p>
&lt;ul>
&lt;li>Descriptive statistics&lt;/li>
&lt;li>Basic graphical displays&lt;/li>
&lt;li>Distribution of variables&lt;/li>
&lt;li>Pair-wise correlations among variables&lt;/li>
&lt;li>Perhaps transforming some variables&lt;/li>
&lt;li>ETC&lt;/li>
&lt;/ul>
&lt;p>&lt;img alt="image" loading="lazy" src="https://user-images.githubusercontent.com/662868/120943805-c6724780-c763-11eb-83bb-d1920d08269e.png">&lt;/p></description></item></channel></rss>