<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="../assets/xml/rss.xsl" media="all"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>#Prob ECSE-2500-01 Engineering Probability, Spring 2018, Rensselaer Polytechnic Institute (Posts about class)</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/</link><description></description><atom:link href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/categories/cat_class.xml" rel="self" type="application/rss+xml"></atom:link><language>en</language><copyright>Contents © 2019 &lt;a href="mailto:frankwr@rpi.edu"&gt;W Randolph Franklin (WRF), RPI&lt;/a&gt; </copyright><lastBuildDate>Thu, 17 Jan 2019 19:14:27 GMT</lastBuildDate><generator>Nikola (getnikola.com)</generator><docs>http://blogs.law.harvard.edu/tech/rss</docs><item><title>Engineering Probability Class 29 Mon 2018-05-10</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/#final-grading-notes" id="id1"&gt;1   Final grading notes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/#closing-remarks" id="id2"&gt;2   Closing remarks&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;p&gt;This is not an actual class, but a place to present info about the grading.&lt;/p&gt;
&lt;div class="section" id="final-grading-notes"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/#id1"&gt;1   Final grading notes&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;I made some grade formula changes so that your final total would not go down.&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Make full points for piazza continue to be 5, not 6.   So, some students could go over full points here, but allowing that seemed better than clipping it at 5.&lt;/li&gt;
&lt;li&gt;Make full points for iclickers be 9 although there were 14 iclicker days.&lt;/li&gt;
&lt;li&gt;Something went wrong on the last iclicker day, so everyone got a point, although half the class was absent.&lt;/li&gt;
&lt;li&gt;Use these changes and exam 3 (main and conflict) to compute a total grade 509 (although it's computed on 510).&lt;/li&gt;
&lt;li&gt;BTW some students did increase their grades by writing exam 3.&lt;/li&gt;
&lt;li&gt;I'd uploaded earlier total grades on 423, 501, and 507.&lt;/li&gt;
&lt;li&gt;Make the new total total510=max(total423, total501, total507, total509).&lt;/li&gt;
&lt;li&gt;Use the grade cutoffs in the syllabus.&lt;/li&gt;
&lt;li&gt;This gives a course GPA=3.3.  That's not so bad for a 2000-level course.&lt;/li&gt;
&lt;li&gt;I uploaded total510, grade510, and exam3normalized to LMS.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="closing-remarks"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/#id2"&gt;2   Closing remarks&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;I enjoyed teaching this, and hope you learned some fun and useful stuff.&lt;/li&gt;
&lt;li&gt;I'm available in the future to discuss and advise on any legal, ethical topics, such as career advice or ideas about problems you may have.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class29/</guid><pubDate>Thu, 10 May 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 28 Mon 2018-04-30</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#grades" id="id1"&gt;1   Grades&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#material-from-text" id="id2"&gt;2   Material from text&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#hypothesis-testing" id="id3"&gt;2.1   Hypothesis testing&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#iclicker-questions" id="id4"&gt;3   Iclicker questions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#counterintuitive-things-in-statistics" id="id5"&gt;4   Counterintuitive things in statistics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#relevant-xkcd-comics" id="id6"&gt;5   Relevant Xkcd comics&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="grades"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id1"&gt;1   Grades&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;I think I've responded to all grade emails.   Please resend any that I overlooked.&lt;/li&gt;
&lt;li&gt;Any grade that hasn't been complained about is presumed to be correct.&lt;/li&gt;
&lt;li&gt;The conflict exam is Thurs May 10 at 3pm, in a room TBD.  It is open only to students with conflicts who wrote me.   If you're one of those students, but you don't plan to write it, then please tell me.  E.g., a smaller room might then suffice.&lt;/li&gt;
&lt;li&gt;We'll try to get updated guaranteed grades uploaded, so you can decide whether to write the final exam.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id2"&gt;2   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;div class="section" id="hypothesis-testing"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id3"&gt;2.1   Hypothesis testing&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Say we want to test whether the average height of RPI students (the population) is 2m.&lt;/li&gt;
&lt;li&gt;We assume that the distribution is Gaussian (normal) and that the standard deviation of heights is, say, 0.2m.&lt;/li&gt;
&lt;li&gt;However we don't know the mean.&lt;/li&gt;
&lt;li&gt;We do an experiment and measure the heights of n=100 random students.   Their mean height is, say, 1.9m.&lt;/li&gt;
&lt;li&gt;The question on the table is, is the population mean 2m?&lt;/li&gt;
&lt;li&gt;This is different from the earlier question that we analyzed, which was this: What is the most likely population mean?  (Answer: 1.9m.)&lt;/li&gt;
&lt;li&gt;Now we have a hypothesis (that the population mean is 2m) that we're testing.&lt;/li&gt;
&lt;li&gt;The standard way that this is handled is as follows.&lt;/li&gt;
&lt;li&gt;Define a null hypothesis, called H0, that the population mean is 2m.&lt;/li&gt;
&lt;li&gt;Define an alternate hypothesis, called HA, that the population mean is not 2m.&lt;/li&gt;
&lt;li&gt;Note that we observed our sample mean to be $0.5 \sigma$ below the population mean, if H0 is true.&lt;/li&gt;
&lt;li&gt;Each time we rerun the experiment (measure 100 students) we'll observe a different number.&lt;/li&gt;
&lt;li&gt;We compute the probability that, if H0 is true, our sample mean would be this far from 2m.&lt;/li&gt;
&lt;li&gt;Depending on what our underlying model of students is, we might use a 1-tail or a 2-tail probability.&lt;/li&gt;
&lt;li&gt;Perhaps we think that the population mean might be less than 2m but it's not going to be more.  Then a 1-tail distribution makes sense.&lt;/li&gt;
&lt;li&gt;That is, our assumptions affect the results.&lt;/li&gt;
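&lt;li&gt;The sample mean has standard deviation $\sigma/\sqrt{n} = 0.2/10 = 0.02$m, so 1.9m is 5 of those standard deviations below 2m.  The 1-tail probability is therefore Q(5), which is very small.&lt;/li&gt;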
&lt;li&gt;Therefore we reject H0 and accept HA.&lt;/li&gt;
&lt;li&gt;We make a type-1 error if we reject H0 and it was really true.  See &lt;a class="reference external" href="http://en.wikipedia.org/wiki/Type_I_and_type_II_errors"&gt;http://en.wikipedia.org/wiki/Type_I_and_type_II_errors&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We make a type-2 error if we accept H0 and it was really false.&lt;/li&gt;
&lt;li&gt;These two errors trade off: by reducing the probability of one we increase the probability of the other, for a given sample size.&lt;/li&gt;
&lt;li&gt;E.g. in a criminal trial we prefer that a guilty person go free to having an innocent person convicted.&lt;/li&gt;
&lt;li&gt;Rejecting H0 says nothing about what the population mean really is, just that it's not likely 2m.&lt;/li&gt;
&lt;li&gt;Enrichment:  Random sampling is hard.  The US government got it wrong here: &lt;a class="reference external" href="http://politics.slashdot.org/story/11/05/13/2249256/Algorithm-Glitch-Voids-Outcome-of-US-Green-Card-Lottery"&gt;http://politics.slashdot.org/story/11/05/13/2249256/Algorithm-Glitch-Voids-Outcome-of-US-Green-Card-Lottery&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Example 8.1 page 412.&lt;/li&gt;
&lt;li&gt;Example 8.21 page 442.&lt;/li&gt;
&lt;li&gt;Example 8.23.&lt;/li&gt;
&lt;/ol&gt;
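&lt;p&gt;The height example above can be checked numerically.  This is a minimal sketch, not from the text: the function names q_function and z_score are made up for illustration, and Q is computed from the complementary error function in Python's standard math module.&lt;/p&gt;

```python
import math

def q_function(x):
    """Gaussian tail probability Q(x): chance that a standard normal exceeds x."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

def z_score(sample_mean, h0_mean, sigma, n):
    """Distance of the sample mean from the H0 mean, in units of sigma/sqrt(n)."""
    return (sample_mean - h0_mean) / (sigma / math.sqrt(n))

# Numbers from the example: H0 mean 2m, sigma 0.2m, n=100, observed mean 1.9m.
z = z_score(1.9, 2.0, 0.2, 100)      # about -5
p_one_tail = q_function(abs(z))      # Q(5)
p_two_tail = 2.0 * p_one_tail        # use this if HA allows either direction

print(z, p_one_tail, p_two_tail)
```

&lt;p&gt;Either tail probability is far below any usual significance level, which is why H0 is rejected here.&lt;/p&gt;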
&lt;/div&gt;
&lt;/div&gt;
&lt;div class="section" id="iclicker-questions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id4"&gt;3   Iclicker questions&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Suppose that RPI students' heights have mean 1.8m and standard deviation 0.2m.  (These are fictitious numbers.)&lt;/p&gt;
&lt;p&gt;You measure a sample of 16 students, and compute the sample mean $m$.&lt;/p&gt;
&lt;p&gt;What is E[m]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;10&lt;/li&gt;
&lt;li&gt;.2&lt;/li&gt;
&lt;li&gt;.05&lt;/li&gt;
&lt;li&gt;9.8&lt;/li&gt;
&lt;li&gt;2.5&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What is STD[m]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;10&lt;/li&gt;
&lt;li&gt;.2&lt;/li&gt;
&lt;li&gt;.05&lt;/li&gt;
&lt;li&gt;9.8&lt;/li&gt;
&lt;li&gt;2.5&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="counterintuitive-things-in-statistics"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id5"&gt;4   Counterintuitive things in statistics&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Statistics has some surprising examples, which would appear to be impossible.    Here are some.&lt;/p&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Average income can increase faster in a whole country than in any part of the country.&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;Consider a country with two parts: east and west.&lt;/li&gt;
&lt;li&gt;Each part has 100 people.&lt;/li&gt;
&lt;li&gt;Each person in the west makes \$100 per year; each person in the east \$200.&lt;/li&gt;
&lt;li&gt;The total income in the west is \$10K, in the east \$20K,  and in the whole country \$30K.&lt;/li&gt;
&lt;li&gt;The average income in the west is \$100, in the east \$200, and in the whole country \$150.&lt;/li&gt;
&lt;li&gt;Assume that next year nothing changes except that one westerner moves east and gets an average eastern job, so he now makes \$200 instead of \$100.&lt;/li&gt;
&lt;li&gt;The west now has 99 people @ \$100; its average income didn't change.&lt;/li&gt;
&lt;li&gt;The east now has 101 people @ \$200; its average income didn't change.&lt;/li&gt;
&lt;li&gt;The whole country's income is \$30100 for an average of \$150.50; that went up.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;College acceptance rate surprise.&lt;/p&gt;
&lt;ol class="loweralpha"&gt;
&lt;li&gt;&lt;p class="first"&gt;Imagine that we have two groups of people: Albanians and Bostonians.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;They're applying to two programs at the university: Engineering and Humanities.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Here are the numbers.    The fractions are accepted/applied.&lt;/p&gt;
&lt;table border="1" class="docutils"&gt;
&lt;colgroup&gt;
&lt;col width="40%"&gt;
&lt;col width="20%"&gt;
&lt;col width="20%"&gt;
&lt;col width="20%"&gt;
&lt;/colgroup&gt;
&lt;thead valign="bottom"&gt;
&lt;tr&gt;&lt;th class="head"&gt;city-major&lt;/th&gt;
&lt;th class="head"&gt;Engin&lt;/th&gt;
&lt;th class="head"&gt;Human&lt;/th&gt;
&lt;th class="head"&gt;Total&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody valign="top"&gt;
&lt;tr&gt;&lt;td&gt;Albanians&lt;/td&gt;
&lt;td&gt;11/15&lt;/td&gt;
&lt;td&gt;2/5&lt;/td&gt;
&lt;td&gt;13/20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;Bostonians&lt;/td&gt;
&lt;td&gt;4/5&lt;/td&gt;
&lt;td&gt;7/15&lt;/td&gt;
&lt;td&gt;11/20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;Total&lt;/td&gt;
&lt;td&gt;15/20&lt;/td&gt;
&lt;td&gt;9/20&lt;/td&gt;
&lt;td&gt;24/40&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;E.g., 15 Albanians applied to Engin; 11 were accepted.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Note that in Engineering, a &lt;em&gt;smaller&lt;/em&gt; fraction of Albanian applicants were accepted than Bostonian applicants.  &lt;em&gt;(corrected)&lt;/em&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Ditto in Humanities.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;However in all, a &lt;em&gt;larger&lt;/em&gt; fraction of Albanian applicants were accepted than Bostonian applicants.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;I could go on.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
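&lt;p&gt;The acceptance-rate table can be verified with exact fractions.  This is a small sketch; the dictionary layout and the helper names rate and overall are illustrative assumptions, and only the accepted/applied counts come from the table.&lt;/p&gt;

```python
from fractions import Fraction

# (accepted, applied) per group and program, copied from the table above.
data = {
    "Albanians":  {"Engin": (11, 15), "Human": (2, 5)},
    "Bostonians": {"Engin": (4, 5),   "Human": (7, 15)},
}

def rate(accepted, applied):
    """Exact acceptance rate for one cell of the table."""
    return Fraction(accepted, applied)

def overall(group):
    """Acceptance rate for a group pooled over both programs."""
    accepted = sum(a for a, _ in data[group].values())
    applied = sum(n for _, n in data[group].values())
    return Fraction(accepted, applied)

for group in data:
    per_program = {p: float(rate(*cell)) for p, cell in data[group].items()}
    print(group, per_program, float(overall(group)))
```

&lt;p&gt;The reversal comes from the weights: most Albanians applied to the program with the higher acceptance rate, and most Bostonians applied to the one with the lower rate.&lt;/p&gt;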
&lt;/div&gt;
&lt;div class="section" id="relevant-xkcd-comics"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/#id6"&gt;5   Relevant Xkcd comics&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/1985/"&gt;Meteorologist&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/882/"&gt;Significant&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/1478/"&gt;P-Values&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/552/"&gt;Correlation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/1725/"&gt;Linear Regression&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/925/"&gt;Cell Phones&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/1132/"&gt;Frequentists vs. Bayesians&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/1236/"&gt;Seashell&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/795/"&gt;Conditional Risk&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="https://xkcd.com/892/"&gt;Null Hypothesis&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class28/</guid><pubDate>Sun, 29 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 27 Thurs 2018-04-26</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#iclicker-questions" id="id1"&gt;1   Iclicker questions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#material-from-text" id="id2"&gt;2   Material from text&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#central-limit-theorem-etc" id="id3"&gt;2.1   Central limit theorem etc&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#chapter-7-p-359-sums-of-random-variables" id="id4"&gt;2.2   Chapter 7, p 359, Sums of Random Variables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#sums-of-random-variables-ctd" id="id5"&gt;2.3   Sums of random variables ctd&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#chapter-8-statistics" id="id6"&gt;2.4   Chapter 8, Statistics&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="iclicker-questions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id1"&gt;1   Iclicker questions&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Experiment: toss two fair coins, one after the other.   Observe two random variables:&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;X is the number of heads.&lt;/li&gt;
&lt;li&gt;Y is when the first head occurred (toss 1 or 2), with 0 meaning both coins were tails.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;What is P[X=1]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/4&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;3/4&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What is P[Y=1]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/4&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;3/4&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What is P[Y=1 &amp;amp; X=1]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/4&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;3/4&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What is P[Y=1|X=1]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/4&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;3/4&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What is P[X=1|Y=1]?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/4&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;3/4&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What's the MAP estimator for X given Y=2?&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;2&lt;/li&gt;
&lt;li&gt;3&lt;/li&gt;
&lt;li&gt;4&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id2"&gt;2   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;div class="section" id="central-limit-theorem-etc"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id3"&gt;2.1   Central limit theorem etc&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Review: Almost no matter what distribution the random variable X is,  $F_{M_n}$ quickly becomes Gaussian as n increases.  n=5 already gives a good approximation.&lt;/li&gt;
&lt;li&gt;Nice applets:&lt;ol class="loweralpha"&gt;
&lt;li&gt;&lt;a class="reference external" href="http://onlinestatbook.com/stat_sim/normal_approx/index.html"&gt;http://onlinestatbook.com/stat_sim/normal_approx/index.html&lt;/a&gt;  This tests how good the normal approximation to the binomial distribution is.&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="http://onlinestatbook.com/stat_sim/sampling_dist/index.html"&gt;http://onlinestatbook.com/stat_sim/sampling_dist/index.html&lt;/a&gt;  This lets you define a distribution, and take repeated samples of a given size.   It shows how the means of the samples are distributed.  For samples with more than a few observations, they look fairly normal.&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="http://www.umd.umich.edu/casl/socsci/econ/StudyAids/JavaStat/CentralLimitTheorem.html"&gt;http://www.umd.umich.edu/casl/socsci/econ/StudyAids/JavaStat/CentralLimitTheorem.html&lt;/a&gt;  This might also be interesting.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;Sample problems.&lt;ol class="loweralpha"&gt;
&lt;li&gt;Problem 7.1 on page 402.&lt;/li&gt;
&lt;li&gt;Problem 7.22.&lt;/li&gt;
&lt;li&gt;Problem 7.25.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="chapter-7-p-359-sums-of-random-variables"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id4"&gt;2.2   Chapter 7, p 359, Sums of Random Variables&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The long term goal of this section is to summarize information from a large
group of random variables.  E.g., the mean is one way.  We will start with
that, and go farther.&lt;/p&gt;
&lt;p&gt;The next step is to infer the true mean of a large set of variables from a
small &lt;strong&gt;sample&lt;/strong&gt;.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="sums-of-random-variables-ctd"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id5"&gt;2.3   Sums of random variables ctd&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Let Z=X+Y.&lt;/li&gt;
&lt;li&gt;$f_Z$ is convolution of  $f_X$ and  $f_Y$:  $$f_Z(z) = (f_X * f_Y)(z)$$ $$f_Z(z) = \int f_X(x) f_Y(z-x) dx$$&lt;/li&gt;
&lt;li&gt;Characteristic functions are useful.  $$\Phi_X(\omega) = E[e^{j\omega X} ]$$&lt;/li&gt;
&lt;li&gt;$\Phi_Z = \Phi_X \Phi_Y$.&lt;/li&gt;
&lt;li&gt;This extends to the sum of n independent random variables:  if $Z=\sum_i X_i$ then $\Phi_Z (\omega) = \prod_i \Phi_{X_i} (\omega)$.&lt;/li&gt;
&lt;li&gt;E.g. Exponential with  $\lambda=1$:   $\Phi_1(\omega) = 1/(1-j\omega)$  (page 164).&lt;/li&gt;
&lt;li&gt;Sum of m exponentials has  $\Phi(\omega)= 1/{(1-j\omega)}^m$.  That's called an m-Erlang.&lt;/li&gt;
&lt;li&gt;Example 2: sum of n iid Bernoullis.  Probability generating function is more useful for discrete random variables.&lt;/li&gt;
&lt;li&gt;Example 3: sum of n iid Gaussians.  $$\Phi_{X_1} = e^{j\mu\omega - \frac{1}{2} \sigma^2 \omega^2}$$   $$\Phi_{Z} = e^{jn\mu\omega - \frac{1}{2}n \sigma^2 \omega^2}$$  I.e., mean and variance sum.&lt;/li&gt;
&lt;li&gt;As the number of terms increases, no matter what distribution the initial random variable has (provided that its moments are finite), $\Phi$ for the sum starts looking like that of a Gaussian.&lt;/li&gt;
&lt;li&gt;The mean  $M_n$ of n random variables is itself a random variable.&lt;/li&gt;
&lt;li&gt;As $n\rightarrow\infty$  $M_n \rightarrow \mu$.&lt;/li&gt;
&lt;li&gt;That's a &lt;strong&gt;law of large numbers&lt;/strong&gt; (LLN).&lt;/li&gt;
&lt;li&gt;$E[ M_n ] = \mu$.  It's an &lt;strong&gt;unbiased estimator&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;$VAR[ M_n ] = \sigma ^2 / n$&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Weak law of large numbers&lt;/strong&gt; $$\forall \epsilon &amp;gt;0 \lim_{n\rightarrow\infty} P[ |M_n-\mu| &amp;lt; \epsilon] = 1$$&lt;/li&gt;
&lt;li&gt;How fast does it happen?   We can use Chebyshev, though that is very conservative.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Strong law of large numbers&lt;/strong&gt; $$P [ \lim _ {n\rightarrow\infty} M_n = \mu ] =1$$&lt;/li&gt;
&lt;li&gt;As $n\rightarrow\infty$, $F_{M_n}$ becomes Gaussian.  That's the &lt;strong&gt;Central Limit Theorem&lt;/strong&gt; (CLT).&lt;/li&gt;
&lt;/ol&gt;
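&lt;p&gt;A quick simulation shows how the sample mean $M_n$ behaves as n grows.  This is only a sketch under stated assumptions: it uses exponential random variables with $\lambda=1$ (so $\mu=1$ and $\sigma^2=1$), and the names sample_mean and simulate, the trial count, and the seed are arbitrary choices for illustration.&lt;/p&gt;

```python
import random
import statistics

def sample_mean(n, rng):
    """Mean M_n of n iid exponential(lambda=1) draws (mu = 1, sigma^2 = 1)."""
    return sum(rng.expovariate(1.0) for _ in range(n)) / n

def simulate(n, trials=10000, seed=1):
    """Return the empirical mean and variance of M_n over many repetitions."""
    rng = random.Random(seed)
    means = [sample_mean(n, rng) for _ in range(trials)]
    return statistics.fmean(means), statistics.pvariance(means)

# Compare the empirical variance of M_n with sigma^2 / n = 1 / n.
for n in (1, 4, 16, 64):
    mean_of_means, var_of_means = simulate(n)
    print(n, round(mean_of_means, 3), round(var_of_means, 4), 1.0 / n)
```

&lt;p&gt;The empirical mean stays near $\mu$ (unbiased), while the empirical variance shrinks roughly like $1/n$, which is what makes $M_n$ concentrate around $\mu$.&lt;/p&gt;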
&lt;/div&gt;
&lt;div class="section" id="chapter-8-statistics"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/#id6"&gt;2.4   Chapter 8, Statistics&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;We have a population.  &lt;strong&gt;(E.g., voters in next election, who will vote Democrat or Republican).&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We don't know the population mean.  &lt;strong&gt;(E.g., fraction of voters who will vote Democrat).&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We take several samples (observations).  From them we want to estimate the population mean and standard deviation.  &lt;strong&gt;(Ask 1000 potential voters; 520 say they will vote Democrat.  Sample mean is .52)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We want error bounds on our estimates.   &lt;strong&gt;(.52 plus or minus .04, 95 times out of 100)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Another application: testing whether 2 populations have the same mean.  &lt;strong&gt;(Is this batch of Guinness as good as the last one?)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Observations cost money, so we want to do as few as possible.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This gets beyond this course, but the biggest problems may be non-math ones.  E.g., how do you pick a random likely voter?  In the past phone books were used.  In a famous 1936 Presidential poll, that biased the sample against poor people, who voted for Roosevelt.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;In &lt;strong&gt;probability&lt;/strong&gt;, we know the parameters (e.g., mean and standard deviation) of a distribution and use them to compute the probability of some event.&lt;/p&gt;
&lt;p&gt;E.g., if we toss a fair coin 4 times what's the probability of exactly 4 heads?  Answer: 1/16.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;In &lt;strong&gt;statistics&lt;/strong&gt; we do not know all the parameters, though we usually know what type the distribution is, e.g., normal.  (We often know the standard deviation.)&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;We make observations about some members of the population, i.e., draw some samples.&lt;/li&gt;
&lt;li&gt;From them we &lt;strong&gt;estimate&lt;/strong&gt; the unknown parameters.&lt;/li&gt;
&lt;li&gt;We often also compute a confidence interval on that estimate.&lt;/li&gt;
&lt;li&gt;E.g., we toss an unknown coin 100 times and see 60 heads.  A good estimate for the probability of that coin coming up heads is 0.6.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Some estimators are better than others, though that gets beyond this course.&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;Suppose I want to estimate the average height of an RPI student by measuring the heights of N random students.&lt;/li&gt;
&lt;li&gt;The mean of the highest and lowest heights of my N students would converge to the population mean as N increased.&lt;/li&gt;
&lt;li&gt;However the median of my sample would converge faster.  Technically, the variance of the sample median is smaller than the variance of the sample hi-lo mean.&lt;/li&gt;
&lt;li&gt;The mean of my whole sample would converge the fastest.  Technically, for a normal population, the variance of the sample mean is smaller than the variance of any other unbiased estimator of the population mean.  That's why we use it.&lt;/li&gt;
&lt;li&gt;However perhaps the population's distribution is not normal.  Then one of the other estimators might be better.  It would be more &lt;strong&gt;robust&lt;/strong&gt;.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;(Enrichment) How to tell if the population is normal?  We can do various plots of the observations and look.  We can compute the probability that the observations would be this uneven if the population were normal.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;An estimator may be &lt;strong&gt;biased&lt;/strong&gt;.  We have a distribution that is U[0,b] for unknown b.  We take a sample of size n.  The max of the sample has mean $\frac{n}{n+1} b$, which is less than b, though it converges to b as n increases.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 8.2, page 413: One-tailed probability.  This is the probability that the mean of our sample is at least so far above the population mean.  $$\alpha = P[\overline{X_n}-\mu &amp;gt; c] = Q\left( \frac{c}{\sigma_x / \sqrt{n} } \right)$$  Q is defined on page 169:  $$Q(x) = \int_x^ { \infty} \frac{1}{\sqrt{2\pi} } e^{-\frac{t^2}{2} } dt$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Application: You sample n=100 students' verbal SAT scores, and see $ \overline{X} = 550$.  You know that $\sigma=100$.  If $\mu = 525$, what is the probability that $\overline{X_n} &amp;gt; 550$ ?&lt;/p&gt;
&lt;p&gt;Answer: Q(2.5) = 0.006&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This means that if we take 1000 random samples of students, each with 100 students, and measure each sample's mean, then, on average, 6 of those 1000 samples will have a mean over 550.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This is often worded as "the probability of the population's mean being under 525 is 0.006", which is different.  The problem with saying that is that it presumes some probability distribution for the population mean.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The formula also works for the other tail, computing the probability that our sample mean is at least so far &lt;strong&gt;below&lt;/strong&gt; the population mean.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The &lt;strong&gt;2-tail probability&lt;/strong&gt; is the probability that our sample mean is at least this far away from the population mean in either direction.   It is twice the 1-tail probability.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;All this also works when you know the probability and want to know c, the cutoff.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
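&lt;p&gt;The SAT application and the inverse problem (finding the cutoff c from a given probability) can be sketched with Python's standard statistics.NormalDist.  The function names one_tail_alpha and cutoff are hypothetical helpers, not from the text, and the 0.05 level in the last line is just an example value.&lt;/p&gt;

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1

def one_tail_alpha(c, sigma, n):
    """alpha = P[sample mean minus mu exceeds c] = Q(c / (sigma / sqrt(n)))."""
    return 1.0 - STD_NORMAL.cdf(c / (sigma / math.sqrt(n)))

def cutoff(alpha, sigma, n):
    """Inverse problem: the c whose 1-tail probability is alpha."""
    return STD_NORMAL.inv_cdf(1.0 - alpha) * sigma / math.sqrt(n)

# SAT example: n=100, sigma=100, mu=525, observed sample mean 550.
alpha = one_tail_alpha(550 - 525, 100, 100)   # Q(2.5), about 0.006
print(alpha, 2 * alpha)                        # 1-tail and 2-tail probabilities
print(cutoff(0.05, 100, 100))                  # c for a 5 percent 1-tail level
```

&lt;p&gt;Feeding alpha back into cutoff recovers c=25, which is a handy sanity check that the two directions agree.&lt;/p&gt;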
&lt;/div&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class27/</guid><pubDate>Thu, 26 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 26 Mon 2018-04-23</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#grades" id="id1"&gt;1   Grades&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#computation" id="id2"&gt;1.1   Computation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#notes" id="id3"&gt;1.2   Notes&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#lms" id="id4"&gt;1.3   LMS&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#iclicker-questions" id="id5"&gt;2   Iclicker questions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#material-from-text" id="id6"&gt;3   Material from text&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#section-6-5-page-332-estimation-of-random-variables" id="id7"&gt;3.1   Section 6.5, page 332: Estimation of random variables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#central-limit-theorem-etc" id="id8"&gt;3.2   Central limit theorem etc&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#chapter-7-p-359-sums-of-random-variables" id="id9"&gt;3.3   Chapter 7, p 359, Sums of Random Variables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#sums-of-random-variables-ctd" id="id10"&gt;3.4   Sums of random variables ctd&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#chapter-8-statistics" id="id11"&gt;3.5   Chapter 8, Statistics&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#hypothesis-testing" id="id12"&gt;3.6   Hypothesis testing&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="grades"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id1"&gt;1   Grades&lt;/a&gt;&lt;/h2&gt;
&lt;div class="section" id="computation"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id2"&gt;1.1   Computation&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;This will accumulate the &lt;strong&gt;total&lt;/strong&gt; score.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Normalize each homework to 100 points.&lt;/p&gt;
&lt;p&gt;Homeworks that have not yet been graded (that's 9 and up) count for 0.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Sum top 10, multiply result by 0.02, and add into total.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Normalize each exam to 30 points.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Add top 2 into total.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Take the number of sessions in which at least one question was answered.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Divide by the total number of sessions minus 2, to help students who missed up to 2 classes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Normalize that to 10 points and add into total.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Piazza:&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;Divide the semester into 3 parts: up to first test, from then to last class, and after.&lt;/li&gt;
&lt;li&gt;Require two contributions for first part, three for second, and one for last.&lt;/li&gt;
&lt;li&gt;Add up the number of contributions (max: 6), normalize to 10 points, and add to total.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Add the number of knowitall points to total.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Convert total to a letter grade per the syllabus.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Upload total and letter grades to LMS.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
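&lt;p&gt;The steps above can be sketched in Python as follows.  This is only an illustration, not the script actually used for grading: the function name, the argument layout, the cap on the iclicker fraction, and the per-part cap on Piazza contributions are all assumptions.&lt;/p&gt;

```python
def total_score(homeworks, exams, sessions_answered, total_sessions,
                piazza_by_part, knowitall_points):
    """Accumulate the total score following the steps listed above."""
    # Homeworks are already normalized to 100; ungraded ones are passed as 0.
    hw_points = 0.02 * sum(sorted(homeworks, reverse=True)[:10])
    # Exams are already normalized to 30; only the best two count.
    exam_points = sum(sorted(exams, reverse=True)[:2])
    # Iclicker: two missed sessions are forgiven.
    # Capping the fraction at 1 is an assumption.
    clicker_points = 10 * min(1.0, sessions_answered / (total_sessions - 2))
    # Piazza: required contributions per part are 2, 3, and 1 (6 in total).
    # Capping each part at its requirement is an assumption.
    required = (2, 3, 1)
    counted = sum(min(made, need) for made, need in zip(piazza_by_part, required))
    piazza_points = 10 * counted / 6
    return hw_points + exam_points + clicker_points + piazza_points + knowitall_points


# Hypothetical student: 8 graded homeworks at full marks, 3 not yet graded.
print(total_score([100] * 8 + [0] * 3, [30, 25, 20], 20, 24, (2, 3, 0), 1))
```

&lt;p&gt;Rerunning the function with the ungraded homeworks filled in shows how much the later homeworks could raise the total.&lt;/p&gt;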
&lt;/div&gt;
&lt;div class="section" id="notes"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id3"&gt;1.2   Notes&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;This is guaranteed; your grade cannot be lower (absent detected cheating).&lt;/li&gt;
&lt;li&gt;You can compute how the latest homeworks would raise it.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="lms"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id4"&gt;1.3   LMS&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;I uploaded 5 columns to LMS.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;There are updated iclicker, piazza, and knowitall numbers.&lt;/p&gt;
&lt;p&gt;They should include all updates.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Your total numerical grade is in &lt;strong&gt;Total-423&lt;/strong&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Your letter grade is in &lt;strong&gt;Grade-423&lt;/strong&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Ignore other columns with names like total.  They are wrong.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;div class="section" id="iclicker-questions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id5"&gt;2   Iclicker questions&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;X and Y are two uniform r.v. on the interval [0,1].  X and Y are independent.  Z=X+Y.  What is E[Z]?&lt;ol class="loweralpha"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;2/3&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;2&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;Now let W=max(X,Y).    What is E[W]?&lt;ol class="loweralpha"&gt;
&lt;li&gt;0&lt;/li&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;2/3&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;2&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id6"&gt;3   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;div class="section" id="section-6-5-page-332-estimation-of-random-variables"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id7"&gt;3.1   Section 6.5, page 332: Estimation of random variables&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Assume that we want to know X but can only see Y, which depends on X.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This is a generalization of our long-running noisy communication channel example.  We'll do things a little more precisely now.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Another application would be to estimate tomorrow's price of GOOG (X) given the prices to date (Y).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Sometimes, but not always, we have a prior probability for X.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;For the communication channel we do; for GOOG we don't.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;If we do, it's a &lt;em&gt;maximum a posteriori estimator&lt;/em&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;If we don't, it's a &lt;em&gt;maximum likelihood estimator&lt;/em&gt;.  We effectively assume that the prior probability of X is uniform, even though that may not completely make sense.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;You toss a fair coin 3 times.  X is the number of heads, from 0 to 3.  Y is the position of the 1st head, from 0 to 3.  If there are no heads, we'll say that the first head's position is 0.&lt;/p&gt;
&lt;table border="1" class="docutils"&gt;
&lt;colgroup&gt;
&lt;col width="46%"&gt;
&lt;col width="54%"&gt;
&lt;/colgroup&gt;
&lt;thead valign="bottom"&gt;
&lt;tr&gt;&lt;th class="head"&gt;(X,Y)&lt;/th&gt;
&lt;th class="head"&gt;p(X,Y)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody valign="top"&gt;
&lt;tr&gt;&lt;td&gt;(0,0)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(1,1)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(1,2)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(1,3)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(2,1)&lt;/td&gt;
&lt;td&gt;2/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(2,2)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;(3,1)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;E.g., exactly 1 head can occur 3 ways (out of 8): HTT, THT, TTH.  In only one of those ways, HTT, is the 1st (and only) head in position 1, so p(1,1)=1/8.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Conditional probabilities:&lt;/p&gt;
&lt;table border="1" class="docutils"&gt;
&lt;colgroup&gt;
&lt;col width="48%"&gt;
&lt;col width="10%"&gt;
&lt;col width="10%"&gt;
&lt;col width="23%"&gt;
&lt;col width="10%"&gt;
&lt;/colgroup&gt;
&lt;thead valign="bottom"&gt;
&lt;tr&gt;&lt;th class="head"&gt;p(x|y)&lt;/th&gt;
&lt;th class="head"&gt;y=0&lt;/th&gt;
&lt;th class="head"&gt;y=1&lt;/th&gt;
&lt;th class="head"&gt;y=2&lt;/th&gt;
&lt;th class="head"&gt;y=3&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody valign="top"&gt;
&lt;tr&gt;&lt;td&gt;x=0&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;x=1&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1/4&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;x=2&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;x=3&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1/4&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt; &lt;/td&gt;
&lt;td&gt; &lt;/td&gt;
&lt;td&gt; &lt;/td&gt;
&lt;td&gt; &lt;/td&gt;
&lt;td&gt; &lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;$g_{MAP}(y)$&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;1 or 2&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;$P_{error}(y)$&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;&lt;td&gt;p(y)&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;td&gt;1/2&lt;/td&gt;
&lt;td&gt;1/4&lt;/td&gt;
&lt;td&gt;1/8&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;The total probability of error is $\sum_y P_{error}(y)\, p(y) = \frac{1}{2}\cdot\frac{1}{2} + \frac{1}{2}\cdot\frac{1}{4} = \frac{3}{8}$.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We observe Y and want to guess X from Y.  E.g., If we observe  $$\small y= \begin{pmatrix}0\\1\\2\\3\end{pmatrix} \text{then } x= \begin{pmatrix}0\\ 2 \text{ most likely} \\ 1, 2 \text{ equally likely} \\ 1 \end{pmatrix}$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;There are different formulae.  The above one was the MAP, maximum a posteriori probability.&lt;/p&gt;
&lt;p&gt;$$g_{\text{MAP}} (y) = \arg\max_x p_x(x|y) \text{ or } f_x(x|y)$$&lt;/p&gt;
&lt;p&gt;That means the value of $x$ that maximizes $p_x(x|y)$.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What if we don't know p(x|y)?  If we know p(y|x), we can use Bayes.  We might measure p(y|x) experimentally, e.g., by sending many messages over the channel.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Bayes requires p(x).  What if we don't know even that?  E.g. we don't know the probability of the different possible transmitted messages.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Then use the maximum likelihood estimator, ML. $$g_{\text{ML}} (y) = \arg\max_x p_y(y|x) \text{ or } f_y(y|x)$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;There are other estimators for different applications.  E.g., regression using least squares might attempt to predict a graduate's QPA from his/her entering SAT scores.  At Saratoga in August we might attempt to predict a horse's chance of winning a race from its speed in previous races.   Some years ago, an Engineering Assoc Dean would do that each summer.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Historically, IMO, some of the techniques, like least squares and logistic regression, have been used more because they're computationally easy than because they're logically justified.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
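&lt;p&gt;The three-coin example can be checked by brute-force enumeration.  The following is a minimal Python sketch, not taken from the text; the helper names are made up for illustration, and ties are broken in favour of the smallest x, which is an arbitrary choice.&lt;/p&gt;

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Joint pmf of X (number of heads) and Y (position of first head, 0 if none).
joint = defaultdict(Fraction)
for tosses in product("HT", repeat=3):
    x = tosses.count("H")
    y = tosses.index("H") + 1 if "H" in tosses else 0
    joint[(x, y)] += Fraction(1, 8)

# Marginals p(x) and p(y).
p_x = defaultdict(Fraction)
p_y = defaultdict(Fraction)
for (x, y), p in joint.items():
    p_x[x] += p
    p_y[y] += p


def p_x_given_y(x, y):
    return joint.get((x, y), Fraction(0)) / p_y[y]


def p_y_given_x(y, x):
    return joint.get((x, y), Fraction(0)) / p_x[x]


def g_map(y):
    # Value of x maximizing p(x|y); ties go to the smallest x.
    return max(range(4), key=lambda x: p_x_given_y(x, y))


def g_ml(y):
    # Value of x maximizing p(y|x); ties go to the smallest x.
    return max(range(4), key=lambda x: p_y_given_x(y, x))


total_error = sum(p_y[y] * (1 - p_x_given_y(g_map(y), y)) for y in range(4))
print("MAP:", [g_map(y) for y in range(4)])
print("ML: ", [g_ml(y) for y in range(4)])
print("total MAP error:", total_error)
```

&lt;p&gt;The MAP estimates agree with the table (the tie at y=2 is resolved to x=1), and the total error comes out as 3/8.  The ML estimate differs at y=1: it picks x=3, because p(y=1|x=3)=1 is the largest likelihood even though x=3 is a priori unlikely.&lt;/p&gt;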
&lt;/div&gt;
&lt;div class="section" id="central-limit-theorem-etc"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id8"&gt;3.2   Central limit theorem etc&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Review: Almost no matter what distribution the random variable X has,  $F_{M_n}$ quickly becomes Gaussian as n increases.  n=5 already gives a good approximation.&lt;/li&gt;
&lt;li&gt;nice applets:&lt;ol class="loweralpha"&gt;
&lt;li&gt;&lt;a class="reference external" href="http://onlinestatbook.com/stat_sim/normal_approx/index.html"&gt;http://onlinestatbook.com/stat_sim/normal_approx/index.html&lt;/a&gt;  This tests how good the normal approximation to the binomial distribution is.&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="http://onlinestatbook.com/stat_sim/sampling_dist/index.html"&gt;http://onlinestatbook.com/stat_sim/sampling_dist/index.html&lt;/a&gt;  This lets you define a distribution, and take repeated samples of a given size.   It shows how the means of the samples are distributed.  For samples with more than a few observations, they look fairly normal.&lt;/li&gt;
&lt;li&gt;&lt;a class="reference external" href="http://www.umd.umich.edu/casl/socsci/econ/StudyAids/JavaStat/CentralLimitTheorem.html"&gt;http://www.umd.umich.edu/casl/socsci/econ/StudyAids/JavaStat/CentralLimitTheorem.html&lt;/a&gt;  This might also be interesting.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;Sample problems.&lt;ol class="loweralpha"&gt;
&lt;li&gt;Problem 7.1 on page 402.&lt;/li&gt;
&lt;li&gt;Problem 7.22.&lt;/li&gt;
&lt;li&gt;Problem 7.25.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="chapter-7-p-359-sums-of-random-variables"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id9"&gt;3.3   Chapter 7, p 359, Sums of Random Variables&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;The long term goal of this section is to summarize information from a large
group of random variables.  E.g., the mean is one way.  We will start with
that, and go farther.&lt;/p&gt;
&lt;p&gt;The next step is to infer the true mean of a large set of variables from a
small &lt;strong&gt;sample&lt;/strong&gt;.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="sums-of-random-variables-ctd"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id10"&gt;3.4   Sums of random variables ctd&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Let Z=X+Y.&lt;/li&gt;
&lt;li&gt;$f_Z$ is convolution of  $f_X$ and  $f_Y$:  $$f_Z(z) = (f_X * f_Y)(z)$$ $$f_Z(z) = \int f_X(x) f_Y(z-x) dx$$&lt;/li&gt;
&lt;li&gt;Characteristic functions are useful.  $$\Phi_X(\omega) = E[e^{j\omega X} ]$$&lt;/li&gt;
&lt;li&gt;$\Phi_Z = \Phi_X \Phi_Y$.&lt;/li&gt;
&lt;li&gt;This extends to the sum of n independent random variables:  if $Z=\sum_i X_i$ then $\Phi_Z (\omega) = \prod_i \Phi_{X_i} (\omega)$&lt;/li&gt;
&lt;li&gt;E.g. Exponential with  $\lambda=1$:   $\Phi_1(\omega) = 1/(1-j\omega)$  (page 164).&lt;/li&gt;
&lt;li&gt;Sum of m exponentials has  $\Phi(\omega)= 1/{(1-j\omega)}^m$.  That's called an m-Erlang.&lt;/li&gt;
&lt;li&gt;Example 2: sum of n iid Bernoullis.  Probability generating function is more useful for discrete random variables.&lt;/li&gt;
&lt;li&gt;Example 3: sum of n iid Gaussians.  $$\Phi_{X_1} = e^{j\mu\omega - \frac{1}{2} \sigma^2 \omega^2}$$   $$\Phi_{Z} = e^{jn\mu\omega - \frac{1}{2}n \sigma^2 \omega^2}$$  I.e., mean and variance sum.&lt;/li&gt;
&lt;li&gt;As the number of terms increases, no matter what distribution the initial random variable has (provided that its moments are finite), $\Phi$ for the sum starts looking like that of a Gaussian.&lt;/li&gt;
&lt;li&gt;The mean  $M_n$ of n random variables is itself a random variable.&lt;/li&gt;
&lt;li&gt;As $n\rightarrow\infty$  $M_n \rightarrow \mu$.&lt;/li&gt;
&lt;li&gt;That's a &lt;strong&gt;law of large numbers&lt;/strong&gt; (LLN).&lt;/li&gt;
&lt;li&gt;$E[ M_n ] = \mu$.  It's an &lt;strong&gt;unbiased estimator&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;$VAR[ M_n ] = \sigma ^2 / n$.  (The sum $S_n = n M_n$ of n iid variables has variance $n \sigma ^2$.)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Weak law of large numbers&lt;/strong&gt; $$\forall \epsilon &amp;gt;0 \lim_{n\rightarrow\infty} P[ |M_n-\mu| &amp;lt; \epsilon] = 1$$&lt;/li&gt;
&lt;li&gt;How fast does it happen?   We can use Chebyshev, though that is very conservative.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Strong law of large numbers&lt;/strong&gt; $$P [ \lim _ {n\rightarrow\infty} M_n = \mu ] =1$$&lt;/li&gt;
&lt;li&gt;As $n\rightarrow\infty$, $F_{M_n}$ becomes Gaussian.  That's the &lt;strong&gt;Central Limit Theorem&lt;/strong&gt; (CLT).&lt;/li&gt;
&lt;/ol&gt;
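&lt;p&gt;A short simulation can illustrate how the sample mean $M_n$ behaves.  This sketch assumes the $X_i$ are iid uniform on [0,1], so $\mu = 1/2$ and $\sigma^2 = 1/12$; the seed, the number of trials, and the function name are arbitrary choices made for the illustration.&lt;/p&gt;

```python
import random
import statistics


def sample_means(n, trials, rng):
    # Each entry is one realization of M_n, the mean of n iid uniform(0,1) draws.
    return [statistics.fmean(rng.random() for _ in range(n)) for _ in range(trials)]


rng = random.Random(2018)
for n in (1, 5, 25, 100):
    means = sample_means(n, 5000, rng)
    observed_mean = statistics.fmean(means)
    observed_var = statistics.pvariance(means)
    predicted_var = 1 / (12 * n)  # sigma^2 / n for uniform(0,1)
    print(n, round(observed_mean, 4), round(observed_var, 5), round(predicted_var, 5))
```

&lt;p&gt;The observed mean of $M_n$ should stay near 1/2 for every n, while its variance should shrink roughly like $\sigma^2/n$, which is what drives the laws of large numbers above.&lt;/p&gt;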
&lt;/div&gt;
&lt;div class="section" id="chapter-8-statistics"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id11"&gt;3.5   Chapter 8, Statistics&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;We have a population.  &lt;strong&gt;(E.g., voters in next election, who will vote Democrat or Republican).&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We don't know the population mean.  &lt;strong&gt;(E.g., fraction of voters who will vote Democrat).&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We take several samples (observations).  From them we want to estimate the population mean and standard deviation.  &lt;strong&gt;(Ask 1000 potential voters; 520 say they will vote Democrat.  Sample mean is .52)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;We want error bounds on our estimates.   &lt;strong&gt;(.52 plus or minus .04, 95 times out of 100)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Another application: testing whether 2 populations have the same mean.  &lt;strong&gt;(Is this batch of Guinness as good as the last one?)&lt;/strong&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Observations cost money, so we want to do as few as possible.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This gets beyond this course, but the biggest problems may be non-math ones.  E.g., how do you pick a random likely voter?  In the past phone books were used.  In a famous 1936 Presidential poll, that biased the sample against poor people, who voted for Roosevelt.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;In &lt;strong&gt;probability&lt;/strong&gt;, we know the parameters (e.g., mean and standard deviation) of a distribution and use them to compute the probability of some event.&lt;/p&gt;
&lt;p&gt;E.g., if we toss a fair coin 4 times what's the probability of exactly 4 heads?  Answer: 1/16.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;In &lt;strong&gt;statistics&lt;/strong&gt; we do not know all the parameters, though we usually know what type the distribution is, e.g., normal.  (We often know the standard deviation.)&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;We make observations about some members of the distribution, i.e., draw some samples.&lt;/li&gt;
&lt;li&gt;From them we &lt;strong&gt;estimate&lt;/strong&gt; the unknown parameters.&lt;/li&gt;
&lt;li&gt;We often also compute a confidence interval on that estimate.&lt;/li&gt;
&lt;li&gt;E.g., we toss an unknown coin 100 times and see 60 heads.  A good estimate for the probability of that coin coming up heads is 0.6.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Some estimators are better than others, though that gets beyond this course.&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Suppose I want to estimate the average height of an RPI student by measuring the heights of N random students.&lt;/li&gt;
&lt;li&gt;The mean of the highest and lowest heights of my N students would converge to the population mean as N increased.&lt;/li&gt;
&lt;li&gt;However the median of my sample would converge faster.  Technically, the variance of the sample median is smaller than the variance of the sample hi-lo mean.&lt;/li&gt;
&lt;li&gt;The mean of my whole sample would converge the fastest.  Technically, for a normal population, the variance of the sample mean is smaller than the variance of any other unbiased estimator of the population mean.  That's why we use it.&lt;/li&gt;
&lt;li&gt;However perhaps the population's distribution is not normal.  Then one of the other estimators might be better.  It would be more &lt;strong&gt;robust&lt;/strong&gt;.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;(Enrichment) How to tell if the population is normal?  We can do various plots of the observations and look.  We can compute the probability that the observations would be this uneven if the population were normal.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;An estimator may be &lt;strong&gt;biased&lt;/strong&gt;.  We have a distribution that is U[0,b] for unknown b.  We take a sample of size n.  The max of the sample has mean $\frac{n}{n+1} b$, which is below b, though it converges to b as n increases.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 8.2, page 413: One-tailed probability.  This is the probability that the mean of our sample is at least c above the population mean.  $$\alpha = P[\overline{X_n}-\mu &amp;gt; c] = Q\left( \frac{c}{\sigma_x / \sqrt{n} } \right)$$  Q is defined on page 169:  $$Q(x) = \int_x^ { \infty} \frac{1}{\sqrt{2\pi} } e^{-\frac{t^2}{2} } dt$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Application: You sample n=100 students' verbal SAT scores, and see $ \overline{X} = 550$.  You know that $\sigma=100$.  If $\mu = 525$, what is the probability that $\overline{X_n} &amp;gt; 550$ ?&lt;/p&gt;
&lt;p&gt;Answer: Q(2.5) = 0.006&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This means that if we take 1000 random samples of students, each with 100 students, and measure each sample's mean, then, on average, 6 of those 1000 samples will have a mean over 550.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;This is often worded as the probability of the population's mean being under 525 is 0.006, which is different.  The problem with saying that is that it presumes some probability distribution for the population mean.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The formula also works for the other tail, computing the probability that our sample mean is at least so far &lt;strong&gt;below&lt;/strong&gt; the population mean.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The &lt;strong&gt;2-tail probability&lt;/strong&gt; is the probability that our sample mean is at least this far away from the population mean in either direction.   It is twice the 1-tail probability.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;All this also works when you know the probability and want to know c, the cutoff.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
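&lt;p&gt;The SAT application can be reproduced numerically.  This is a minimal sketch using only the Python standard library; the function names are made up for illustration, and Q is computed from the complementary error function via the identity $Q(x) = \frac{1}{2}\,\mathrm{erfc}(x/\sqrt{2})$.&lt;/p&gt;

```python
import math
from statistics import NormalDist


def q(x):
    # Gaussian tail probability Q(x) = P[N(0,1) exceeds x].
    return 0.5 * math.erfc(x / math.sqrt(2))


def one_tail(c, sigma, n):
    # Probability that the sample mean is at least c above the population mean.
    return q(c / (sigma / math.sqrt(n)))


def cutoff(alpha, sigma, n):
    # Inverse problem: the c whose one-tail probability is alpha.
    return NormalDist().inv_cdf(1 - alpha) * sigma / math.sqrt(n)


alpha = one_tail(550 - 525, sigma=100, n=100)  # this is Q(2.5)
print("one-tail:", round(alpha, 4))
print("two-tail:", round(2 * alpha, 4))
print("cutoff for that alpha:", round(cutoff(alpha, sigma=100, n=100), 3))
```

&lt;p&gt;The one-tail value rounds to the 0.006 quoted above, and feeding it back into the cutoff function recovers c = 25, which is the "know the probability, want the cutoff" direction.&lt;/p&gt;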
&lt;/div&gt;
&lt;div class="section" id="hypothesis-testing"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/#id12"&gt;3.6   Hypothesis testing&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Say we want to test whether the average height of an RPI student (called the population) is 2m.&lt;/li&gt;
&lt;li&gt;We assume that the distribution is Gaussian (normal) and that the standard deviation of heights is, say, 0.2m.&lt;/li&gt;
&lt;li&gt;However we don't know the mean.&lt;/li&gt;
&lt;li&gt;We do an experiment and measure the heights of n=100 random students.   Their mean height is, say, 1.9m.&lt;/li&gt;
&lt;li&gt;The question on the table is, is the population mean 2m?&lt;/li&gt;
&lt;li&gt;This is different from the earlier question that we analyzed, which was this: What is the most likely population mean?  (Answer: 1.9m.)&lt;/li&gt;
&lt;li&gt;Now we have a hypothesis (that the population mean is 2m) that we're testing.&lt;/li&gt;
&lt;li&gt;The standard way that this is handled is as follows.&lt;/li&gt;
&lt;li&gt;Define a null hypothesis, called H0, that the population mean is 2m.&lt;/li&gt;
&lt;li&gt;Define an alternate hypothesis, called HA, that the population mean is not 2m.&lt;/li&gt;
&lt;li&gt;Note that we observed our sample mean to be $0.5 \sigma$ below the population mean, if H0 is true.&lt;/li&gt;
&lt;li&gt;Each time we rerun the experiment (measure 100 students) we'll observe a different number.&lt;/li&gt;
&lt;li&gt;We compute the probability that, if H0 is true, our sample mean would be this far from 2m.&lt;/li&gt;
&lt;li&gt;Depending on what our underlying model of students is, we might use a 1-tail or a 2-tail probability.&lt;/li&gt;
&lt;li&gt;Perhaps we think that the population mean might be less than 2m but it's not going to be more.  Then a 1-tail probability makes sense.&lt;/li&gt;
&lt;li&gt;That is, our assumptions affect the results.&lt;/li&gt;
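&lt;li&gt;The standard deviation of the sample mean is $\sigma/\sqrt{n} = 0.2/10 = 0.02$m, so 1.9m is 5 of those below 2m.  The 1-tail probability is therefore Q(5), about $2.9\times 10^{-7}$, which is very small.&lt;/li&gt;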
&lt;li&gt;Therefore we reject H0 and accept HA.&lt;/li&gt;
&lt;li&gt;We make a type-1 error if we reject H0 and it was really true.  See &lt;a class="reference external" href="http://en.wikipedia.org/wiki/Type_I_and_type_II_errors"&gt;http://en.wikipedia.org/wiki/Type_I_and_type_II_errors&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;We make a type-2 error if we accept H0 and it was really false.&lt;/li&gt;
&lt;li&gt;These two errors trade off: by reducing the probability of one we increase the probability of the other, for a given sample size.&lt;/li&gt;
&lt;li&gt;E.g. in a criminal trial we prefer that a guilty person go free to having an innocent person convicted.&lt;/li&gt;
&lt;li&gt;Rejecting H0 says nothing about what the population mean really is, just that it's not likely 2m.&lt;/li&gt;
&lt;li&gt;&lt;dl class="first docutils"&gt;
&lt;dt&gt;(Enrichment)  Random sampling is hard.  The US government got it wrong here:&lt;/dt&gt;
&lt;dd&gt;&lt;a class="reference external" href="http://politics.slashdot.org/story/11/05/13/2249256/Algorithm-Glitch-Voids-Outcome-of-US-Green-Card-Lottery"&gt;http://politics.slashdot.org/story/11/05/13/2249256/Algorithm-Glitch-Voids-Outcome-of-US-Green-Card-Lottery&lt;/a&gt;&lt;/dd&gt;
&lt;/dl&gt;
&lt;/li&gt;
&lt;/ol&gt;
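&lt;p&gt;The height example can be worked through in a few lines of Python.  This is a sketch under the assumptions stated in the list (Gaussian heights, known standard deviation 0.2m, n=100, observed mean 1.9m); the significance level of 0.05 is an extra assumption, chosen only because it is a common convention.&lt;/p&gt;

```python
import math


def q(x):
    # Gaussian tail probability Q(x) = P[N(0,1) exceeds x].
    return 0.5 * math.erfc(x / math.sqrt(2))


mu0 = 2.0          # population mean claimed by H0
sigma = 0.2        # known standard deviation of heights
n = 100            # sample size
sample_mean = 1.9  # observed sample mean

# Distance from the H0 mean, in standard deviations of the sample mean.
z = (sample_mean - mu0) / (sigma / math.sqrt(n))

p_one_tail = q(abs(z))       # H_A: the population mean is below 2m
p_two_tail = 2 * p_one_tail  # H_A: the population mean is not 2m

significance = 0.05  # assumed threshold, not from the notes
reject_h0 = significance > p_two_tail

print("z =", round(z, 3))
print("one-tail p =", p_one_tail)
print("two-tail p =", p_two_tail)
print("reject H0:", reject_h0)
```

&lt;p&gt;With z about -5, both tail probabilities are far below any usual threshold, so H0 is rejected whichever tail convention is used.  The choice between the 1-tail and 2-tail value would matter for an observation closer to 2m.&lt;/p&gt;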
&lt;/div&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class26/</guid><pubDate>Sun, 22 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 25 Thu 2018-04-19</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#grades" id="id1"&gt;1   Grades&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#handwritten-notes-and-homework-solutions" id="id2"&gt;2   Handwritten notes and homework solutions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#iclicker-questions" id="id3"&gt;3   Iclicker questions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#material-from-text" id="id4"&gt;4   Material from text&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#joint-distribution-functions-ctd" id="id5"&gt;4.1   6.1.2 Joint Distribution Functions, ctd.&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#independence" id="id6"&gt;4.2   6.1.3 Independence&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#functions-of-several-random-variables" id="id7"&gt;4.3   6.2 Functions of several random variables&lt;/a&gt;&lt;ul class="auto-toc"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#one-function-of-several-random-variables" id="id8"&gt;4.3.1   6.2.1 One Function of Several Random Variables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#transformations-of-random-vectors" id="id9"&gt;4.3.2   6.2.2 Transformations of Random Vectors&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#pdf-of-general-transformations" id="id10"&gt;4.3.3   6.2.3 pdf of General Transformations&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#expected-values-of-vector-random-variables" id="id11"&gt;4.4   6.3 Expected values of vector random variables&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="grades"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id1"&gt;1   Grades&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;I'll try to upload a guaranteed minimum grade by the end of tomorrow.   That will assume that all the grades that I don't yet have are zero.&lt;/li&gt;
&lt;li&gt;There will be eleven homeworks.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="handwritten-notes-and-homework-solutions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id2"&gt;2   Handwritten notes and homework solutions&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;I added buttons to the page headers that go directly there.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="iclicker-questions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id3"&gt;3   Iclicker questions&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;What is $$\int_{-\infty}^\infty e^{\big(-\frac{x^2}{2}\big)} dx$$?&lt;ol class="loweralpha"&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;$2\pi$&lt;/li&gt;
&lt;li&gt;$\sqrt{2\pi}$&lt;/li&gt;
&lt;li&gt;$1/\sqrt{2\pi}$&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;What is the largest possible value for a correlation coefficient?&lt;ol class="loweralpha"&gt;
&lt;li&gt;1/2&lt;/li&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;$2\pi$&lt;/li&gt;
&lt;li&gt;$\sqrt{2\pi}$&lt;/li&gt;
&lt;li&gt;$1/\sqrt{2\pi}$&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;The most reasonable probability distribution for the number of defects on an integrated circuit caused by dust particles, cosmic rays, etc, is&lt;ol class="loweralpha"&gt;
&lt;li&gt;Exponential&lt;/li&gt;
&lt;li&gt;Poisson&lt;/li&gt;
&lt;li&gt;Normal&lt;/li&gt;
&lt;li&gt;Uniform&lt;/li&gt;
&lt;li&gt;Binomial&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;The most reasonable probability distribution for the time until the next request hits your web server is:&lt;ol class="loweralpha"&gt;
&lt;li&gt;Exponential&lt;/li&gt;
&lt;li&gt;Poisson&lt;/li&gt;
&lt;li&gt;Normal&lt;/li&gt;
&lt;li&gt;Uniform&lt;/li&gt;
&lt;li&gt;Binomial&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;If you add two independent normal random variables, each with variance 10, what is the variance of the sum?&lt;ol class="loweralpha"&gt;
&lt;li&gt;1&lt;/li&gt;
&lt;li&gt;$\sqrt2$&lt;/li&gt;
&lt;li&gt;10&lt;/li&gt;
&lt;li&gt;$10\sqrt2$&lt;/li&gt;
&lt;li&gt;20&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
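&lt;p&gt;A quick sanity check on questions 1, 2 and 5.  This is only a sketch, using three standard facts: the N(0,1) pdf integrates to 1, a correlation coefficient is bounded by 1 in magnitude, and variances of independent r.v. add.&lt;/p&gt;
&lt;p&gt;$$\int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}\, dx = 1 \quad\Longrightarrow\quad \int_{-\infty}^\infty e^{-x^2/2}\, dx = \sqrt{2\pi}$$&lt;/p&gt;
&lt;p&gt;$$|\rho_{X,Y}| \le 1$$&lt;/p&gt;
&lt;p&gt;$$\text{VAR}[X+Y] = \text{VAR}[X] + \text{VAR}[Y] = 10 + 10 = 20 \quad \text{for independent } X, Y$$&lt;/p&gt;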
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id4"&gt;4   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;div class="section" id="joint-distribution-functions-ctd"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id5"&gt;4.1   6.1.2 Joint Distribution Functions, ctd.&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;joint cumulative distribution function, p 305.&lt;/li&gt;
&lt;li&gt;marginal cdf’s&lt;/li&gt;
&lt;li&gt;joint probability mass function&lt;/li&gt;
&lt;li&gt;conditional pmf’s&lt;/li&gt;
&lt;li&gt;jointly continuous random variables&lt;/li&gt;
&lt;li&gt;joint probability density function.&lt;/li&gt;
&lt;li&gt;marginal pdf’s&lt;/li&gt;
&lt;li&gt;conditional pdf’s&lt;/li&gt;
&lt;li&gt;Example 6.7 Multiplicative Sequence, p 308.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="independence"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id6"&gt;4.2   6.1.3 Independence&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Example 6.8  Independence.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="functions-of-several-random-variables"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id7"&gt;4.3   6.2 Functions of several random variables&lt;/a&gt;&lt;/h3&gt;
&lt;div class="section" id="one-function-of-several-random-variables"&gt;
&lt;h4&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id8"&gt;4.3.1   6.2.1 One Function of Several Random Variables&lt;/a&gt;&lt;/h4&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 6.9 Maximum and Minimum of n Random Variables&lt;/p&gt;
&lt;p&gt;Apply this to uniform r.v.&lt;/p&gt;
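&lt;p&gt;A sketch of that application, assuming the $n$ r.v. are independent and each is uniform on $[0,1]$ (the interval is an assumption made here for concreteness).  For $0\le x\le 1$:&lt;/p&gt;
&lt;p&gt;$$F_{\max}(x) = P[X_1\le x, \ldots, X_n\le x] = x^n, \qquad f_{\max}(x) = n x^{n-1}, \qquad E[\max] = \frac{n}{n+1}$$&lt;/p&gt;
&lt;p&gt;$$F_{\min}(x) = 1 - (1-x)^n, \qquad f_{\min}(x) = n (1-x)^{n-1}, \qquad E[\min] = \frac{1}{n+1}$$&lt;/p&gt;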
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 6.11  Reliability of Redundant Systems&lt;/p&gt;
&lt;p&gt;Reminder for exponential r.v.:&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;$f(x) = \lambda e^{-\lambda x}$&lt;/li&gt;
&lt;li&gt;$F(x) = 1-e^{-\lambda x}$&lt;/li&gt;
&lt;li&gt;$\mu = 1/\lambda$&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;I may extend this example to find pdf and mean.&lt;/p&gt;
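&lt;p&gt;One possible form of that extension, sketched under the assumption that the system has $n$ independent components, each with an exponential lifetime of rate $\lambda$, and that it fails only when all of them have failed.  The system lifetime $T$ is then the maximum of the component lifetimes.  For $t\ge0$:&lt;/p&gt;
&lt;p&gt;$$F_T(t) = \left(1-e^{-\lambda t}\right)^n, \qquad f_T(t) = n \lambda e^{-\lambda t} \left(1-e^{-\lambda t}\right)^{n-1}$$&lt;/p&gt;
&lt;p&gt;For $n=2$ the mean follows from integrating $1-F_T$:&lt;/p&gt;
&lt;p&gt;$$E[T] = \int_0^\infty \left( 2e^{-\lambda t} - e^{-2\lambda t} \right) dt = \frac{2}{\lambda} - \frac{1}{2\lambda} = \frac{3}{2\lambda}$$&lt;/p&gt;
&lt;p&gt;So, under these assumptions, one redundant component increases the mean lifetime by half; it does not double it.&lt;/p&gt;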
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="transformations-of-random-vectors"&gt;
&lt;h4&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id9"&gt;4.3.2   6.2.2 Transformations of Random Vectors&lt;/a&gt;&lt;/h4&gt;
&lt;/div&gt;
&lt;div class="section" id="pdf-of-general-transformations"&gt;
&lt;h4&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id10"&gt;4.3.3   6.2.3 pdf of General Transformations&lt;/a&gt;&lt;/h4&gt;
&lt;p&gt;We skip Section 6.2.3.  However, a historical note about Student's T distribution:&lt;/p&gt;
&lt;p&gt;Student was a pseudonym of a mathematician working for Guinness in Ireland.  He developed several statistical techniques to sample beer to assure its quality.  Guinness didn't let him publish under his real name because these were trade secrets.&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;div class="section" id="expected-values-of-vector-random-variables"&gt;
&lt;h3&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/#id11"&gt;4.4   6.3 Expected values of vector random variables&lt;/a&gt;&lt;/h3&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Section 6.3, page 316, extends the covariance to a matrix.  Even with N variables, note that we're comparing only pairs of variables.  If there were a complicated 3-variable dependency, which could happen (and did in a much earlier example), all the pairwise covariances could still be 0.&lt;/li&gt;
&lt;li&gt;Note the sequence.&lt;ol class="loweralpha"&gt;
&lt;li&gt;First, the correlation matrix has the expectations of the products.&lt;/li&gt;
&lt;li&gt;Then the covariance matrix corrects for the means not being 0.&lt;/li&gt;
&lt;li&gt;Finally the correlation coefficients (not shown here) correct for the variances not being 1.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
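&lt;p&gt;The sequence can be written out as follows.  This is a sketch; the symbols $\mathbf{R}$, $\mathbf{K}$, $\mathbf{m}$ and $\sigma_i$ are notation assumed here for the correlation matrix, covariance matrix, mean vector, and standard deviation of $X_i$.&lt;/p&gt;
&lt;p&gt;$$\mathbf{R} = E\left[\mathbf{X}\mathbf{X}^T\right], \qquad R_{ij} = E[X_i X_j]$$&lt;/p&gt;
&lt;p&gt;$$\mathbf{K} = E\left[(\mathbf{X}-\mathbf{m})(\mathbf{X}-\mathbf{m})^T\right] = \mathbf{R} - \mathbf{m}\mathbf{m}^T, \qquad K_{ij} = \text{COV}[X_i, X_j]$$&lt;/p&gt;
&lt;p&gt;$$\rho_{ij} = \frac{K_{ij}}{\sigma_i \sigma_j}, \qquad \sigma_i^2 = K_{ii}$$&lt;/p&gt;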
&lt;/div&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class25/</guid><pubDate>Wed, 18 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 24 Mon 2018-04-16</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#material-from-text" id="id1"&gt;1   Material from text&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#tutorial-on-probability-density-2-variables" id="id2"&gt;2   Tutorial on probability density - 2 variables&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#chapter-6-vector-random-variables" id="id3"&gt;3   Chapter 6: Vector random variables&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#id1"&gt;1   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.47, page 282: Estimation of signal in noise&lt;/p&gt;
&lt;ol class="loweralpha"&gt;
&lt;li&gt;&lt;p class="first"&gt;This is our perennial example of signal and noise.  However, here the signal is not just $\pm1$ but is normal.  Our job is to find the &lt;em&gt;most likely&lt;/em&gt; input signal for a given output.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Important concept in the noisy channel example (with X and N both being
Gaussian):   The most likely value of X given Y is
not Y but is somewhat smaller in magnitude, depending on the relative sizes of
&lt;span class="math"&gt;\(\sigma_X\)&lt;/span&gt; and &lt;span class="math"&gt;\(\sigma_N\)&lt;/span&gt;.  This is true in spite of &lt;span class="math"&gt;\(\mu_N=0\)&lt;/span&gt;. It
would be really useful for you to understand this intuitively.  Here's
one way:&lt;/p&gt;
&lt;p&gt;If you don't know Y, then the most likely value of X is 0.  Knowing Y
gives you more information, which you combine with your initial info
(that X is &lt;span class="math"&gt;\(N(0,\sigma_X)\)&lt;/span&gt;) to get a new estimate for the most likely X.
The smaller the noise, the more valuable Y is.  If the noise is very
small, then the most likely X is close to Y.  If the noise is very
large (on average), then the most likely X is still close to 0.&lt;/p&gt;
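&lt;p&gt;Here is a sketch of the calculation behind that intuition.  It assumes that $Y=X+N$ with X and N independent, X being $N(0,\sigma_X)$ and N being $N(0,\sigma_N)$; that model is an assumption stated here, so check it against the text's example.  By Bayes' rule, the conditional pdf of X given $Y=y$ is proportional to the prior pdf of X times the noise pdf evaluated at $y-x$:&lt;/p&gt;
&lt;p&gt;$$f_X(x|y) \propto \exp\left( -\frac{x^2}{2\sigma_X^2} - \frac{(y-x)^2}{2\sigma_N^2} \right)$$&lt;/p&gt;
&lt;p&gt;Setting the derivative of the exponent with respect to $x$ to zero gives the most likely X:&lt;/p&gt;
&lt;p&gt;$$\frac{x}{\sigma_X^2} = \frac{y-x}{\sigma_N^2} \quad\Longrightarrow\quad \hat{x} = \frac{\sigma_X^2}{\sigma_X^2+\sigma_N^2}\, y$$&lt;/p&gt;
&lt;p&gt;As $\sigma_N\to0$ the factor goes to 1, so $\hat{x}\to y$; as $\sigma_N\to\infty$ it goes to 0, so $\hat{x}\to0$.&lt;/p&gt;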
&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="tutorial-on-probability-density-2-variables"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#id2"&gt;2   Tutorial on probability density - 2 variables&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;In class 15, I tried to motivate the effect of changing one variable on probability density.   Here's a try at motivating changing 2 variables.&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;We're throwing darts uniformly at a one foot square dartboard.&lt;/li&gt;
&lt;li&gt;We observe 2 random variables, X, Y, where the dart hits (in Cartesian coordinates).&lt;/li&gt;
&lt;li&gt;$$f_{X,Y}(x,y) =  \begin{cases} 1&amp;amp; \text{if}\,\,  0\le x\le1 \cap 0\le y\le1\\ 0&amp;amp;\text{otherwise} \end{cases}$$&lt;/li&gt;
&lt;li&gt;$$P[.5\le X\le .6 \cap .8\le Y\le.9]  = \int_{.5}^{.6}\int_{.8}^{.9} f_{X,Y}(x,y)\, dy \, dx = 0.01 $$&lt;/li&gt;
&lt;li&gt;Transform to centimeters (taking 1 foot as approximately 30 cm):  $$\begin{bmatrix}V\\W\end{bmatrix} = \begin{pmatrix}30&amp;amp;0\\0&amp;amp;30\end{pmatrix} \begin{bmatrix}X\\Y\end{bmatrix}$$&lt;/li&gt;
&lt;li&gt;$$f_{V,W}(v,w)   = \begin{cases} 1/900&amp;amp; \text{if } 0\le v\le30 \cap 0\le w\le30\\ 0&amp;amp;\text{otherwise} \end{cases}$$&lt;/li&gt;
&lt;li&gt;$$P[15\le V\le 18 \cap 24\le W\le27]  = \int_{15}^{18}\int_{24}^{27} f_{V,W}(v,w)\, dw\, dv = \frac{ (18-15)(27-24) }{900} =  0.01$$&lt;/li&gt;
&lt;li&gt;See Section 5.8.3 on page 286.&lt;/li&gt;
&lt;/ol&gt;
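&lt;p&gt;The factor 1/900 is an instance of the general rule for an invertible linear transformation.  As a sketch, writing $A$ for the matrix above (the name $A$ is introduced here only for illustration):&lt;/p&gt;
&lt;p&gt;$$f_{V,W}(v,w) = \frac{f_{X,Y}(x,y)}{|\det A|} \quad \text{evaluated at} \quad \begin{bmatrix}x\\y\end{bmatrix} = A^{-1} \begin{bmatrix}v\\w\end{bmatrix}, \qquad |\det A| = 30\cdot30 = 900$$&lt;/p&gt;
&lt;p&gt;The area of any region grows by a factor of 900 while the density shrinks by the same factor, so the probability of the region is unchanged.&lt;/p&gt;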
&lt;/div&gt;
&lt;div class="section" id="chapter-6-vector-random-variables"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/#id3"&gt;3   Chapter 6: Vector random variables&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Skip the starred sections.&lt;/li&gt;
&lt;li&gt;Examples:&lt;ol class="loweralpha"&gt;
&lt;li&gt;arrivals in a multiport switch,&lt;/li&gt;
&lt;li&gt;audio signal at different times.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;pmf, cdf, marginal pmf and cdf are obvious.&lt;/li&gt;
&lt;li&gt;conditional pmf has a nice chaining rule.&lt;/li&gt;
&lt;li&gt;For continuous random variables, the pdf, cdf, conditional pdf etc are all obvious.&lt;/li&gt;
&lt;li&gt;Independence is obvious.&lt;/li&gt;
&lt;li&gt;Work out example 6.5, page 306.  The input ports are a distraction.
This problem reduces to a multinomial probability where N is itself a
random variable.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class24/</guid><pubDate>Sun, 15 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 23 Thu 2018-04-12</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class23/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class23/#material-from-text" id="id1"&gt;1   Material from text&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class23/#id1"&gt;1   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.35 Maximum A Posteriori Receiver on page 268.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.37, page 270.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Remember equations 5.49 a,b for total probability on page 269-70 for conditional expectation of Y given X.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Section 5.8 page 271: Functions of two random variables.&lt;/p&gt;
&lt;ol class="loweralpha simple"&gt;
&lt;li&gt;This is an important topic.&lt;/li&gt;
&lt;li&gt;Linear transformation of two Gaussian r.v.&lt;/li&gt;
&lt;li&gt;Sum and difference of 2 jointly Gaussian r.v. with equal variances are independent.&lt;/li&gt;
&lt;/ol&gt;
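&lt;p&gt;A short check of the claim about the sum and difference, as a sketch for jointly Gaussian X and Y:&lt;/p&gt;
&lt;p&gt;$$\text{COV}[X+Y,\, X-Y] = \text{VAR}[X] - \text{COV}[X,Y] + \text{COV}[Y,X] - \text{VAR}[Y] = \text{VAR}[X] - \text{VAR}[Y]$$&lt;/p&gt;
&lt;p&gt;This is zero exactly when the two variances are equal.  Since $X+Y$ and $X-Y$ are a linear transformation of jointly Gaussian r.v., they are themselves jointly Gaussian, and for jointly Gaussian r.v. uncorrelated implies independent.&lt;/p&gt;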
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What's the point of transforming variables in engineering?  E.g. in video, (R,G,B) might be transformed to (Y,I,Q) with a 3x3 matrix multiply.  Y is brightness (mostly the green component).  I and Q are approximately the red and blue.  Since we see brightness more accurately than color hue, we want to transmit or compress Y with greater precision.  So, we want to do probabilities on all this.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.39 Sum of Two Random Variables, page 271.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.40 Sum of Nonindependent Gaussian Random Variables, page 272.&lt;/p&gt;
&lt;p&gt;I'll do the easier case of two independent N(0,1) r.v.   The sum will be N(0, $\sqrt{2}$ ), i.e., mean 0 and standard deviation $\sqrt2$ (variance 2).&lt;/p&gt;
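&lt;p&gt;As a sketch of where that comes from, let $Z=X+Y$ with X and Y independent N(0,1).  Means and variances add, and the sum of independent Gaussians is Gaussian:&lt;/p&gt;
&lt;p&gt;$$E[Z] = 0+0 = 0, \qquad \text{VAR}[Z] = 1+1 = 2, \qquad \sigma_Z = \sqrt2$$&lt;/p&gt;
&lt;p&gt;$$f_Z(z) = \frac{1}{\sqrt{2\pi}\,\sigma_Z}\, e^{-z^2/(2\sigma_Z^2)} = \frac{1}{2\sqrt{\pi}}\, e^{-z^2/4}$$&lt;/p&gt;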
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.44, page 275. Transform two independent Gaussian r.v. from (X,Y) to (R, $\theta$).&lt;/p&gt;
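&lt;p&gt;A sketch of the result, assuming that X and Y are independent N(0,1) and that $X=R\cos\Theta$, $Y=R\sin\Theta$.  The Jacobian of the change to polar coordinates contributes the factor $r$:&lt;/p&gt;
&lt;p&gt;$$f_{R,\Theta}(r,\theta) = r\, f_{X,Y}(r\cos\theta, r\sin\theta) = \frac{r}{2\pi}\, e^{-r^2/2} = \left( r e^{-r^2/2} \right) \left( \frac{1}{2\pi} \right), \qquad r\ge0, \quad 0\le\theta\lt 2\pi$$&lt;/p&gt;
&lt;p&gt;The joint pdf factors, so R (Rayleigh) and $\Theta$ (uniform on $[0,2\pi)$) are independent.&lt;/p&gt;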
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Section 5.9, page 278: pairs of jointly Gaussian r.v.&lt;/p&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;I will simplify formula 5.61a by assuming that $\mu=0, \sigma=1$.&lt;/p&gt;
&lt;p&gt;$$f_{XY}(x,y)= \frac{1}{2\pi \sqrt{1-\rho^2}} e^{ \frac{-\left( x^2-2\rho x y + y^2\right)}{2(1-\rho^2)} }  $$ .&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The r.v. are probably dependent.  $\rho$ says how much.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The formula degenerates if $|\rho|=1$ since $1-\rho^2=0$ appears in both denominators.  However the distribution is still valid: it is the limiting case as $|\rho|\to1$, with all the probability concentrated on the line $y=\rho x$.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The lines of equal probability density are ellipses.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The marginal pdf is a 1 variable Gaussian.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
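&lt;p&gt;Several of these points follow from completing the square in the exponent.  This is a sketch for the simplified formula with $\mu=0, \sigma=1$ and $|\rho|\lt1$:&lt;/p&gt;
&lt;p&gt;$$x^2 - 2\rho x y + y^2 = (y-\rho x)^2 + (1-\rho^2)\, x^2$$&lt;/p&gt;
&lt;p&gt;$$f_{XY}(x,y) = \left[ \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2} \right] \left[ \frac{1}{\sqrt{2\pi(1-\rho^2)}}\, e^{-\frac{(y-\rho x)^2}{2(1-\rho^2)}} \right]$$&lt;/p&gt;
&lt;p&gt;The first factor is the marginal pdf of X, which is N(0,1).  The second is the conditional pdf of Y given $X=x$, which is Gaussian with mean $\rho x$ and variance $1-\rho^2$.  If $\rho=0$ the second factor no longer depends on $x$, so X and Y are independent.&lt;/p&gt;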
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class23/</guid><pubDate>Thu, 12 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 22 Mon 2018-04-09</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#homework-solutions" id="id3"&gt;1   Homework solutions&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#handwritten-notes-made-in-class" id="id4"&gt;2   Handwritten notes made in class&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#exam-2-and-estimated-final-grade" id="id5"&gt;3   Exam 2 and estimated final grade&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#final-exam" id="id6"&gt;4   Final exam&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#material-from-text" id="id7"&gt;5   Material from text&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="homework-solutions"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#id3"&gt;1   Homework solutions&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Here are all the &lt;a class="reference external" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/homework_sols/"&gt;homework solutions&lt;/a&gt;.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="handwritten-notes-made-in-class"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#id4"&gt;2   Handwritten notes made in class&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Here are all my &lt;a class="reference external" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/handwritten/"&gt;handwritten notes made in class&lt;/a&gt;.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="exam-2-and-estimated-final-grade"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#id5"&gt;3   Exam 2 and estimated final grade&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Exam 2 will be returned in class on Thurs.  Shortly after, I will compute an estimated letter grade, assuming that you don't write exam 3 (the final exam).   Later, it will be refined into a guaranteed letter grade, assuming you don't do any more homeworks.  That will require working in the latest iclicker scores etc.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="final-exam"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#id6"&gt;4   Final exam&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Six students need a conflict exam (if everyone writes it).   Next week, I'll set up an online poll for those students, to pick a good time.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/#id7"&gt;5   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Example 5.31 on page 264 in detail.  This is the noisy comm channel, now with Gaussian (normal) noise.&lt;/li&gt;
&lt;li&gt;Remember equation 5.45 for conditional pdf on page 265.&lt;/li&gt;
&lt;li&gt;Remember equation 5.48 for total probability on page 266.&lt;/li&gt;
&lt;li&gt;Example 5.33 on page 267.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class22/</guid><pubDate>Mon, 09 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 21 Thu 2018-04-05</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#parallel-computer-access" id="id1"&gt;1   Parallel computer access&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#final-exam" id="id2"&gt;2   Final exam&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#material-from-text" id="id3"&gt;3   Material from text&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="parallel-computer-access"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#id1"&gt;1   Parallel computer access&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;As I mentioned Monday, parallel.ecse.rpi.edu is available.   It has a dual 14-core (56 hyperthread) 2GHz Intel Xeon CPU, Intel Xeon Phi coprocessor with 60 cores running 240 threads, Nvidia GeForce GTX 1080 GPU with 2560 CUDA cores, and 256GB main memory.&lt;/p&gt;
&lt;p&gt;Parallel SW includes CUDA, Thrust, OpenMP, TBB.&lt;/p&gt;
&lt;p&gt;Using it for private business is against RPI policy.&lt;/p&gt;
&lt;p&gt;It is slower than an IBM Blue Gene, but it cost only $10K total.&lt;/p&gt;
&lt;p&gt;I manage it; ask me for an account.&lt;/p&gt;
&lt;p&gt;Many datasets that are considered to be big data will fit into main memory; MPI etc are not required.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="final-exam"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#id2"&gt;2   Final exam&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The final exam will be 80 minutes, like the first 2 exams.&lt;/p&gt;
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/#id3"&gt;3   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 4.33, page 177.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 4.36, page 180,  Amplitude Samples of a Sinusoidal Waveform&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Equation 5.32 on page 259.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.27 on page 260.   Uncorrelated but Dependent Random Variables.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.29 on page 263. Loaded Dice&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.30 on page 263.  Number of Defects in a Region; Random Splitting of Poisson Counts&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.31 on page 264.  This is a noisy comm channel, now with Gaussian (normal) noise.  This is a more realistic version of the earlier example with uniform noise.  The application problems are:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;what input signal to infer from each output,&lt;/li&gt;
&lt;li&gt;how accurate is this, and&lt;/li&gt;
&lt;li&gt;what cutoff minimizes the error?&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In the real world there are several ways you could reduce that error:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Increase the transmitted signal,&lt;/li&gt;
&lt;li&gt;Reduce the noise,&lt;/li&gt;
&lt;li&gt;Retransmit several times and vote.&lt;/li&gt;
&lt;li&gt;Handshake: Include a checksum and ask for retransmission if it fails.&lt;/li&gt;
&lt;li&gt;Instead of just deciding X=+1 or X=-1 depending on Y, have a 3rd decision, i.e., &lt;em&gt;uncertain&lt;/em&gt; if $|Y|&amp;lt;0.5$, and ask for retransmission in that case.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;This is relevant to solving Problem 5.3, page 288.&lt;/p&gt;
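&lt;p&gt;To make the first three remedies concrete, here is a sketch under assumptions that are stated here rather than taken from the text: $X=\pm1$ with equal probability, $Y=X+N$ with N zero-mean Gaussian with standard deviation $\sigma$, and a cutoff at $Y=0$.  Q is the upper tail probability of N(0,1).&lt;/p&gt;
&lt;p&gt;$$P_e = P[N \gt 1] = Q\left(\frac{1}{\sigma}\right), \qquad \sigma=1: \; P_e = Q(1) \approx 0.1587$$&lt;/p&gt;
&lt;p&gt;Doubling the signal amplitude, or halving $\sigma$, gives $Q(2)\approx0.0228$.  Sending each bit 3 times and taking a majority vote, with independent errors of probability $p=0.1587$ each, gives&lt;/p&gt;
&lt;p&gt;$$P[\text{at least 2 of 3 wrong}] = 3p^2(1-p) + p^3 \approx 0.068$$&lt;/p&gt;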
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class21/</guid><pubDate>Thu, 05 Apr 2018 04:00:00 GMT</pubDate></item><item><title>Engineering Probability Class 20 Mon 2018-04-02</title><link>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/</link><dc:creator>W Randolph Franklin (WRF), RPI</dc:creator><description>&lt;div&gt;&lt;div class="contents topic" id="table-of-contents"&gt;
&lt;p class="topic-title first"&gt;Table of contents&lt;/p&gt;
&lt;ul class="auto-toc simple"&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#final-exam-conflicts" id="id1"&gt;1   Final exam conflicts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#normal-distribution-table" id="id2"&gt;2   Normal distribution table&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#not-in-text-enrichment-large-effect-of-small-bias" id="id3"&gt;3   Not in text enrichment - large effect of small bias&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a class="reference internal" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#material-from-text" id="id4"&gt;4   Material from text&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/div&gt;
&lt;!--  --&gt;
&lt;style&gt; .red {color:red} &lt;/style&gt;
&lt;style&gt; .blue {color:blue} &lt;/style&gt;&lt;div class="section" id="final-exam-conflicts"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#id1"&gt;1   Final exam conflicts&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;If you have 3 exams on that day, or another exam at the same time, please email me this week.  Tell me the other courses.&lt;/li&gt;
&lt;li&gt;The RPI rule is that the lower numbered course has precedence.  If one of your other classes has a number higher than 2500, then it gives the conflict exam.  If all the other courses with exams that day are lower than 2500, then I do.&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;
&lt;div class="section" id="normal-distribution-table"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#id2"&gt;2   Normal distribution table&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;For your convenience, here is a table for N(0,1).  I computed it with Matlab:&lt;/p&gt;
&lt;pre class="literal-block"&gt;
x          f(x)      F(x)      Q(x)
-3.0000    0.0044    0.0013    0.9987
-2.9000    0.0060    0.0019    0.9981
-2.8000    0.0079    0.0026    0.9974
-2.7000    0.0104    0.0035    0.9965
-2.6000    0.0136    0.0047    0.9953
-2.5000    0.0175    0.0062    0.9938
-2.4000    0.0224    0.0082    0.9918
-2.3000    0.0283    0.0107    0.9893
-2.2000    0.0355    0.0139    0.9861
-2.1000    0.0440    0.0179    0.9821
-2.0000    0.0540    0.0228    0.9772
-1.9000    0.0656    0.0287    0.9713
-1.8000    0.0790    0.0359    0.9641
-1.7000    0.0940    0.0446    0.9554
-1.6000    0.1109    0.0548    0.9452
-1.5000    0.1295    0.0668    0.9332
-1.4000    0.1497    0.0808    0.9192
-1.3000    0.1714    0.0968    0.9032
-1.2000    0.1942    0.1151    0.8849
-1.1000    0.2179    0.1357    0.8643
-1.0000    0.2420    0.1587    0.8413
-0.9000    0.2661    0.1841    0.8159
-0.8000    0.2897    0.2119    0.7881
-0.7000    0.3123    0.2420    0.7580
-0.6000    0.3332    0.2743    0.7257
-0.5000    0.3521    0.3085    0.6915
-0.4000    0.3683    0.3446    0.6554
-0.3000    0.3814    0.3821    0.6179
-0.2000    0.3910    0.4207    0.5793
-0.1000    0.3970    0.4602    0.5398
      0    0.3989    0.5000    0.5000
 0.1000    0.3970    0.5398    0.4602
 0.2000    0.3910    0.5793    0.4207
 0.3000    0.3814    0.6179    0.3821
 0.4000    0.3683    0.6554    0.3446
 0.5000    0.3521    0.6915    0.3085
 0.6000    0.3332    0.7257    0.2743
 0.7000    0.3123    0.7580    0.2420
 0.8000    0.2897    0.7881    0.2119
 0.9000    0.2661    0.8159    0.1841
 1.0000    0.2420    0.8413    0.1587
 1.1000    0.2179    0.8643    0.1357
 1.2000    0.1942    0.8849    0.1151
 1.3000    0.1714    0.9032    0.0968
 1.4000    0.1497    0.9192    0.0808
 1.5000    0.1295    0.9332    0.0668
 1.6000    0.1109    0.9452    0.0548
 1.7000    0.0940    0.9554    0.0446
 1.8000    0.0790    0.9641    0.0359
 1.9000    0.0656    0.9713    0.0287
 2.0000    0.0540    0.9772    0.0228
 2.1000    0.0440    0.9821    0.0179
 2.2000    0.0355    0.9861    0.0139
 2.3000    0.0283    0.9893    0.0107
 2.4000    0.0224    0.9918    0.0082
 2.5000    0.0175    0.9938    0.0062
 2.6000    0.0136    0.9953    0.0047
 2.7000    0.0104    0.9965    0.0035
 2.8000    0.0079    0.9974    0.0026
 2.9000    0.0060    0.9981    0.0019
 3.0000    0.0044    0.9987    0.0013
&lt;/pre&gt;
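&lt;p&gt;A short sketch of how the columns relate and how the table applies to a general normal r.v.  Here f is the N(0,1) pdf, F its cdf, and Q the upper tail:&lt;/p&gt;
&lt;p&gt;$$f(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}, \qquad F(x) = \int_{-\infty}^x f(t)\, dt, \qquad Q(x) = 1-F(x) = F(-x)$$&lt;/p&gt;
&lt;p&gt;For X that is normal with mean $\mu$ and standard deviation $\sigma$, standardize first:&lt;/p&gt;
&lt;p&gt;$$P[a \lt X \le b] = F\left(\frac{b-\mu}{\sigma}\right) - F\left(\frac{a-\mu}{\sigma}\right)$$&lt;/p&gt;
&lt;p&gt;For example, $P[|X-\mu|\le\sigma] = F(1)-F(-1) = 0.8413-0.1587 = 0.6826$.&lt;/p&gt;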
&lt;/div&gt;
&lt;div class="section" id="not-in-text-enrichment-large-effect-of-small-bias"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#id3"&gt;3   Not in text enrichment - large effect of small bias&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Consider tossing $n=10^6$ fair coins.&lt;/p&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;P[more heads than tails] $\approx$ 0.5 (very slightly less, since a tie is possible).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Now assume that each coin has chance of being heads $p=0.5005$.&lt;/p&gt;
&lt;p&gt;What's P[more heads than tails]?&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Now assume that 999,000 of the coins are fair, but 1,000 will always be heads.&lt;/p&gt;
&lt;p&gt;What's P[more heads than tails]?&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
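&lt;p&gt;One way to estimate the last two answers is the normal approximation to the binomial.  This is a sketch, not an exact computation; H denotes the number of heads among the coins that are actually random, and Q is the upper tail probability of N(0,1).&lt;/p&gt;
&lt;p&gt;With $p=0.5005$ for all $10^6$ coins:&lt;/p&gt;
&lt;p&gt;$$E[H] = np = 500500, \qquad \sigma_H = \sqrt{np(1-p)} \approx 500, \qquad P[H \gt 500000] \approx Q\left(\frac{500000-500500}{500}\right) = Q(-1) \approx 0.84$$&lt;/p&gt;
&lt;p&gt;With 1,000 coins that are always heads, there are more heads than tails when the 999,000 fair coins give $1000+H \gt 999000-H$, i.e., $H \gt 499000$:&lt;/p&gt;
&lt;p&gt;$$E[H] = 499500, \qquad \sigma_H = \sqrt{999000/4} \approx 499.75, \qquad P[H \gt 499000] \approx Q\left(\frac{499000-499500}{499.75}\right) \approx Q(-1) \approx 0.84$$&lt;/p&gt;
&lt;p&gt;In both cases a bias of only 0.1% moves the probability from about 0.5 to about 0.84.&lt;/p&gt;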
&lt;/div&gt;
&lt;div class="section" id="material-from-text"&gt;
&lt;h2&gt;&lt;a class="toc-backref" href="https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/#id4"&gt;4   Material from text&lt;/a&gt;&lt;/h2&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.17 on page 253.  P[X+Y&amp;lt;=1]&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.18 on page 253.  Joint Gaussian.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.19 on page 255.  Independence.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.20 on page 255.  Independence of Q and R in the block transmission example.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Independence: Example 5.22 on page 256.  Are 2 normal r.v. independent for different values of $\rho$ ?&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.31 on page 264.  This is a noisy comm channel, now with Gaussian (normal) noise.  The problems are:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;what input signal to infer from each output, and&lt;/li&gt;
&lt;li&gt;how accurate is this?&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;5.6.2 Joint moments etc&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Work out for 2 3-sided dice.&lt;/li&gt;
&lt;li&gt;Work out for tossing dart onto triangular board.&lt;/li&gt;
&lt;/ol&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.27: correlation measures "linear dependence".  If the dependence is more complicated, the variables may be dependent but not correlated.&lt;/p&gt;
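&lt;p&gt;A small illustration of that point.  This pair is chosen here for simplicity and is not necessarily the text's example: let X be uniform on $[-1,1]$ and $Y=X^2$.&lt;/p&gt;
&lt;p&gt;$$E[X] = 0, \qquad E[XY] = E[X^3] = 0, \qquad \text{COV}[X,Y] = E[XY] - E[X]E[Y] = 0$$&lt;/p&gt;
&lt;p&gt;So X and Y are uncorrelated, yet Y is completely determined by X.&lt;/p&gt;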
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Covariance, correlation coefficient.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Section 5.7, page 261. Conditional pdf.  There is nothing majorly new here; it's an obvious extension of 1 variable.&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Discrete: Work out an example with a pair of 3-sided loaded dice.&lt;/li&gt;
&lt;li&gt;Continuous: a triangular dart board.  There is one little trick: P[X=x]=0 since X is continuous, so how can we compute P[Y=y|X=x] = P[Y=y and X=x]/P[X=x]?  The answer is that we take the limiting probability P[x&amp;lt;X&amp;lt;x+dx] etc as dx shrinks, which nets out to using f(x) etc.&lt;/li&gt;
&lt;/ol&gt;
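&lt;p&gt;The limiting argument for the continuous case can be sketched as follows, for small $dx$ and $dy$ and for $x$ with $f_X(x)\gt0$:&lt;/p&gt;
&lt;p&gt;$$P[y\lt Y\le y+dy \,|\, x\lt X\le x+dx] = \frac{P[x\lt X\le x+dx \cap y\lt Y\le y+dy]}{P[x\lt X\le x+dx]} \approx \frac{f_{X,Y}(x,y)\, dx\, dy}{f_X(x)\, dx} = \frac{f_{X,Y}(x,y)}{f_X(x)}\, dy$$&lt;/p&gt;
&lt;p&gt;The $dx$ cancels, which leaves the conditional pdf:&lt;/p&gt;
&lt;p&gt;$$f_Y(y|x) = \frac{f_{X,Y}(x,y)}{f_X(x)}$$&lt;/p&gt;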
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.31 on page 264.  This is a noisy comm channel, now with Gaussian (normal) noise.  This is a more realistic version of the earlier example with uniform noise.  The application problems are:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;what input signal to infer from each output,&lt;/li&gt;
&lt;li&gt;how accurate is this, and&lt;/li&gt;
&lt;li&gt;what cutoff minimizes the error probability?&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;In the real world there are several ways you could reduce that error:&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;Increase the transmitted signal.&lt;/li&gt;
&lt;li&gt;Reduce the noise.&lt;/li&gt;
&lt;li&gt;Retransmit several times and vote.&lt;/li&gt;
&lt;li&gt;Handshake: Include a checksum and ask for retransmission if it fails.&lt;/li&gt;
&lt;li&gt;Instead of just deciding X=+1 or X=-1 depending on Y, have a 3rd decision, i.e., &lt;em&gt;uncertain&lt;/em&gt; if $|Y|&amp;lt;0.5$, and ask for retransmission in that case.&lt;/li&gt;
&lt;/ol&gt;
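&lt;p&gt;A rough sketch of the numbers, under assumptions that are mine and may differ from the textbook's: the input is $X=\pm A$ with equal probability, the output is $Y=X+N$ with $N$ zero-mean Gaussian of standard deviation $\sigma$, and we decide by the sign of $Y$ (a cutoff at 0, which by symmetry is the best cutoff for equally likely inputs).  Then:&lt;/p&gt;
&lt;p&gt;$$P[\text{error}]=Q\left(\frac{A}{\sigma}\right), \qquad Q(x)=\frac{1}{\sqrt{2\pi}} \int_x^\infty e^{-t^2/2}\,dt$$&lt;/p&gt;
&lt;p&gt;For $A=\sigma$ this is $Q(1) \approx 0.16$.  Increasing the signal or reducing the noise both raise $A/\sigma$ and so shrink the error.  For voting, if each transmission is wrong independently with probability $p$, then the majority of 3 is wrong with probability:&lt;/p&gt;
&lt;p&gt;$$3p^2(1-p)+p^3$$&lt;/p&gt;
&lt;p&gt;E.g., $p=0.1$ gives $0.028$.&lt;/p&gt;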
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Section 5.8 page 271: Functions of two random variables.&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;We already saw how to compute the pdf of the sum and max of 2 r.v.&lt;/li&gt;
&lt;/ol&gt;
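&lt;p&gt;As a reminder, for the special case where $X$ and $Y$ are independent (an assumption; the general case needs the joint pdf), those two results are:&lt;/p&gt;
&lt;p&gt;$$f_{X+Y}(z)=\int_{-\infty}^{\infty} f_X(x)\,f_Y(z-x)\,dx, \qquad F_{\max(X,Y)}(w)=F_X(w)\,F_Y(w)$$&lt;/p&gt;
&lt;p&gt;The first is a convolution of the two pdfs; the second holds because the max is at most $w$ exactly when both variables are.&lt;/p&gt;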
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;What's the point of transforming variables in engineering?  E.g. in video, (R,G,B) might be transformed to (Y,I,Q) with a 3x3 matrix multiply.  Y is brightness (mostly the green component).  I and Q are approximately the red and blue.  Since we see brightness more accurately than color hue, we want to transmit Y with greater precision.  So, we want to do probabilities on all this.&lt;/p&gt;
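&lt;p&gt;To make the "mostly green" remark concrete, here is the first row of that matrix, using the commonly quoted NTSC luma weights (treat the exact decimals as approximate):&lt;/p&gt;
&lt;p&gt;$$Y=0.299\,R+0.587\,G+0.114\,B$$&lt;/p&gt;
&lt;p&gt;If we model $R$, $G$, $B$ as random variables, then linearity of expectation gives $E[Y]=0.299\,E[R]+0.587\,E[G]+0.114\,E[B]$ whether or not the colors are independent.  The variance of $Y$, however, also depends on their covariances.&lt;/p&gt;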
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Functions of 2 random variables&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;This is an important topic.&lt;/li&gt;
&lt;li&gt;Example 5.44, page 275. Transform two independent Gaussian r.v. from
(X,Y) to (R, $\theta$).&lt;/li&gt;
&lt;li&gt;Linear transformation of two Gaussian r.v.&lt;/li&gt;
&lt;li&gt;Sum and difference of 2 jointly Gaussian r.v. with equal variances are independent.&lt;/li&gt;
&lt;/ol&gt;
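&lt;p&gt;A sketch of the polar transformation, assuming zero means and unit variances for $X$ and $Y$ (my simplification).  Substitute $x=r\cos\theta$, $y=r\sin\theta$ and multiply by the Jacobian $r$:&lt;/p&gt;
&lt;p&gt;$$f_{R\Theta}(r,\theta)=f_{XY}(r\cos\theta, r\sin\theta)\cdot r=\frac{1}{2\pi} \cdot r\,e^{-r^2/2}, \qquad r \ge 0, \quad 0 \le \theta \le 2\pi$$&lt;/p&gt;
&lt;p&gt;This is a product of a uniform pdf in $\theta$ and a Rayleigh pdf in $r$, so the radius and angle are independent.  For the sum and difference, the key step is the covariance:&lt;/p&gt;
&lt;p&gt;$$COV(X+Y,\,X-Y)=VAR[X]-VAR[Y]$$&lt;/p&gt;
&lt;p&gt;This is zero when the variances are equal, and for jointly Gaussian r.v. zero covariance implies independence.&lt;/p&gt;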
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Section 5.9, page 278: pairs of jointly Gaussian r.v.&lt;/p&gt;
&lt;ol class="arabic"&gt;
&lt;li&gt;&lt;p class="first"&gt;I will simplify formula 5.61a by assuming that $\mu=0, \sigma=1$.&lt;/p&gt;
&lt;p&gt;$$f_{XY}(x,y)= \frac{1}{2\pi \sqrt{1-\rho^2}} \exp\left( -\frac{x^2-2\rho x y + y^2}{2(1-\rho^2)} \right)$$&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The r.v. are probably dependent.  $\rho$ says how much.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The formula degenerates if $|\rho|=1$ since $1-\rho^2=0$ and we would divide by zero.  However the distribution is still valid: all the probability sits on the line $y=\rho x$, so $Y=\rho X$ and there is no ordinary 2-variable pdf.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The lines of equal probability density are ellipses.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;The marginal pdf is a 1 variable Gaussian.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
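&lt;p&gt;A sketch of why the marginal is Gaussian, for the simplified zero-mean, unit-variance case.  Complete the square in the exponent:&lt;/p&gt;
&lt;p&gt;$$x^2-2\rho x y+y^2=(y-\rho x)^2+(1-\rho^2)\,x^2$$&lt;/p&gt;
&lt;p&gt;so the joint pdf splits into a marginal times a conditional:&lt;/p&gt;
&lt;p&gt;$$f_{XY}(x,y)=\left( \frac{1}{\sqrt{2\pi}} e^{-x^2/2} \right) \left( \frac{1}{\sqrt{2\pi(1-\rho^2)}} \exp\left( -\frac{(y-\rho x)^2}{2(1-\rho^2)} \right) \right)$$&lt;/p&gt;
&lt;p&gt;The second factor integrates to 1 over $y$, leaving the standard normal pdf for $X$.  It also shows that, given $X=x$, $Y$ is Gaussian with mean $\rho x$ and variance $1-\rho^2$.&lt;/p&gt;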
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Example 5.47, page 282: Estimation of signal in noise&lt;/p&gt;
&lt;ol class="arabic simple"&gt;
&lt;li&gt;This is our perennial example of signal and noise.  However, here the signal is not just $\pm1$ but is normal.  Our job is to find the "most likely" input signal for a given output.&lt;/li&gt;
&lt;/ol&gt;
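&lt;p&gt;A sketch of where this ends up, assuming (check the textbook for its exact setup) that $Y=X+N$ with $X$ and $N$ independent zero-mean Gaussian r.v. with variances $\sigma_X^2$ and $\sigma_N^2$.  Then $X$ and $Y$ are jointly Gaussian, the conditional pdf of $X$ given $Y=y$ is Gaussian, and its peak is at its mean:&lt;/p&gt;
&lt;p&gt;$$\hat{x}=\frac{COV(X,Y)}{VAR[Y]}\,y=\frac{\sigma_X^2}{\sigma_X^2+\sigma_N^2}\,y$$&lt;/p&gt;
&lt;p&gt;With little noise, $\hat{x} \approx y$: we believe what we received.  With a lot of noise, $\hat{x} \approx 0$: we fall back on the prior mean of the signal.&lt;/p&gt;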
&lt;/li&gt;
&lt;li&gt;&lt;p class="first"&gt;Next time: We've seen 1 r.v., we've seen 2 r.v.  Now we'll see several r.v.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;/div&gt;&lt;/div&gt;</description><category>mathjax</category><guid>https://wrf.ecse.rpi.edu/Teaching/probability-s2018/posts/class20/</guid><pubDate>Mon, 02 Apr 2018 04:00:00 GMT</pubDate></item></channel></rss>