{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": false
   },
   "source": [
    "# Lecture 17\n",
    "\n",
    "Today:\n",
    "1. Review of hypotesis test\n",
    "2. Application: A/B Testing\n",
    "    + Example\n",
    "3. Causality"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": false
   },
   "source": [
    "# 1. Review of hypotesis test\n",
    "\n",
    "A possible rule for rejecting the null hypothesis:\n",
    "\n",
    "- establish cutoff for p-value\n",
    "\n",
    "- for example, a 5% cutoff: if the observed p-value is 5% or less, then reject the null hypothesis. Otherwise, do not reject it"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": false
   },
   "source": [
    "# 2. A/B Testing: Comparing Two Samples\n",
    "\n",
    "- compare values of sampled individuals in group a with values of sampled individuals in group b\n",
    "- example: random sample of visiotrs to etsy. comparing A) click rate using design A vs B) click rate using design B"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": false
   },
   "source": [
    "### Example: smoking behaviors of mothers and its influence on babies weights\n",
    "\n",
    "- comparing A) birth weights of babies of mothers who smoked during pregnancy vs. B) birth weights of babies of mothers who didn't smoke. question: could the difference be due to chance alone?\n",
    "\n",
    "HYPOTHESES\n",
    "- Null: In the population, the distributions of the birth weights of babies in two groups are the same\n",
    "- Alternate: babies of the mothers who smoked weighed less than the babies of the non-smokers\n",
    "- To test this we have to compute a test statistic (one number) between group A and group B. the test statistic is group b - group a\n",
    "    - the statistic for the null hypothesis would be 0\n",
    "\n",
    "SIMULATION\n",
    "- If the null is true, all rearrangements of the birth weights among the two groups are equally likely.\n",
    "- Plan:\n",
    "    - shuffle birth weights\n",
    "    - assign some to \"group a\" and the rest to \"group b,\" maintaining sample sizes\n",
    "    - find the difference b/t the averages of two shuffled groups\n",
    "    -repeat"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "\n",
      "Attaching package: ‘dplyr’\n",
      "\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "The following objects are masked from ‘package:stats’:\n",
      "\n",
      "    filter, lag\n",
      "\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      "The following objects are masked from ‘package:base’:\n",
      "\n",
      "    intersect, setdiff, setequal, union\n",
      "\n"
     ]
    }
   ],
   "source": [
    "library('dplyr')\n",
    "library('ggplot2')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
   ],
   "source": [
    "babyweight <- read.csv(\"babyweight.csv\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<ol class=list-inline>\n",
       "\t<li>32</li>\n",
       "\t<li>4</li>\n",
       "</ol>\n"
      ]
     },
     "execution_count": 12,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dim(babyweight)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<table>\n",
       "<thead><tr><th scope=col>X</th><th scope=col>Wgt</th><th scope=col>Gest</th><th scope=col>Smoke</th></tr></thead>\n",
       "<tbody>\n",
       "\t<tr><td>1   </td><td>2940</td><td>38  </td><td>yes </td></tr>\n",
       "\t<tr><td>2   </td><td>3130</td><td>38  </td><td>no  </td></tr>\n",
       "\t<tr><td>3   </td><td>2420</td><td>36  </td><td>yes </td></tr>\n",
       "\t<tr><td>4   </td><td>2450</td><td>34  </td><td>no  </td></tr>\n",
       "\t<tr><td>5   </td><td>2760</td><td>39  </td><td>yes </td></tr>\n",
       "\t<tr><td>6   </td><td>2440</td><td>35  </td><td>yes </td></tr>\n",
       "</tbody>\n",
       "</table>\n"
      ]
     },
     "execution_count": 3,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "head(babyweight)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<table>\n",
       "<thead><tr><th scope=col>X</th><th scope=col>Wgt</th><th scope=col>Gest</th><th scope=col>Smoke</th></tr></thead>\n",
       "<tbody>\n",
       "\t<tr><td> 1  </td><td>2940</td><td>38  </td><td>yes </td></tr>\n",
       "\t<tr><td> 3  </td><td>2420</td><td>36  </td><td>yes </td></tr>\n",
       "\t<tr><td> 5  </td><td>2760</td><td>39  </td><td>yes </td></tr>\n",
       "\t<tr><td> 6  </td><td>2440</td><td>35  </td><td>yes </td></tr>\n",
       "\t<tr><td> 8  </td><td>3301</td><td>42  </td><td>yes </td></tr>\n",
       "\t<tr><td>11  </td><td>2715</td><td>36  </td><td>yes </td></tr>\n",
       "</tbody>\n",
       "</table>\n"
      ]
     },
     "execution_count": 9,
     "metadata": {
     },
     "output_type": "execute_result"
    },
    {
     "data": {
      "text/html": [
       "2973.625"
      ]
     },
     "execution_count": 9,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "smokers <- filter( babyweight, Smoke == \"yes\")\n",
    "head(smoker)\n",
    "ave_weight_smokers <- mean(smokers$Wgt)\n",
    "ave_weight_smokers"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "3066.125"
      ]
     },
     "execution_count": 8,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "nonsmokers <- filter( babyweight, Smoke == \"no\")\n",
    "ave_weight_nonsmokers <- mean(nonsmokers$Wgt)\n",
    "ave_weight_nonsmokers"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<table>\n",
       "<thead><tr><th scope=col>Smoke</th><th scope=col>AveWeight</th></tr></thead>\n",
       "<tbody>\n",
       "\t<tr><td>no      </td><td>3066.125</td></tr>\n",
       "\t<tr><td>yes     </td><td>2973.625</td></tr>\n",
       "</tbody>\n",
       "</table>\n"
      ]
     },
     "execution_count": 10,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "babyweight_grouped <- group_by( babyweight, Smoke )\n",
    "babyweight_summary <- summarize(babyweight_grouped, AveWeight = mean(Wgt))\n",
    "babyweight_summary"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "92.5"
      ]
     },
     "execution_count": 11,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# observed statistic\n",
    "\n",
    "observed_diff <- ave_weight_nonsmokers - ave_weight_smokers\n",
    "observed_diff"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<ol class=list-inline>\n",
       "\t<li>2420</li>\n",
       "\t<li>3523</li>\n",
       "\t<li>3459</li>\n",
       "\t<li>2440</li>\n",
       "\t<li>2520</li>\n",
       "\t<li>3200</li>\n",
       "\t<li>2957</li>\n",
       "\t<li>3346</li>\n",
       "\t<li>2928</li>\n",
       "\t<li>2450</li>\n",
       "\t<li>2920</li>\n",
       "\t<li>3446</li>\n",
       "\t<li>2619</li>\n",
       "\t<li>3244</li>\n",
       "\t<li>3226</li>\n",
       "\t<li>3095</li>\n",
       "\t<li>2760</li>\n",
       "\t<li>3175</li>\n",
       "\t<li>3322</li>\n",
       "\t<li>2729</li>\n",
       "\t<li>3500</li>\n",
       "\t<li>2740</li>\n",
       "\t<li>2715</li>\n",
       "\t<li>3530</li>\n",
       "\t<li>2841</li>\n",
       "\t<li>3130</li>\n",
       "\t<li>3410</li>\n",
       "\t<li>3130</li>\n",
       "\t<li>2580</li>\n",
       "\t<li>2940</li>\n",
       "\t<li>3301</li>\n",
       "\t<li>3040</li>\n",
       "</ol>\n"
      ]
     },
     "execution_count": 16,
     "metadata": {
     },
     "output_type": "execute_result"
    },
    {
     "data": {
      "text/html": [
       "<ol class=list-inline>\n",
       "\t<li>2420</li>\n",
       "\t<li>3459</li>\n",
       "</ol>\n"
      ]
     },
     "execution_count": 16,
     "metadata": {
     },
     "output_type": "execute_result"
    },
    {
     "data": {
      "text/html": [
       "<ol class=list-inline>\n",
       "\t<li>2420</li>\n",
       "\t<li>3523</li>\n",
       "\t<li>3459</li>\n",
       "\t<li>2440</li>\n",
       "\t<li>2520</li>\n",
       "\t<li>3200</li>\n",
       "\t<li>2957</li>\n",
       "\t<li>3346</li>\n",
       "\t<li>2928</li>\n",
       "\t<li>2450</li>\n",
       "\t<li>2920</li>\n",
       "\t<li>3446</li>\n",
       "\t<li>2619</li>\n",
       "\t<li>3244</li>\n",
       "\t<li>3226</li>\n",
       "\t<li>3095</li>\n",
       "</ol>\n"
      ]
     },
     "execution_count": 16,
     "metadata": {
     },
     "output_type": "execute_result"
    },
    {
     "data": {
      "text/html": [
       "<ol class=list-inline>\n",
       "\t<li>2760</li>\n",
       "\t<li>3175</li>\n",
       "\t<li>3322</li>\n",
       "\t<li>2729</li>\n",
       "\t<li>3500</li>\n",
       "\t<li>2740</li>\n",
       "\t<li>2715</li>\n",
       "\t<li>3530</li>\n",
       "\t<li>2841</li>\n",
       "\t<li>3130</li>\n",
       "\t<li>3410</li>\n",
       "\t<li>3130</li>\n",
       "\t<li>2580</li>\n",
       "\t<li>2940</li>\n",
       "\t<li>3301</li>\n",
       "\t<li>3040</li>\n",
       "</ol>\n"
      ]
     },
     "execution_count": 16,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Selecting elements from a list\n",
    "\n",
    "shuffled_babies <- sample( babyweight$Wgt, 32, replace = FALSE )\n",
    "shuffled_babies\n",
    "shuffled_babies[c(1, 3)] #selecting first and third baby\n",
    "shuffled_babies[1:16] #first sixteen numbers in the list; telling R what index to select\n",
    "shuffled_babies[17:32] #last sixteen numbers in the list"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 19,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
   ],
   "source": [
    "# simulate\n",
    "\n",
    "num_simulations <- 1000\n",
    "\n",
    "# set up data frame with 1000 rows, each row being an observation. one column would be the test statistic. test statistic = mean weight of group b - mean weight of group a. two other columns would be average weight group A and average weight of group B.\n",
    "simulated_data <- data.frame(ave_weight_A = double(num_simulations), \n",
    "                             ave_weight_B = double(num_simulations),\n",
    "                             statistic = double(num_simulations) )\n",
    "\n",
    "\n",
    "count <- 1\n",
    "while( count <= num_simulations ) {\n",
    "\n",
    "    shuffled_babies <- sample( babyweight$Wgt, 32, replace = FALSE )\n",
    "    group_A <- shuffled_babies[1:16]\n",
    "    group_B <- shuffled_babies[17:32]\n",
    "    \n",
    "    #find mean of weight in each group, place in correct data frame, and then find the difference\n",
    "    simulated_data$ave_weight_A[count] <- mean(group_A)\n",
    "    simulated_data$ave_weight_B[count] <- mean(group_B)\n",
    "    simulated_data$statistic[count] <- simulated_data$ave_weight_B[count] - simulated_data$ave_weight_A[count]\n",
    "\n",
    "\n",
    "    count <- count + 1\n",
    "}"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 20,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "<table>\n",
       "<thead><tr><th scope=col>ave_weight_A</th><th scope=col>ave_weight_B</th><th scope=col>statistic</th></tr></thead>\n",
       "<tbody>\n",
       "\t<tr><td>3034.688</td><td>3005.062</td><td> -29.625</td></tr>\n",
       "\t<tr><td>2990.938</td><td>3048.812</td><td>  57.875</td></tr>\n",
       "\t<tr><td>2994.250</td><td>3045.500</td><td>  51.250</td></tr>\n",
       "\t<tr><td>3011.438</td><td>3028.312</td><td>  16.875</td></tr>\n",
       "\t<tr><td>3077.438</td><td>2962.312</td><td>-115.125</td></tr>\n",
       "\t<tr><td>3019.250</td><td>3020.500</td><td>   1.250</td></tr>\n",
       "</tbody>\n",
       "</table>\n"
      ]
     },
     "execution_count": 20,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "head(simulated_data)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 21,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "image/svg+xml": "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<svg xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\" width=\"504pt\" height=\"504pt\" viewBox=\"0 0 504 504\" version=\"1.1\">\n<defs>\n<g>\n<symbol overflow=\"visible\" id=\"glyph0-0\">\n<path style=\"stroke:none;\" d=\"\"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph0-1\">\n<path style=\"stroke:none;\" d=\"M 4.71875 -3.171875 C 4.71875 -5.453125 3.984375 -6.59375 2.5625 -6.59375 C 1.140625 -6.59375 0.40625 -5.4375 0.40625 -3.21875 C 0.40625 -1 1.140625 0.140625 2.5625 0.140625 C 3.953125 0.140625 4.71875 -1 4.71875 -3.171875 Z M 3.875 -3.25 C 3.875 -1.375 3.453125 -0.546875 2.53125 -0.546875 C 1.671875 -0.546875 1.234375 -1.40625 1.234375 -3.21875 C 1.234375 -5.015625 1.671875 -5.859375 2.5625 -5.859375 C 3.4375 -5.859375 3.875 -5.015625 3.875 -3.25 Z M 3.875 -3.25 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph0-2\">\n<path style=\"stroke:none;\" d=\"M 3.21875 0 L 3.21875 -6.59375 L 2.6875 -6.59375 C 2.40625 -5.578125 2.21875 -5.4375 0.953125 -5.28125 L 0.953125 -4.6875 L 2.40625 -4.6875 L 2.40625 0 Z M 3.21875 0 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph0-3\">\n<path style=\"stroke:none;\" d=\"M 4.75 -4.65625 C 4.75 -5.765625 3.890625 -6.59375 2.640625 -6.59375 C 1.296875 -6.59375 0.515625 -5.90625 0.46875 -4.296875 L 1.28125 -4.296875 C 1.34375 -5.40625 1.796875 -5.875 2.609375 -5.875 C 3.359375 -5.875 3.90625 -5.34375 3.90625 -4.640625 C 3.90625 -4.125 3.609375 -3.671875 3.015625 -3.34375 L 2.171875 -2.859375 C 0.796875 -2.078125 0.390625 -1.453125 0.3125 0 L 4.703125 0 L 4.703125 -0.8125 L 1.234375 -0.8125 C 1.3125 -1.34375 1.625 -1.6875 2.421875 -2.171875 L 3.359375 -2.671875 C 4.28125 -3.15625 4.75 -3.84375 4.75 -4.65625 Z M 4.75 -4.65625 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph0-4\">\n<path style=\"stroke:none;\" d=\"M 2.640625 -2.234375 L 2.640625 -2.90625 L 0.421875 -2.90625 L 0.421875 -2.234375 Z M 2.640625 -2.234375 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph0-5\">\n<path style=\"stroke:none;\" d=\"M 4.765625 -2.1875 C 4.765625 -3.484375 3.90625 -4.34375 2.640625 -4.34375 C 2.171875 -4.34375 1.796875 -4.21875 1.421875 -3.9375 L 1.6875 -5.640625 L 4.421875 -5.640625 L 4.421875 -6.453125 L 1.015625 -6.453125 L 0.53125 -3 L 1.28125 -3 C 1.671875 -3.453125 1.984375 -3.609375 2.484375 -3.609375 C 3.375 -3.609375 3.9375 -3.046875 3.9375 -2.078125 C 3.9375 -1.125 3.390625 -0.578125 2.484375 -0.578125 C 1.78125 -0.578125 1.34375 -0.953125 1.140625 -1.6875 L 0.328125 -1.6875 C 0.59375 -0.375 1.34375 0.140625 2.515625 0.140625 C 3.84375 0.140625 4.765625 -0.796875 4.765625 -2.1875 Z M 4.765625 -2.1875 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-0\">\n<path style=\"stroke:none;\" d=\"\"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-1\">\n<path style=\"stroke:none;\" d=\"M 5.28125 -1.6875 C 5.28125 -2.59375 4.765625 -3.03125 3.578125 -3.328125 L 2.65625 -3.546875 C 1.875 -3.71875 1.546875 -3.984375 1.546875 -4.40625 C 1.546875 -4.953125 2.03125 -5.3125 2.8125 -5.3125 C 3.59375 -5.3125 4 -4.984375 4.03125 -4.34375 L 5.03125 -4.34375 C 5.03125 -5.53125 4.25 -6.203125 2.859375 -6.203125 C 1.453125 -6.203125 0.546875 -5.46875 0.546875 -4.359375 C 0.546875 -3.421875 1.03125 -2.96875 2.453125 -2.625 L 3.34375 -2.40625 C 4.015625 -2.25 4.28125 -2.046875 4.28125 -1.609375 C 4.28125 -1.046875 3.71875 -0.71875 2.875 -0.71875 C 2.015625 -0.71875 1.53125 -0.921875 1.40625 -1.84375 L 0.390625 -1.84375 C 0.4375 -0.453125 1.21875 0.171875 2.796875 0.171875 C 4.3125 0.171875 5.28125 -0.53125 5.28125 -1.6875 Z M 5.28125 -1.6875 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-2\">\n<path style=\"stroke:none;\" d=\"M 2.921875 0 L 2.921875 -0.8125 C 2.796875 -0.765625 2.640625 -0.765625 2.46875 -0.765625 C 2.046875 -0.765625 1.9375 -0.875 1.9375 -1.296875 L 1.9375 -5.25 L 2.921875 -5.25 L 2.921875 -6.03125 L 1.9375 -6.03125 L 1.9375 -7.6875 L 0.984375 -7.6875 L 0.984375 -6.03125 L 0.15625 -6.03125 L 0.15625 -5.25 L 0.984375 -5.25 L 0.984375 -0.875 C 0.984375 -0.265625 1.390625 0.078125 2.140625 0.078125 C 2.375 0.078125 2.59375 0.0625 2.921875 0 Z M 2.921875 0 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-3\">\n<path style=\"stroke:none;\" d=\"M 6.15625 -0.015625 L 6.15625 -0.75 C 6.046875 -0.71875 6 -0.71875 5.953125 -0.71875 C 5.609375 -0.71875 5.421875 -0.890625 5.421875 -1.203125 L 5.421875 -4.546875 C 5.421875 -5.625 4.640625 -6.203125 3.15625 -6.203125 C 1.703125 -6.203125 0.8125 -5.640625 0.75 -4.25 L 1.71875 -4.25 C 1.796875 -4.984375 2.234375 -5.3125 3.125 -5.3125 C 3.984375 -5.3125 4.46875 -4.984375 4.46875 -4.421875 L 4.46875 -4.15625 C 4.46875 -3.765625 4.234375 -3.59375 3.46875 -3.5 C 2.109375 -3.328125 1.90625 -3.28125 1.546875 -3.125 C 0.84375 -2.84375 0.484375 -2.296875 0.484375 -1.5625 C 0.484375 -0.46875 1.234375 0.171875 2.46875 0.171875 C 3.234375 0.171875 3.984375 -0.15625 4.515625 -0.71875 C 4.609375 -0.25 5.03125 0.078125 5.5 0.078125 C 5.6875 0.078125 5.84375 0.0625 6.15625 -0.015625 Z M 4.46875 -2.078125 C 4.46875 -1.21875 3.59375 -0.671875 2.671875 -0.671875 C 1.921875 -0.671875 1.484375 -0.9375 1.484375 -1.59375 C 1.484375 -2.21875 1.90625 -2.5 2.9375 -2.640625 C 3.9375 -2.78125 4.15625 -2.828125 4.46875 -2.984375 Z M 4.46875 -2.078125 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-4\">\n<path style=\"stroke:none;\" d=\"M 1.765625 0 L 1.765625 -6.03125 L 0.8125 -6.03125 L 0.8125 0 Z M 1.890625 -6.9375 L 1.890625 -8.125 L 0.6875 -8.125 L 0.6875 -6.9375 Z M 1.890625 -6.9375 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph1-5\">\n<path style=\"stroke:none;\" d=\"M 5.484375 -2.0625 L 4.515625 -2.0625 C 4.359375 -1.109375 3.859375 -0.71875 3.046875 -0.71875 C 1.984375 -0.71875 1.359375 -1.53125 1.359375 -2.953125 C 1.359375 -4.46875 1.984375 -5.3125 3.03125 -5.3125 C 3.828125 -5.3125 4.328125 -4.84375 4.453125 -4 L 5.421875 -4 C 5.296875 -5.46875 4.375 -6.203125 3.03125 -6.203125 C 1.421875 -6.203125 0.359375 -4.953125 0.359375 -2.953125 C 0.359375 -1.015625 1.390625 0.171875 3.03125 0.171875 C 4.46875 0.171875 5.375 -0.6875 5.484375 -2.0625 Z M 5.484375 -2.0625 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-0\">\n<path style=\"stroke:none;\" d=\"\"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-1\">\n<path style=\"stroke:none;\" d=\"M -2.0625 -5.484375 L -2.0625 -4.515625 C -1.109375 -4.359375 -0.71875 -3.859375 -0.71875 -3.046875 C -0.71875 -1.984375 -1.53125 -1.359375 -2.953125 -1.359375 C -4.46875 -1.359375 -5.3125 -1.984375 -5.3125 -3.03125 C -5.3125 -3.828125 -4.84375 -4.328125 -4 -4.453125 L -4 -5.421875 C -5.46875 -5.296875 -6.203125 -4.375 -6.203125 -3.03125 C -6.203125 -1.421875 -4.953125 -0.359375 -2.953125 -0.359375 C -1.015625 -0.359375 0.171875 -1.390625 0.171875 -3.03125 C 0.171875 -4.46875 -0.6875 -5.375 -2.0625 -5.484375 Z M -2.0625 -5.484375 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-2\">\n<path style=\"stroke:none;\" d=\"M -2.96875 -5.859375 C -5.046875 -5.859375 -6.203125 -4.859375 -6.203125 -3.125 C -6.203125 -1.4375 -5.03125 -0.40625 -3.015625 -0.40625 C -0.984375 -0.40625 0.171875 -1.421875 0.171875 -3.140625 C 0.171875 -4.828125 -0.984375 -5.859375 -2.96875 -5.859375 Z M -2.984375 -4.859375 C -1.5625 -4.859375 -0.71875 -4.203125 -0.71875 -3.140625 C -0.71875 -2.0625 -1.546875 -1.421875 -3.015625 -1.421875 C -4.46875 -1.421875 -5.3125 -2.0625 -5.3125 -3.140625 C -5.3125 -4.21875 -4.46875 -4.859375 -2.984375 -4.859375 Z M -2.984375 -4.859375 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-3\">\n<path style=\"stroke:none;\" d=\"M 0 -5.546875 L -6.03125 -5.546875 L -6.03125 -4.59375 L -2.703125 -4.59375 C -1.46875 -4.59375 -0.671875 -3.9375 -0.671875 -2.9375 C -0.671875 -2.1875 -1.125 -1.703125 -1.84375 -1.703125 L -6.03125 -1.703125 L -6.03125 -0.75 L -1.46875 -0.75 C -0.46875 -0.75 0.171875 -1.5 0.171875 -2.671875 C 0.171875 -3.546875 -0.140625 -4.109375 -0.9375 -4.6875 L 0 -4.6875 Z M 0 -5.546875 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-4\">\n<path style=\"stroke:none;\" d=\"M 0 -5.59375 L -4.546875 -5.59375 C -5.546875 -5.59375 -6.203125 -4.859375 -6.203125 -3.6875 C -6.203125 -2.796875 -5.859375 -2.21875 -5.015625 -1.6875 L -6.03125 -1.6875 L -6.03125 -0.8125 L 0 -0.8125 L 0 -1.765625 L -3.328125 -1.765625 C -4.546875 -1.765625 -5.359375 -2.421875 -5.359375 -3.40625 C -5.359375 -4.15625 -4.90625 -4.640625 -4.171875 -4.640625 L 0 -4.640625 Z M 0 -5.59375 \"/>\n</symbol>\n<symbol overflow=\"visible\" id=\"glyph2-5\">\n<path style=\"stroke:none;\" d=\"M 0 -2.921875 L -0.8125 -2.921875 C -0.765625 -2.796875 -0.765625 -2.640625 -0.765625 -2.46875 C -0.765625 -2.046875 -0.875 -1.9375 -1.296875 -1.9375 L -5.25 -1.9375 L -5.25 -2.921875 L -6.03125 -2.921875 L -6.03125 -1.9375 L -7.6875 -1.9375 L -7.6875 -0.984375 L -6.03125 -0.984375 L -6.03125 -0.15625 L -5.25 -0.15625 L -5.25 -0.984375 L -0.875 -0.984375 C -0.265625 -0.984375 0.078125 -1.390625 0.078125 -2.140625 C 0.078125 -2.375 0.0625 -2.59375 0 -2.921875 Z M 0 -2.921875 \"/>\n</symbol>\n</g>\n<clipPath id=\"clip1\">\n  <path d=\"M 40.046875 5.480469 L 499 5.480469 L 499 470 L 40.046875 470 Z M 40.046875 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip2\">\n  <path d=\"M 40.046875 373 L 499 373 L 499 375 L 40.046875 375 Z M 40.046875 373 \"/>\n</clipPath>\n<clipPath id=\"clip3\">\n  <path d=\"M 40.046875 224 L 499 224 L 499 226 L 40.046875 226 Z M 40.046875 224 \"/>\n</clipPath>\n<clipPath id=\"clip4\">\n  <path d=\"M 40.046875 75 L 499 75 L 499 77 L 40.046875 77 Z M 40.046875 75 \"/>\n</clipPath>\n<clipPath id=\"clip5\">\n  <path d=\"M 80 5.480469 L 82 5.480469 L 82 470.6875 L 80 470.6875 Z M 80 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip6\">\n  <path d=\"M 192 5.480469 L 194 5.480469 L 194 470.6875 L 192 470.6875 Z M 192 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip7\">\n  <path d=\"M 303 5.480469 L 305 5.480469 L 305 470.6875 L 303 470.6875 Z M 303 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip8\">\n  <path d=\"M 415 5.480469 L 417 5.480469 L 417 470.6875 L 415 470.6875 Z M 415 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip9\">\n  <path d=\"M 40.046875 447 L 499.519531 447 L 499.519531 450 L 40.046875 450 Z M 40.046875 447 \"/>\n</clipPath>\n<clipPath id=\"clip10\">\n  <path d=\"M 40.046875 298 L 499.519531 298 L 499.519531 301 L 40.046875 301 Z M 40.046875 298 \"/>\n</clipPath>\n<clipPath id=\"clip11\">\n  <path d=\"M 40.046875 149 L 499.519531 149 L 499.519531 152 L 40.046875 152 Z M 40.046875 149 \"/>\n</clipPath>\n<clipPath id=\"clip12\">\n  <path d=\"M 136 5.480469 L 138 5.480469 L 138 470.6875 L 136 470.6875 Z M 136 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip13\">\n  <path d=\"M 247 5.480469 L 250 5.480469 L 250 470.6875 L 247 470.6875 Z M 247 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip14\">\n  <path d=\"M 359 5.480469 L 361 5.480469 L 361 470.6875 L 359 470.6875 Z M 359 5.480469 \"/>\n</clipPath>\n<clipPath id=\"clip15\">\n  <path d=\"M 470 5.480469 L 473 5.480469 L 473 470.6875 L 470 470.6875 Z M 470 5.480469 \"/>\n</clipPath>\n</defs>\n<g id=\"surface67\">\n<rect x=\"0\" y=\"0\" width=\"504\" height=\"504\" style=\"fill:rgb(100%,100%,100%);fill-opacity:1;stroke:none;\"/>\n<rect x=\"0\" y=\"0\" width=\"504\" height=\"504\" style=\"fill:rgb(100%,100%,100%);fill-opacity:1;stroke:none;\"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:round;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 0 0 L 504 0 L 504 504 L 0 504 Z M 0 0 \"/>\n<g clip-path=\"url(#clip1)\" clip-rule=\"nonzero\">\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(92.156863%,92.156863%,92.156863%);fill-opacity:1;\" d=\"M 40.046875 5.480469 L 498.519531 5.480469 L 498.519531 469.6875 L 40.046875 469.6875 Z M 40.046875 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip2)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 374.027344 L 498.519531 374.027344 \"/>\n</g>\n<g clip-path=\"url(#clip3)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 224.90625 L 498.519531 224.90625 \"/>\n</g>\n<g clip-path=\"url(#clip4)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 75.789062 L 498.519531 75.789062 \"/>\n</g>\n<g clip-path=\"url(#clip5)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 81.105469 469.6875 L 81.105469 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip6)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 192.664062 469.6875 L 192.664062 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip7)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 304.222656 469.6875 L 304.222656 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip8)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:0.711319;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 415.78125 469.6875 L 415.78125 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip9)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 448.585938 L 498.519531 448.585938 \"/>\n</g>\n<g clip-path=\"url(#clip10)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 299.46875 L 498.519531 299.46875 \"/>\n</g>\n<g clip-path=\"url(#clip11)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 40.046875 150.347656 L 498.519531 150.347656 \"/>\n</g>\n<g clip-path=\"url(#clip12)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 136.886719 469.6875 L 136.886719 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip13)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 248.445312 469.6875 L 248.445312 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip14)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 360.003906 469.6875 L 360.003906 5.480469 \"/>\n</g>\n<g clip-path=\"url(#clip15)\" clip-rule=\"nonzero\">\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(100%,100%,100%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 471.5625 469.6875 L 471.5625 5.480469 \"/>\n</g>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 60.886719 447.09375 L 102.566406 447.09375 L 102.566406 448.585938 L 60.886719 448.585938 Z M 60.886719 447.09375 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 102.566406 415.78125 L 144.246094 415.78125 L 144.246094 448.585938 L 102.566406 448.585938 Z M 102.566406 415.78125 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 144.246094 297.976562 L 185.925781 297.976562 L 185.925781 448.585938 L 144.246094 448.585938 Z M 144.246094 297.976562 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 185.925781 81.753906 L 227.605469 81.753906 L 227.605469 448.585938 L 185.925781 448.585938 Z M 185.925781 81.753906 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 227.605469 26.578125 L 269.285156 26.578125 L 269.285156 448.585938 L 227.605469 448.585938 Z M 227.605469 26.578125 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 269.285156 110.085938 L 310.964844 110.085938 L 310.964844 448.585938 L 269.285156 448.585938 Z M 269.285156 110.085938 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 310.964844 308.414062 L 352.644531 308.414062 L 352.644531 448.585938 L 310.964844 448.585938 Z M 310.964844 308.414062 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 352.640625 415.78125 L 394.320312 415.78125 L 394.320312 448.585938 L 352.640625 448.585938 Z M 352.640625 415.78125 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 394.320312 444.113281 L 436 444.113281 L 436 448.585938 L 394.320312 448.585938 Z M 394.320312 444.113281 \"/>\n<path style=\" stroke:none;fill-rule:nonzero;fill:rgb(34.901961%,34.901961%,34.901961%);fill-opacity:1;\" d=\"M 436 447.09375 L 477.679688 447.09375 L 477.679688 448.585938 L 436 448.585938 Z M 436 447.09375 \"/>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-1\" x=\"29.941406\" y=\"451.976562\"/>\n</g>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-2\" x=\"19.601562\" y=\"302.855469\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"24.772362\" y=\"302.855469\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"29.943162\" y=\"302.855469\"/>\n</g>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-3\" x=\"19.601562\" y=\"153.738281\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"24.772362\" y=\"153.738281\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"29.943162\" y=\"153.738281\"/>\n</g>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 37.308594 448.585938 L 40.046875 448.585938 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 37.308594 299.46875 L 40.046875 299.46875 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 37.308594 150.347656 L 40.046875 150.347656 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 136.886719 472.425781 L 136.886719 469.6875 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 248.445312 472.425781 L 248.445312 469.6875 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 360.003906 472.425781 L 360.003906 469.6875 \"/>\n<path style=\"fill:none;stroke-width:1.422638;stroke-linecap:butt;stroke-linejoin:round;stroke:rgb(20%,20%,20%);stroke-opacity:1;stroke-miterlimit:10;\" d=\"M 471.5625 472.425781 L 471.5625 469.6875 \"/>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-4\" x=\"127.582031\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-3\" x=\"130.678931\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-5\" x=\"135.849731\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"141.020531\" y=\"481.398438\"/>\n</g>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-1\" x=\"245.859375\" y=\"481.398438\"/>\n</g>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-3\" x=\"352.246094\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-5\" x=\"357.416894\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"362.587694\" y=\"481.398438\"/>\n</g>\n<g style=\"fill:rgb(30.196078%,30.196078%,30.196078%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph0-5\" x=\"463.804688\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"468.975487\" y=\"481.398438\"/>\n  <use xlink:href=\"#glyph0-1\" x=\"474.146287\" y=\"481.398438\"/>\n</g>\n<g style=\"fill:rgb(0%,0%,0%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph1-1\" x=\"250.113281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-2\" x=\"255.863281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-3\" x=\"259.060281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-2\" x=\"265.454281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-4\" x=\"268.651281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-1\" x=\"271.204281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-2\" x=\"276.954281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-4\" x=\"280.151281\" y=\"495.519531\"/>\n  <use xlink:href=\"#glyph1-5\" x=\"282.704281\" y=\"495.519531\"/>\n</g>\n<g style=\"fill:rgb(0%,0%,0%);fill-opacity:1;\">\n  <use xlink:href=\"#glyph2-1\" x=\"13.863281\" y=\"251.648438\"/>\n  <use xlink:href=\"#glyph2-2\" x=\"13.863281\" y=\"245.898438\"/>\n  <use xlink:href=\"#glyph2-3\" x=\"13.863281\" y=\"239.504437\"/>\n  <use xlink:href=\"#glyph2-4\" x=\"13.863281\" y=\"233.110437\"/>\n  <use xlink:href=\"#glyph2-5\" x=\"13.863281\" y=\"226.716437\"/>\n</g>\n</g>\n</svg>\n"
     },
     "execution_count": 21,
     "metadata": {
      "image/svg+xml": {
       "isolated": true
      }
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "ggplot(simulated_data, aes( x = statistic)) + geom_histogram( bins = 10 )"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 23,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "0.766"
      ]
     },
     "execution_count": 23,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# find percentile of observed stat:\n",
    "sum( simulated_data$statistic <= observed_diff ) / 1000\n",
    "\n",
    "# area to the left is 76.6th percentile"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 25,
   "metadata": {
    "collapsed": false
   },
   "outputs": [
    {
     "data": {
      "text/html": [
       "0.234"
      ]
     },
     "execution_count": 25,
     "metadata": {
     },
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# p-value\n",
    "1-sum(simulated_data$statistic <= observed_diff) / 1000\n",
    "\n",
    "# area to the right is 23.4th percentile"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "R (R-Project)",
   "language": "r",
   "name": "ir"
  },
  "language_info": {
   "codemirror_mode": "r",
   "file_extension": ".r",
   "mimetype": "text/x-r-source",
   "name": "R",
   "pygments_lexer": "r",
   "version": "3.4.4"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 0
}