{
"cells": [
{
"cell_type": "code",
"execution_count": 3,
"metadata": {
"collapsed": false
},
"outputs": [
],
"source": [
"import pandas as pd"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/html": [
"
\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" ID | \n",
" Name | \n",
" Sex | \n",
" Age | \n",
" Height | \n",
" Weight | \n",
" Team | \n",
" NOC | \n",
" Games | \n",
" Year | \n",
" Season | \n",
" City | \n",
" Sport | \n",
" Event | \n",
" Medal | \n",
"
\n",
" \n",
" \n",
" \n",
" 0 | \n",
" 1 | \n",
" A Dijiang | \n",
" M | \n",
" 24.0 | \n",
" 180.0 | \n",
" 80.0 | \n",
" China | \n",
" CHN | \n",
" 1992 Summer | \n",
" 1992 | \n",
" Summer | \n",
" Barcelona | \n",
" Basketball | \n",
" Basketball Men's Basketball | \n",
" NaN | \n",
"
\n",
" \n",
" 1 | \n",
" 2 | \n",
" A Lamusi | \n",
" M | \n",
" 23.0 | \n",
" 170.0 | \n",
" 60.0 | \n",
" China | \n",
" CHN | \n",
" 2012 Summer | \n",
" 2012 | \n",
" Summer | \n",
" London | \n",
" Judo | \n",
" Judo Men's Extra-Lightweight | \n",
" NaN | \n",
"
\n",
" \n",
" 2 | \n",
" 3 | \n",
" Gunnar Nielsen Aaby | \n",
" M | \n",
" 24.0 | \n",
" NaN | \n",
" NaN | \n",
" Denmark | \n",
" DEN | \n",
" 1920 Summer | \n",
" 1920 | \n",
" Summer | \n",
" Antwerpen | \n",
" Football | \n",
" Football Men's Football | \n",
" NaN | \n",
"
\n",
" \n",
" 3 | \n",
" 4 | \n",
" Edgar Lindenau Aabye | \n",
" M | \n",
" 34.0 | \n",
" NaN | \n",
" NaN | \n",
" Denmark/Sweden | \n",
" DEN | \n",
" 1900 Summer | \n",
" 1900 | \n",
" Summer | \n",
" Paris | \n",
" Tug-Of-War | \n",
" Tug-Of-War Men's Tug-Of-War | \n",
" Gold | \n",
"
\n",
" \n",
" 4 | \n",
" 5 | \n",
" Christine Jacoba Aaftink | \n",
" F | \n",
" 21.0 | \n",
" 185.0 | \n",
" 82.0 | \n",
" Netherlands | \n",
" NED | \n",
" 1988 Winter | \n",
" 1988 | \n",
" Winter | \n",
" Calgary | \n",
" Speed Skating | \n",
" Speed Skating Women's 500 metres | \n",
" NaN | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 4,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data = pd.read_csv('athlete_events.csv')\n",
"data.head()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__1. How old were the youngest male and female participants of the 1996 Olympics?__\n",
"\n",
"- 16 and 15\n",
"- 14 and 12 \n",
"- 16 and 12\n",
"- 13 and 11"
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"(14.0, 12.0)"
]
},
"execution_count": 5,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Sex'] == 'M') & (data['Games'] == '1996 Summer')]['Age'].min(), data[(data['Sex'] == 'F') & (data['Games'] == '1996 Summer')]['Age'].min()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__2. What was the percentage of male gymnasts among all the male participants of the 2000 Olympics? Consider only Gymnastics as a target sport. Round the answer to the first decimal.__\n",
"\n",
"Hint: here and further if needed drop duplicated sportsmen to count only unique ones.\n",
"\n",
"- 0.2\n",
"- 1.5\n",
"- 2.5\n",
"- 7.7"
]
},
{
"cell_type": "code",
"execution_count": 14,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"1.5"
]
},
"execution_count": 14,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"round(float(data[(data['Sport']=='Gymnastics') & (data['Sex'] == 'M') & (data['Games'] == '2000 Summer')]['Name'].nunique())/\n",
" float(data[(data['Sex'] == 'M') & (data['Games'] == '2000 Summer')]['Name'].nunique())*100, 1)"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"\n",
"__3. What are the mean and standard deviation of height for female basketball players participated in the 2000 Olympics? Round the answer to the first decimal.__\n",
"\n",
"- 178.5 and 7.2\n",
"- 179.4 and 10\n",
"- 180.7 and 6.7\n",
"- 182.4 and 9.1"
]
},
{
"cell_type": "code",
"execution_count": 17,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"count 142.000000\n",
"mean 182.387324\n",
"std 9.139462\n",
"min 162.000000\n",
"25% 175.000000\n",
"50% 182.000000\n",
"75% 190.000000\n",
"max 213.000000\n",
"Name: Height, dtype: float64"
]
},
"execution_count": 17,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Sex']=='F') & (data['Sport']=='Basketball') & (data['Games'] == '2000 Summer')]['Height'].describe()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__4. Find a sportsperson participated in the 2002 Olympics, with the highest weight among other participants of the same Olympics. What sport did he or she do?__\n",
"\n",
"- Judo\n",
"- Bobsleigh\n",
"- Weightlifting\n",
"- Boxing"
]
},
{
"cell_type": "code",
"execution_count": 18,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"count 4062.000000\n",
"mean 71.197070\n",
"std 13.257178\n",
"min 42.000000\n",
"25% 61.000000\n",
"50% 69.000000\n",
"75% 80.000000\n",
"max 123.000000\n",
"Name: Weight, dtype: float64"
]
},
"execution_count": 18,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[data['Games'] == '2002 Winter']['Weight'].describe()"
]
},
{
"cell_type": "code",
"execution_count": 20,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/html": [
"\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" ID | \n",
" Name | \n",
" Sex | \n",
" Age | \n",
" Height | \n",
" Weight | \n",
" Team | \n",
" NOC | \n",
" Games | \n",
" Year | \n",
" Season | \n",
" City | \n",
" Sport | \n",
" Event | \n",
" Medal | \n",
"
\n",
" \n",
" \n",
" \n",
" 99154 | \n",
" 50171 | \n",
" Emmanuel Hostache | \n",
" M | \n",
" 26.0 | \n",
" 190.0 | \n",
" 123.0 | \n",
" France | \n",
" FRA | \n",
" 2002 Winter | \n",
" 2002 | \n",
" Winter | \n",
" Salt Lake City | \n",
" Bobsleigh | \n",
" Bobsleigh Men's Two | \n",
" NaN | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 20,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Games'] == '2002 Winter') & (data['Weight'] == 123)]"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__5. How many times did Pawe Abratkiewicz participate in the Olympics held in different years?__\n",
"\n",
"- 0\n",
"- 1\n",
"- 2\n",
"- 3"
]
},
{
"cell_type": "code",
"execution_count": 23,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"3"
]
},
"execution_count": 23,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[data['Name'] == 'Pawe Abratkiewicz']['Games'].nunique()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__6. How many silver medals in tennis did sportspeople from the Australia team win at the 2000 Olympics? Count every medal from every sportsperson.__\n",
"\n",
"- 0\n",
"- 1\n",
"- 2\n",
"- 3"
]
},
{
"cell_type": "code",
"execution_count": 33,
"metadata": {
"collapsed": false,
"scrolled": true
},
"outputs": [
{
"data": {
"text/html": [
"\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" ID | \n",
" Name | \n",
" Sex | \n",
" Age | \n",
" Height | \n",
" Weight | \n",
" Team | \n",
" NOC | \n",
" Games | \n",
" Year | \n",
" Season | \n",
" City | \n",
" Sport | \n",
" Event | \n",
" Medal | \n",
"
\n",
" \n",
" \n",
" \n",
" 262820 | \n",
" 131504 | \n",
" Todd Andrew Woodbridge | \n",
" M | \n",
" 29.0 | \n",
" 178.0 | \n",
" 75.0 | \n",
" Australia | \n",
" AUS | \n",
" 2000 Summer | \n",
" 2000 | \n",
" Summer | \n",
" Sydney | \n",
" Tennis | \n",
" Tennis Men's Doubles | \n",
" Silver | \n",
"
\n",
" \n",
" 262831 | \n",
" 131511 | \n",
" Mark Raymond Woodforde | \n",
" M | \n",
" 34.0 | \n",
" 183.0 | \n",
" 80.0 | \n",
" Australia | \n",
" AUS | \n",
" 2000 Summer | \n",
" 2000 | \n",
" Summer | \n",
" Sydney | \n",
" Tennis | \n",
" Tennis Men's Doubles | \n",
" Silver | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 33,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Sport']=='Tennis') & (data['Games'] == '2000 Summer') & (data['Medal'] =='Silver') & (data['Team'] == 'Australia')]"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__7. Is it true that Switzerland won fewer medals than Serbia at the 2016 Olympics? Do not consider NaN values in Medal column.__\n",
"\n",
"- Yes\n",
"- No"
]
},
{
"cell_type": "code",
"execution_count": 36,
"metadata": {
"collapsed": false,
"scrolled": true
},
"outputs": [
],
"source": [
"ds = data[(data['Team'].isin(['Serbia', 'Switzerland'])) & (data['Games'] =='2016 Summer') & (data['Medal'].isin(['Gold', 'Silver', 'Bronze']))]"
]
},
{
"cell_type": "code",
"execution_count": 39,
"metadata": {
"collapsed": false,
"scrolled": true
},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"WARNING: Some output was deleted.\n"
]
}
],
"source": [
"ds"
]
},
{
"cell_type": "code",
"execution_count": 38,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/html": [
"\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" count | \n",
" unique | \n",
" top | \n",
" freq | \n",
"
\n",
" \n",
" Team | \n",
" | \n",
" | \n",
" | \n",
" | \n",
"
\n",
" \n",
" \n",
" \n",
" Serbia | \n",
" 54 | \n",
" 3 | \n",
" Silver | \n",
" 27 | \n",
"
\n",
" \n",
" Switzerland | \n",
" 11 | \n",
" 3 | \n",
" Gold | \n",
" 6 | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 38,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"ds.groupby('Team')['Medal'].describe()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__8. What age category did the fewest and the most participants of the 2014 Olympics belong to?__\n",
"\n",
"- [45-55] and [25-35) correspondingly\n",
"- [45-55] and [15-25) correspondingly\n",
"- [35-45] and [25-35) correspondingly\n",
"- [45-55] and [35-45) correspondingly"
]
},
{
"cell_type": "code",
"execution_count": 49,
"metadata": {
"collapsed": false,
"scrolled": true
},
"outputs": [
{
"data": {
"text/plain": [
"(5, 266)"
]
},
"execution_count": 49,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Year']==2014) & (data['Age']>=45) & (data['Age']<=55)]['Name'].nunique(), data[(data['Year']==2014) & (data['Age']>=35) & (data['Age']<=45)]['Name'].nunique()"
]
},
{
"cell_type": "code",
"execution_count": 50,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"(1397, 1193, 150)"
]
},
"execution_count": 50,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Year']==2014) & (data['Age']>=25) & (data['Age']<35)]['Name'].nunique(), data[(data['Year']==2014) & (data['Age']>=15) & (data['Age']<25)]['Name'].nunique(), data[(data['Year']==2014) & (data['Age']>=35) & (data['Age']<45)]['Name'].nunique()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__9. Is it true that there were Summer Olympics held in Lake Placid? Is it true that there were Winter Olympics held in Sankt Moritz?__\n",
"\n",
"- Yes, Yes\n",
"- Yes, No\n",
"- No, Yes\n",
"- No, No"
]
},
{
"cell_type": "code",
"execution_count": 56,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/html": [
"\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" ID | \n",
" Age | \n",
" Height | \n",
" Weight | \n",
" Year | \n",
"
\n",
" \n",
" \n",
" \n",
" count | \n",
" 0.0 | \n",
" 0.0 | \n",
" 0.0 | \n",
" 0.0 | \n",
" 0.0 | \n",
"
\n",
" \n",
" mean | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" std | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" min | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" 25% | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" 50% | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" 75% | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
" max | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
" NaN | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 56,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Season'] == 'Summer') & (data['City'].str.startswith('Lake Placid'))].describe()"
]
},
{
"cell_type": "code",
"execution_count": 57,
"metadata": {
"collapsed": false,
"scrolled": true
},
"outputs": [
{
"data": {
"text/html": [
"\n",
"\n",
"
\n",
" \n",
" \n",
" | \n",
" ID | \n",
" Age | \n",
" Height | \n",
" Weight | \n",
" Year | \n",
"
\n",
" \n",
" \n",
" \n",
" count | \n",
" 1657.000000 | \n",
" 1563.000000 | \n",
" 238.000000 | \n",
" 159.000000 | \n",
" 1657.000000 | \n",
"
\n",
" \n",
" mean | \n",
" 65513.748944 | \n",
" 26.346769 | \n",
" 173.155462 | \n",
" 69.471698 | \n",
" 1940.975256 | \n",
"
\n",
" \n",
" std | \n",
" 37235.901480 | \n",
" 5.827111 | \n",
" 7.499437 | \n",
" 8.586722 | \n",
" 9.550020 | \n",
"
\n",
" \n",
" min | \n",
" 529.000000 | \n",
" 15.000000 | \n",
" 155.000000 | \n",
" 45.000000 | \n",
" 1928.000000 | \n",
"
\n",
" \n",
" 25% | \n",
" 32961.000000 | \n",
" 22.000000 | \n",
" 168.000000 | \n",
" 65.000000 | \n",
" 1928.000000 | \n",
"
\n",
" \n",
" 50% | \n",
" 63990.000000 | \n",
" 26.000000 | \n",
" 173.000000 | \n",
" 71.000000 | \n",
" 1948.000000 | \n",
"
\n",
" \n",
" 75% | \n",
" 97263.000000 | \n",
" 29.000000 | \n",
" 178.000000 | \n",
" 74.000000 | \n",
" 1948.000000 | \n",
"
\n",
" \n",
" max | \n",
" 135466.000000 | \n",
" 54.000000 | \n",
" 211.000000 | \n",
" 103.000000 | \n",
" 1948.000000 | \n",
"
\n",
" \n",
"
\n",
"
"
]
},
"execution_count": 57,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Season'] == 'Winter') & (data['City'].str.startswith('Sankt Moritz'))].describe()"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": false
},
"source": [
"__10. What is the absolute difference between the number of unique sports at the 1996 Olympics and 2016 Olympics?__\n",
"\n",
"- 3\n",
"- 10\n",
"- 27\n",
"- 34"
]
},
{
"cell_type": "code",
"execution_count": 60,
"metadata": {
"collapsed": false
},
"outputs": [
{
"data": {
"text/plain": [
"(31, 34)"
]
},
"execution_count": 60,
"metadata": {
},
"output_type": "execute_result"
}
],
"source": [
"data[(data['Year'] == 1996)]['Sport'].nunique(), data[(data['Year'] == 2016)]['Sport'].nunique()"
]
},
{
"cell_type": "code",
"execution_count": 0,
"metadata": {
"collapsed": false
},
"outputs": [
],
"source": [
]
}
],
"metadata": {
"kernelspec": {
"display_name": "SageMath (stable)",
"language": "sagemath",
"name": "sagemath"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 2
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython2",
"version": "2.7.15"
}
},
"nbformat": 4,
"nbformat_minor": 0
}