Práctica de Maraton - Ipynb - Colaboratory - Alejandro
Práctica de Maraton - Ipynb - Colaboratory - Alejandro
Práctica de Maraton - Ipynb - Colaboratory - Alejandro
ipynb - Colaboratory
Name Category
87 rows × 2 columns
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 87 entries, 0 to 86
Data columns (total 10 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 id 87 non-null int64
1 Marathon 87 non-null object
2 Name 87 non-null object
3 Category 81 non-null object
4 km4week 87 non-null float64
5 sp4week 87 non-null float64
6 CrossTraining 13 non-null object
7 Wall21 87 non-null object
8 MarathonTime 87 non-null float64
9 CATEGORY 87 non-null object
dtypes: float64(3), int64(1), object(6)
memory usage: 6.9+ KB
#descriptivo de datos
datos_maraton.describe()
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 1/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
#grafica de histogramas
datos_maraton.hist()
#borrar columnaas
datos_maraton = datos_maraton.drop(columns=['Name'])
datos_maraton = datos_maraton.drop(columns=['id'])
datos_maraton = datos_maraton.drop(columns=['Marathon'])
datos_maraton = datos_maraton.drop(columns=['CATEGORY'])
Category 6
km4week 0
sp4week 0
CrossTraining 74
Wall21 0
MarathonTime 0
dtype: int64
#visualizar datos
datos_maraton
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 2/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
87 rows × 6 columns
81 rows × 6 columns
datos_maraton['CrossTraining'] .unique()
datos_maraton.replace(valores_cross, inplace=True)
datos_maraton
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 3/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
<ipython-input-29-1202fe270fef>:1: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
81 rows × 6 columns
datos_maraton.replace(valores_category,inplace=True)
<ipython-input-31-543da5dd9897>:1: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead
datos_maraton
81 rows × 6 columns
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 4/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
datos_maraton = datos_maraton.query('sp4week<1000')
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 5/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
64 rows × 6 columns
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 6/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
etiquetas_entrenamiento
54 3.47
28 3.15
31 3.19
84 3.94
47 3.35
...
55 3.50
20 2.99
79 3.90
8 2.83
13 2.88
Name: MarathonTime, Length: 64, dtype: float64
etiquetas_test
9 2.86
12 2.88
21 3.04
26 3.12
38 3.24
39 3.25
41 3.28
46 3.33
48 3.36
49 3.39
62 3.56
68 3.65
69 3.67
71 3.69
75 3.78
83 3.93
Name: MarathonTime, dtype: float64
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 7/8
16/3/24, 22:43 Práctica de maraton.ipynb - Colaboratory
#entrenar el modelo
from sklearn.linear_model import LinearRegression
modelo = LinearRegression()
modelo.fit(datos_entrenamiento, etiquetas_entrenamiento)
▾ LinearRegression
LinearRegression()
#comparacion de los valores de nuestros valores de testo con los datos de prediccion
import numpy as np
from sklearn.metrics import mean_squared_error
error = np.sqrt(mean_squared_error(etiquetas_test, predicciones))
print("Error porcentual: %f" % (error*100))
#nueva prediccion
nuevo_corredor = pd.DataFrame(np.array([[1,400,15,0,1.4]]),columns=['Category','km4week','sp4week','CrossTraining','Wall21'])
nuevo_corredor
array([2.35457459])
https://colab.research.google.com/drive/1JldUTy5iVfGf7htgpg5ZK01IDAeZ0Hr1#scrollTo=JyMHeqAtt410&uniqifier=2&printMode=true 8/8