Ames house prices
Ames ia The Ames Housing dataset was compiled by Dean De Cock for use in data science.
With 81 predictors...
2017
11/11
 
  Partecipanti 72 Sottomissioni 1892  
 

The Ames Housing dataset was compiled by Dean De Cock for use in data science.

With 81 predictors describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.

See DeCook “Ames Housing dataset” description in Data Sets.
See Kaggle competition “House Prices: Advanced Regression Techniques”

Submissions are evaluated on Root-Mean-Squared-Error (RMSE) between the logarithm of the predicted value and the logarithm of the observed sales price. (Taking logs means that errors in predicting expensive houses and cheap houses will affect the result equally.)

RMSE = sqrt( mean( ( log(y) – log(haty) )^2 ) )

During the competition, the leaderboard displays your partial score, which is the RMSE for 735 (random) houses of the test set.
At the end of the contest, the leaderboard will display the final score, which is the RMSE for the remaining 735 houses of the test set. The final score will determine the final winner. This method prevents users from overfitting to the leaderboard.

Maximum team size = 3

train <- read.csv(“train.csv”, stringsAsFactors=F)
test <- read.csv(“test.csv”, , stringsAsFactors=F)

fit = lm(log(SalePrice) ~ Yr.Sold + Mo.Sold + Bedroom.AbvGr + Lot.Area, data=train)
yhat = exp( predict(fit, newdata=test) )

write.table(file=“mySubmission.txt”, yhat, row.names = FALSE, col.names = FALSE)

See datadocumentation.txt – full description of the data, prepared by Dean De Cock

The training data has 82 columns which include 23 nominal, 23 ordinal, 14 discrete, and 20 continuous variables (and 2 additional observation identifiers). Here’s a brief version of what you’ll find in the data description file.

SalePrice – the property’s sale price in dollars. This is the target variable that you’re trying to predict.

Order: Observation number
PID: Parcel identification number
MS.SubClass: The building class
MS.Zoning: The general zoning classification
Lot.Frontage: Linear feet of street connected to property
Lot.Area: Lot size in square feet
Street: Type of road access
Alley: Type of alley access
Lot.Shape: General shape of property
Land.Contour: Flatness of the property
Utilities: Type of utilities available
Lot.Config: Lot configuration
Land.Slope: Slope of property
Neighborhood: Physical locations within Ames city limits
Condition.1: Proximity to main road or railroad
Condition.2: Proximity to main road or railroad (if a second is present)
Bldg.Type: Type of dwelling
House.Style: Style of dwelling
Overall.Qual: Overall material and finish quality
Overall.Cond: Overall condition rating
Year.Built: Original construction date
Year.Remod.Add: Remodel date
Roof.Style: Type of roof
Roof.Matl: Roof material
Exterior.1st: Exterior covering on house
Exterior.2nd: Exterior covering on house (if more than one material)
Mas.Vnr.Type: Masonry veneer type
Mas.Vnr.Area: Masonry veneer area in square feet
Exter.Qual: Exterior material quality
Exter.Cond: Present condition of the material on the exterior
Foundation: Type of foundation
Bsmt.Qual: Height of the basement
Bsmt.Cond: General condition of the basement
Bsmt.Exposure: Walkout or garden level basement walls
Bsmt.Fin.Type.1: Quality of basement finished area
Bsmt.Fin.SF.1: Type 1 finished square feet
Bsmt.Fin.Type.2: Quality of second finished area (if present)
Bsmt.Fin.SF.2: Type 2 finished square feet
Bsmt.Unf.SF: Unfinished square feet of basement area
Total.Bsmt.SF: Total square feet of basement area
Heating: Type of heating
Heating.QC: Heating quality and condition
Central.Air: Central air conditioning
Electrical: Electrical system
1st.Flr.SF: First Floor square feet
2nd.Flr.SF: Second floor square feet
Low.Qual.Fin.SF: Low quality finished square feet (all floors)
Gr.Liv.Area: Above grade (ground) living area square feet
Bsmt.Full.Bath: Basement full bathrooms
Bsmt.Half.Bath: Basement half bathrooms
Full.Bath: Full bathrooms above grade
Half.Bath: Half baths above grade
Bedroom.AbvGr: Number of bedrooms above basement level
Kitchen.AbvGr: Number of kitchens
Kitchen.Qual: Kitchen quality
Tot.Rms.AbvGrd: Total rooms above grade (does not include bathrooms)
Functional: Home functionality rating
Fireplaces: Number of fireplaces
Fireplace.Qu: Fireplace quality
Garage.Type: Garage location
Garage.Yr.Blt: Year garage was built
Garage.Finish: Interior finish of the garage
Garage.Cars: Size of garage in car capacity
Garage.Area: Size of garage in square feet
Garage.Qual: Garage quality
Garage.Cond: Garage condition
Paved.Drive: Paved driveway
Wood.Deck.SF: Wood deck area in square feet
Open.Porch.SF: Open porch area in square feet
Enclosed.Porch: Enclosed porch area in square feet
3Ssn.Porch: Three season porch area in square feet
Screen.Porch: Screen porch area in square feet
Pool.Area: Pool area in square feet
Pool.QC: Pool quality
Fence: Fence quality
Misc.Feature: Miscellaneous feature not covered in other categories
Misc.Val: $Value of miscellaneous feature
Mo.Sold: Month Sold
Yr.Sold: Year Sold
Sale.Type: Type of sale
Sale.Condition: Condition of sale




Training set train.csv
600 KB
Test set test.csv
600 KB
data documentation datadocumentation.txt
20 KB
Ames Housing dataset decock.pdf
400 KB
Per partecipare bisogna prima autenticarsi
# Nome Punteggio Prove Ultima prova
1 t.comoglio FINALE 12.17% 35 08.11.2017
13:27
2 m.ressico FINALE 12.17% 14 08.11.2017
13:32
3 m.trabucchi1 FINALE 12.46% 6 06.11.2017
18:41
4 nicolo-p FINALE 12.46% 6 06.11.2017
07:08
5 d.lacaj FINALE 12.46% 4 06.11.2017
08:10
6 davide.stenner FINALE 12.51% 31 03.11.2017
20:26
7 f.devecchi5 FINALE 12.51% 5 04.11.2017
08:10
8 fumagalliroberta94 FINALE 12.51% 1 03.11.2017
21:02
9 petrunistorica FINALE 12.53% 1 10.11.2017
12:32
10 thelaw92iphone FINALE 12.68% 19 10.11.2017
11:19
11 m.rosa18 FINALE 12.74% 74 14.11.2016
16:03
12 e.zucca6 FINALE 12.81% 14 08.11.2017
17:58
13 a.pascali FINALE 12.81% 1 08.11.2017
21:49
14 enricocartella FINALE 13.02% 57 31.10.2017
20:59
15 beatrice.santoro06 FINALE 13.02% 3 02.11.2017
09:28
16 a.valsecchi20 FINALE 13.02% 1 02.11.2017
20:52
17 m.mercandelli5 FINALE 13.54% 20 08.11.2017
10:04
18 g.ronco1 FINALE 13.54% 7 06.11.2017
15:08
19 gpolp FINALE 13.56% 50 14.11.2016
20:33
20 f.facciuto FINALE 13.56% 28 14.11.2016
23:19
21 e.fabrizi1 FINALE 13.56% 25 15.11.2016
08:13
22 g.tornaghi1 FINALE 13.56% 12 08.11.2017
07:55
23 e.pasin FINALE 13.56% 1 08.11.2017
17:08
24 sonia_cucchi FINALE 13.56% 1 06.11.2017
20:33
25 l.granata1 FINALE 13.59% 16 08.11.2017
16:56
26 m.cerliani FINALE 13.59% 10 08.11.2017
16:52
27 AVON VALENTINO FINALE 13.69% 2 05.11.2016
18:05
28 gramaticamarco FINALE 14.01% 156 15.11.2016
16:25
29 r.lavelli1 FINALE 14.01% 121 15.11.2016
16:10
30 d.gualtieri5 FINALE 14.01% 56 15.11.2016
16:12
31 s.dalessio FINALE 14.04% 70 15.11.2016
12:52
32 d.caldara1 FINALE 14.04% 7 14.11.2016
16:23
33 f.cordaro2 FINALE 14.06% 7 07.11.2017
08:42
34 f.roberti FINALE 14.06% 1 08.11.2017
20:03
35 g.maino2 FINALE 14.06% 1 09.11.2017
09:24
36 riccardo.parviero FINALE 14.07% 43 15.11.2016
15:52
37 d.marzagora FINALE 14.07% 23 15.11.2016
15:58
38 elisa.lucci FINALE 14.07% 22 15.11.2016
16:01
39 D.Pagani FINALE 14.09% 175 15.11.2016
16:38
40 f.giorgini FINALE 14.09% 45 15.11.2016
16:42
41 A.Pizzocri FINALE 14.09% 6 15.11.2016
16:43
42 e.gabanelli FINALE 14.11% 11 08.11.2017
09:33
43 fasolini.a50 FINALE 14.15% 8 08.11.2017
17:23
44 g.asti FINALE 14.52% 25 08.11.2017
23:18
45 s.offredi2 FINALE 14.52% 2 08.11.2017
23:21
46 m.fogliata FINALE 14.59% 7 15.11.2016
13:30
47 chiara.gorla FINALE 14.98% 61 15.11.2016
09:42
48 f.pirola13 FINALE 14.98% 5 15.11.2016
11:46
49 a.galli33 FINALE 14.98% 2 15.11.2016
15:30
50 ruud.gullit FINALE 15.07% 22 06.11.2017
10:55
51 v.angius1 FINALE 15.22% 215 14.11.2016
08:36
52 n.vanderhart FINALE 15.22% 40 14.11.2016
08:42
53 a.guevara FINALE 16.12% 16 14.11.2016
20:34
54 g.andromede FINALE 16.61% 35 15.11.2016
13:30
55 l.sella FINALE 16.61% 35 15.11.2016
13:17
56 a.brambilla73 FINALE 16.61% 9 15.11.2016
12:23
57 m.fumagalli68 FINALE 17.04% 53 14.11.2016
15:58
58 m.monaldi FINALE 17.04% 35 15.11.2016
10:41
59 m.antoniazzi1 FINALE 17.95% 15 09.11.2017
10:30
60 g.caccia3 FINALE 17.95% 2 09.11.2017
09:30
61 f.vitti1 FINALE 20.21% 36 15.11.2016
17:57
62 r.monetti1 FINALE 20.21% 28 15.11.2016
15:42
63 c.galimberti19 FINALE 28.75% 5 10.11.2016
13:20
64 solari.aldo FINALE 39.70% 36 01.11.2017
13:46
65 h.akyirefi FINALE 39.70% 8 13.11.2016
17:25