This is the very first part of the big project of my BeCode training.
The real estate company "ImmoEliza" wants to create a machine learning model to make price predictions on real estate sales in Belgium.
We all starting from scratch and, of course, the first step is to collect data and build a dataset.
The purpose of this code is to retrieve precise information about houses and apartments for sale on Immoweb, and to register them on a single csv.
For this first group project, we were 3 :
Since it was the first group project, we decide to all work together, on one computer, and discuss everything we wanted to do and how to do it, instead of splitting the work between us. Like this, we learned differents methods to do the same thing, and choose the one's we agreed on.
Use a python library to collect as much datas as possible.
With this chalenge, we proved that we are :
- Able to scrape a website
- Able to build a dataset from scratch
- Able to implement a strategy to collect as much datas as possible.
- Deadline: 07/09/2020 17:00
- Duration: 3 days
- Situation: 3rd week of the BeCode's formation about AI and Machine learning
Updates
10/09/20 : A real README, at last.
All the work done was made during the BeCode's AI's formation 2020-2021.
18285 entries with 23 columns
You can see the data scrapped in the "data branch" or just here
COMPLETE