
Exploratory Data Analysis in Lake Lure, NC.

Jason Ismail

Updated: May 12, 2021

This project involved quite a bit of cleanup. I was working with an Airbnb dataset of home rentals near Lake Lure, NC, and much of the information in it did not apply to what I was interested in for this project.


My Notebook can be found here:



Photo Courtesy of Romanticasheville.com


I started out with 74 features in my dataset. I looked over every column and made quite a few tough choices about what to keep. The dataset had quite a few null values, and many of the columns were unrelated to my topic.
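Here is a minimal sketch of how that kind of column pruning could look in pandas. The tiny table, the column names, the 50% null threshold, and the helper name `prune_columns` are all made up for illustration and are not taken from the notebook.

```python
import pandas as pd


def prune_columns(df, max_null_fraction=0.5, unrelated=()):
    """Drop columns that are mostly empty or explicitly listed as unrelated.

    Hypothetical helper: the threshold and the list of unrelated columns
    are assumptions, not the choices made in the original notebook.
    """
    # Fraction of missing values in each column (0.0 to 1.0).
    null_fraction = df.isna().mean()
    mostly_null = null_fraction[null_fraction > max_null_fraction].index
    # Only drop "unrelated" columns that actually exist in the frame.
    to_drop = set(mostly_null) | (set(unrelated) & set(df.columns))
    return df.drop(columns=sorted(to_drop))


# Made-up stand-in for the listings table.
listings = pd.DataFrame({
    "price": [120.0, 95.0, 310.0, 150.0],
    "description": ["Lakefront cabin", "Cozy studio", None, "Mountain view"],
    "license": [None, None, None, "A-1"],   # 75% missing
    "scrape_id": [1, 1, 1, 1],              # not useful for this question
})

cleaned = prune_columns(listings, max_null_fraction=0.5, unrelated=["scrape_id"])
print(list(cleaned.columns))
```

A fixed threshold is only a starting point. The tough choices still come from reading each column and deciding whether it matters for the question at hand.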




“I think it is important when possible for a Data Scientist to do their own EDA. That way they can build a deeper understanding of the dataset.”

For this project I used my skills as a mathematician to help me decide which values should be considered outliers.

Here I calculated the quartiles and the interquartile range (IQR). The rule of thumb is that values more than 1.5 times the IQR below the first quartile or above the third quartile are a good place to start looking for outliers. In this case I only considered the upper bound for prices, because a low price could mean a hidden deal.
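The upper-bound rule can be sketched with nothing but the Python standard library. The prices below are invented for the example, and `iqr_upper_bound` is a hypothetical helper name. The same numbers could be computed with pandas or NumPy.

```python
from statistics import quantiles


def iqr_upper_bound(values, k=1.5):
    """Return Q3 + k * IQR, the usual upper fence for flagging outliers."""
    # method="inclusive" interpolates linearly between data points,
    # which matches the default quantile behaviour in NumPy and pandas.
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    return q3 + k * (q3 - q1)


# Made-up nightly prices, not values from the actual dataset.
prices = [80, 95, 110, 120, 135, 150, 175, 200, 950]

upper = iqr_upper_bound(prices)
flagged = [p for p in prices if p > upper]

print(upper)    # 272.5  (Q1 = 110, Q3 = 175, IQR = 65)
print(flagged)  # [950]
```

Only the upper fence is checked here, which mirrors the decision to leave low prices alone.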


There were some real hidden gems in this dataset.



I found some awesome reviews as well as descriptions of the properties from the owners.


So I cleaned the text up using some tricks I had picked up while studying Natural Language Processing.
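The post does not list the exact cleanup steps, so the following is only a sketch of common ones, under my own assumptions: decode HTML entities, strip tags, lowercase, drop punctuation, and collapse whitespace. The function name `clean_text` and the sample string are hypothetical.

```python
import html
import re


def clean_text(raw):
    """Normalize a free-text listing description for keyword matching."""
    text = html.unescape(raw)                 # "&amp;" -> "&"
    text = re.sub(r"<[^>]+>", " ", text)      # strip tags such as <br />
    text = text.lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)  # punctuation -> spaces
    return re.sub(r"\s+", " ", text).strip()  # collapse repeated whitespace


sample = "Right on the LAKE!<br />Private dock &amp; kayaks."
print(clean_text(sample))  # right on the lake private dock kayaks
```

Replacing punctuation with spaces keeps neighbouring words from being glued together, at the cost of splitting contractions like "don't" into two tokens.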



Once the data was cleaned up, I used fuzzy matching to see which descriptions featured keywords indicating that the property was on the waterfront. After all, owners would typically brag about the lake if they were in fact situated on it. This helped me zero in on prospective homes to visit.
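One way to sketch this kind of keyword search is with `difflib.SequenceMatcher` from the standard library. This is a stand-in technique, not necessarily the fuzzy-matching library used in the notebook. The keyword list, the 0.85 threshold, and the function names are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical keyword list; the notebook's actual keywords are not given.
WATERFRONT_KEYWORDS = ("lakefront", "waterfront", "lakeside", "dock")


def candidate_terms(description):
    """Single words plus adjacent word pairs joined together.

    Joining pairs lets "lake front" line up with the keyword "lakefront".
    Assumes the description has already had punctuation removed.
    """
    words = description.lower().split()
    pairs = [a + b for a, b in zip(words, words[1:])]
    return words + pairs


def looks_waterfront(description, keywords=WATERFRONT_KEYWORDS, threshold=0.85):
    """True if any term is at least `threshold` similar to any keyword."""
    return any(
        SequenceMatcher(None, term, keyword).ratio() >= threshold
        for term in candidate_terms(description)
        for keyword in keywords
    )


print(looks_waterfront("cozy lakefrnt cabin"))          # typo still matches
print(looks_waterfront("steps from the lake front"))    # split word matches
print(looks_waterfront("mountain cabin with hot tub"))  # no lake keywords
```

A match is a hint rather than proof. A listing that says "no dock" would still be flagged, so the results work best as a shortlist to read through by hand.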




