This aim of this project is to clean and format a dataset containing information about shark attacks. For afterwords, analyze the data to produce visualizations that will either prove or disprove hypotheses.
- Numpy
- Pandas
- Seaborn
- Matplotlib.pyplot
- Counter
- Re
1. Are sharks getting tired of attacking, or are humans getting more annoying?
2. If they're not getting tired, why is that?
3.Are men more attaked than women or the contrary?
- Exploration of the dataset & stating hypothesis.
- Cleaning of the dataset.
- Transforming the dataset.
- Create visuals to prove or dispruve the hypothesis.
- Explanations of the foundings & conclusions.
1. Are sharks getting tired of attacking, or are humans getting more annoying?
As we can see in this plot, the shark attacks have increased in the last century. We can atribute that to different hypothesis.
- is that we have more resources to navigate
- change of weather , although we don't have the data to demonstrate that
- change on the human life conditions
- maybe also change in the sense of peril
Let's see the data in another graphic so that we can analize it deeper.
In this graphic we see only the number of attacks from the 1800 to the actual moment.
To be able to give response on which of our 4 hypothesis is true, we still need to analize deeper.
Let's understand why the sharks are not only not getting, tired, but getting fiercer.
2. If they're not getting tired, why is that?
Shall we take a look at the activities people were doing when the attackes occurred?
The answer is obviously, YES, WE SHALL!!
From the data we get from the graphic, we can assume that the great mass of the attacks come from lifestyle.
Let's get more specific and see only data on the activities from the last 50 years.
In perspective, we can see a decrease on the maybe considered more "survival" colum, fishing. And an increase of the "surf" column. But what also caught my attention is the shark interacting accidents has neither decreases or increases, so I want to check if that is because they are mostly concentrated recently or they're equally divided throughout the years.
We can see that most of the shark interacting attacks are on increase in recent years. So maybe the hypothesis that the sense of peril has changed might also be true. Also meaning that humans are getting more annoying.
But is it all humans or just a fraction?
3.Are men more attaked than women or the contrary?
Let's prove the obvious answer, that we're not going to say."
First let's check how the attacks over the years get distributed against the gender of the victims.
Is it always more or less the same pattern or is it diverse?
As we can see from the plot, men have always been the winners of getting attacked by sharks.
Now let's also check if we can extract more information from the age and the gender.
Is it just a fase, or are they like that?
We can see from this box plot that between men and women, women number of attacks tend to be more stable along the age, the min and the max are more distant and the 50% of the data is also wider, meaning the data is more stable. Whilst men tend to be attacked by sharks on a more concentrated range of age.
Maybe that has something to do with the activities that men and women are doing when the attacked happens.
The men normally do more adrenaline activities, so maybe they also have the peril sense lower and provoke more atacks? Let's see how many of the attacks where provoked between man and women.
We see that most of the provoked attacks are by men. Let's also check shark interacting, because os sense of peril, to compare men and women participation, to end our hypothesis.
At least 75% of the shark interactinc¡g attacks are by men. Let's see how many of them are provoked.
Seing that a great number of attacks are provoked, and all the other data we analized: We can definitely say that sharks are not getting tired, they're just getting angrier as we humans change our lifestyle, and become more annoying, by getting more stupid, mostly men.










