Analysis of Red Bull Racing and Mercedes Benz AMG F1 Racing Dynasties

Upneet Bir

Introduction and Motivation

Formula 1 is the world's highest class of racing. The FIA Formula One World Championship has been one of the premier forms of racing around the world since its inaugural season in 1950. The word "formula" in the name refers to the set of rules to which all participants cars must conform. A Formula One season consists of a series of races, known as Grands Prix, which take place worldwide on purpose-built circuits and on public roads. Taken from kaggle.com Our objective in this tutorial is to take a look at two of the most recent dominating teams. Red Bull Racing Honda and Mercedes Benz AMG F1. From 2010 - 2013, Red Bull won four straight championships as a team. Mercedes Benz is currently on a 7 year win streak of championships For our analysis we will look at Red Bull from 2010 - 2013 and Mercedes Benz from 2016 - 2019

How can we compare these teams success if they raced in two different generations?

Within Formula 1, there are specific categories that have stayed the same.

The major categories we will look at are:

  1. Winning Percentage
  2. DNF (Did Not Finish) Percentage
  3. Qualifying Exploration
  4. Starting at Pole Analysis

My hypothesis is that Mercedes Benz will be the better team in most of these categories, especially Winning Percentage. This is solely because I have watched F1 for the past 2 years so all I know is Mercedes Benz winning the championships.

Data Collection and Description

Our first order of business is to gather our data. Here is the data set on kaggle.com

Red Bull Data Filtering

What we are going to do here is get the data of Red Bull from 2010 - 2013. This is going to help because we can filter data later using this table. For example later I will be using this time frame to calculate the first and last race which will help me filter data.

Mercedes Benz Data Filtering 2016 - 2019

This step is to get the race data of MB from 2016 - 2019 just like we did with Red Bull

So now we have the races between 2010 - 2013 for Red Bull Racing and can start our analysis

Red Bull Data Exploration Step 1

So now we have our first plot. This plot shows us how frequently Red Bull finished in one position. Clearly we can see that out of the 154 races between 2010 - 2013, the Red Bull Racing team finished on the top of the podium over 40 times. Along with that, they finsihed on the podium (1st, 2nd, or 3rd) over 80 times. This accounts for more than half of their finishes So this preliminary graph shows us that Red Bull pulled in a majority of the points to be gotten over these four years.

The most suprising fact was that out of 154 races, a Red Bull driver was on the podium for more than half of the races

So we can see there is null data ( \N ) but that will not affect us in our analysis, since we are not going to be able to compare car times across generations since each year there are different versions of cars. Since the cars we are comparing do not overlap, we can not compare their lap times because the engineering of each car is not the same

Mercedes Benz Exploration Step 1

Step 1 Results and Analysis

So now, comparing the amazing success of these teams we see that Mercedes Benz over these four years had more success than Red Bull. There was over 15% more podium percentage and 10 percent more winning percentage

So far, we see that Mercedes has the upper hand when it comes to winning races and finishing in the top 3 in general. What this data also shows us is that Mercedes Benz was that even though their data set was 12 races bigger (something that we can not adjust because it wouldn't be fair to Red Bull), Mercedez Benz still had the higher ratio of wins. In racing, more races = more wear on car engines and body parts = less productive performance. Based on our preliminary exploration, we see the Mercedes Benz didn't lose performance even when they had atleast 2280 miles of wear more on their car. (The minimum race distance in F1 is 190 miles)

To recap on Step 1, one alternative to our data wrangling apprach would be to manually go through and select the records we want

While on the topic of wear and tear on the car, lets see how consistent these cars were. We will define a DNF as Did Not Finish.

Step 2 DNF Consistency

Lets begin the process of digging deeper and looking into how consistent each team was.

The first way to measure how consistent they were is to look at how good their engineering department was, basically how many times did they fail to finish a race and score 0 points?

The DNF shows is this data. In our table it is any statusId that is not 1. 1 means the racer finished with no problem. Any number other than that there was a problem such as a crash, engineering failure, pitstop failure, and finished too low.

What can we take away from this step?

Before I started this step I predicted that there would not be too much of a difference in the DNF data because its possible a Red Bull or Mercedes fell out of the points but still finished the race and it wasn't reflected in our data.

After looking at the statistical data in this step, I see that I was wrong. The difference between these two teams was over 5% or 7 races which means that over a higher amount of races, Mercedes Benz was more consistent than Red Bull was even though Red Bull had a slightly smaller data sample. (166 races for MB vs 154 races for RB)

What we can try to do is see how efficient they were compared to other teams at the time.

The next most important thing to look at is qualifying data. How good was the car at starting position? This gives us a sense of the ability of the car without traffic and without wear and tear. So peak performance

This will be step 3: Qualifying Exploration

Step 3 Qualifying Exploration

Using this data, lets make a frequency plot to see how often they qualified in each position I see there is some Null Data. Lets assume that that means the car didn't qualify and give it a position 23 to make the data more readible

Step 3 Analysis

From our Step 3 work we don't get any data that will help lead us to a conlcusion about the qualifying performances of these cars. Here we do see that MB started from pole more often and 2nd and 3rd more often than Red Bull did. But still there is no conclusive advantage to either team. We can resonably conlcude that both teams were just as equal in their qualifying periods

We need to take a closer look into this data. How did these teams fair when starting from pole? Which team was able to convert their pole positions into Wins and which team "blew it" more often?

Data Exploration Step 4 Starting at Pole Exploration

Lets take a look at how successful each team was at converting the best starting position on the grid into a win. And if they didn't win, how far down the order did they drop?

Step 4 Analysis

So looking at the plots, we can see that while both performed very well from pole generally, Red Bull lost more than 5 positions 6 times from pole. Mercedes, on the other hand, only lost more than 5 positions from pole 4 times.

Even towards the top of the table (within 0 to -5 places lost) we see a much tighter clustering of points. The way we can test this is by getting the average of positions lost.

My hypothesis looking at the scatterplots is that Mercedes Benz is much better at getting and keeping the lead

Lets go further into the data and find the chances that

Red Bull start and finish 1st Mercedes start and finish 1st Complete a Hypothesis Test

Hypothesis: A driver for Mercedes Benz F1 was starting on pole was more likely to finish and win the race

Null Hypothesis: A driver for MB is not more likely to win a race when starting on pole than RB

What we see from this data is that Mercedes is much more efficient at starting and finishing from pole position. An advantage of over more than 10% compared to Red Bull. Also that a MB driver is more likely to win from pole than RB.

Some more points to take away from this calculation was that Mercedes Benz did infact have less places lost overall, by about 0.7 places.

Now while doing that I decided that looking at the spread within (0 to -5) should be done by looking at all values outside of -5 as outliers. So looking at the output provided there I can conclude that there is no difference in the average places lost within 0 to -5. Both lost 1.0 places on average.

The whole idea of this analysis was to see which car performed better at staying ahead. Qualifying allowed both teams to fully show their teams engineering potential when there is no traffic on track and perfect conditions for the car to succeed. The race day data challenges the teams to work on their stategical powers and stay ahead. So our data shows us that Mercedes Benz stategists were much better overall at keeping their leads in the race and the drivers were more efficient at navigating through traffic and extending their lead in open air (open air allows for better aerodynamic performance for cars = faster lap times)

Conclusion and Understanding

Now that we have explored all this data, what conclusions can we draw?

To summarize the major categories we looked at were:

  1. Winning Percentage
  2. DNF (Did Not Finish) Percentage
  3. Qualifying Exploration
  4. Starting at Pole Analysis

Why did I take this approach?

The reason I took this approach was too take a high to low look at the data. So I wanted to start by looking at the winning percentage of the team which is the first thing that comes to mind. After that I wanted to move onto consistency of the teams, then looking at the deeper reasons for success which come before the day of the race. So seeing how these teams qualified onto the grid and then lastly seeing which team did the best with the opportunity.

Results of this Analysis

This data set did not have data associated to the results of constructors standings in 2010 and forward. So after some simple googling I found this data:

2013:

1st Place - Red Bull 596

2nd Place - Mercedes Benz 360

Difference: 236

2019:

1st Place - Mercedes Benz 739

2nd Place - Ferrari 504

Difference: 235

I can see that both Mercedes Benz and Red Bull won the championships by the same margin (off by 1 pt).

Mercedes edged out Red Bull in all of these categories. From 2016 - 2019, Mercedes displayed a dominance that was unparalleled in this age. With a podium percentage of over 69% and a winning percentage of over 34%, Mercedes showed that its amazing engineering and large budget paid off with 4 straight constructor titles

What should we walk away with?

Formula 1 teams are consistently trying to improve and inovate on pervious success and failure. The whole point of this analysis and tutorial was to show the recipe of success for these teams. Both teams succeeded not because they won many races (which they did) but because of their consistentcy. The whole point of racing is to be an unpredictable sport. The driver errors, crashes, and new rule regulations are supposed to make F1 unpredictable. For example in the 2020 F1 season there was a 3 way tie for the 3rd place in constructors title championship, Ferrari F1 plummeted down the standings from 2nd in 2019 to 6th in 2020. To summarize, F1 and racing was created to be a world of unpredictabliliy and excitement. Mercedes Benz and Red Bull in their winning streaks were able to reverse that and create their dynasties. They created a sense of normal in the racing community that led to Sebatian Vettel's 9 GP wins in a row and memes about how every race podium would be the same because Mercedes Benz were so dominant.

Resources:

Red Bull Racing

Mercedes Benz AMG Racing

Forumla 1 Standings 2019

Formula 1 Standings 2020

What are some further things to look into?

The world of F1 is one that is dominated by Money. Teams that have a higher budget are always at the top of the table. For example in 2019, Mercedes had a budget of 484 million dollars and won the Construtors title One thing I could look into, if in a bigger group rather than myself, would be to see how efficiently teams spend money. Just like our Moneyball project. There was also no table that contained the budgets for F1 teams