Scoring Overview

Purpose

The Image2Biomass competition scores our predictions with a single number. For each pasture image, you predict how much plant matter (biomass) it contains. The organizers then use a scoring formula to measure how close those predictions are to the measured values. This score is called a weighted R².

Weighted R²

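This is the main formula used to score our predictions. Don’t worry if it looks intimidating. We’ll break it down step by step. Here y_j is the true value for row j, ŷ_j is our prediction for that row, w_j is the row’s weight, and ȳ_w is the weighted mean of the true values: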

R_w² = 1 - ( Σ_j w_j (y_j - ŷ_j)² ) / ( Σ_j w_j (y_j - ȳ_w)² )

In simple terms:

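The score tells us what fraction of the weighted variation in the true values is explained by our predictions. A perfect score is 1.0. A score of 0 means we did no better than always predicting the weighted mean, and a negative score means we did worse than that.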
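The formula can be sketched as a small NumPy function. This is a minimal illustration, not the organizers' scoring code: the function name `weighted_r2` and the toy values are assumptions made up for this example.

```python
import numpy as np

def weighted_r2(y_true, y_pred, weights):
    """Weighted R^2 = 1 - SS_res / SS_tot (illustrative sketch)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    weights = np.asarray(weights, dtype=float)

    # Weighted mean of the true values.
    y_bar_w = np.sum(weights * y_true) / np.sum(weights)
    # Weighted squared error of our predictions.
    ss_res = np.sum(weights * (y_true - y_pred) ** 2)
    # Weighted spread of the true values around their weighted mean.
    # Note: this is zero if every true value is identical, and the
    # division below is then undefined.
    ss_tot = np.sum(weights * (y_true - y_bar_w) ** 2)
    return 1.0 - ss_res / ss_tot

# Toy values (made up): three rows, the last one counts double.
y_true = [10.0, 20.0, 30.0]
weights = [1.0, 1.0, 2.0]

perfect = weighted_r2(y_true, y_true, weights)              # predictions equal the truth
close = weighted_r2(y_true, [12.0, 18.0, 33.0], weights)    # small errors
print(perfect, close)
```

With these toy numbers, the perfect predictions score exactly 1.0 and the slightly-off predictions score a little below that.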

Weighted Mean

This is how we calculate the weighted average of all the true biomass values:

ȳ_w = ( Σ_j w_j y_j ) / ( Σ_j w_j )

This gives more influence to rows that are considered more important (have a higher weight).
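A tiny worked example may help here. The three values and weights below are made up for illustration; they are not competition data.

```python
import numpy as np

# Toy true values and row weights (made up for illustration).
y = np.array([10.0, 20.0, 30.0])
w = np.array([1.0, 1.0, 2.0])

# Weighted mean: (1*10 + 1*20 + 2*30) / (1 + 1 + 2)
y_bar_w = np.sum(w * y) / np.sum(w)

# NumPy's built-in weighted average gives the same result.
y_bar_w_check = np.average(y, weights=w)

# For comparison, the plain mean treats every row equally.
plain_mean = y.mean()

print(y_bar_w, plain_mean)
```

The weighted mean (22.5) sits closer to 30 than the plain mean (20.0) does, because the row with value 30 carries twice the weight of the others.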

Residual Sum of Squares

SS_res = Σ_j w_j (y_j - ŷ_j)²

This adds up all the squared differences between our predictions and the truth, multiplied by how important each one is. The smaller this value, the better.
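As a quick arithmetic check, here is the same sum computed on three made-up rows. The numbers are assumptions chosen so the result is easy to verify by hand.

```python
import numpy as np

# Toy values (made up): truth, predictions, and row weights.
y = np.array([10.0, 20.0, 30.0])
y_hat = np.array([12.0, 18.0, 33.0])
w = np.array([1.0, 1.0, 2.0])

residuals = y - y_hat                 # [-2, 2, -3]
weighted_sq = w * residuals ** 2      # [4, 4, 18]
ss_res = weighted_sq.sum()            # 4 + 4 + 18

print(weighted_sq, ss_res)
```

The last row contributes 18 of the total 26: its error is only slightly larger than the others, but its weight doubles the penalty.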

Total Sum of Squares

SS_tot = Σ_j w_j (y_j - ȳ_w)²

This measures how spread out the true values are, again weighted by importance. It sets the baseline for comparison.
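The same kind of hand-checkable example works for the total sum of squares. The values below are made up; only the formula comes from the text above.

```python
import numpy as np

# Toy true values and row weights (made up for illustration).
y = np.array([10.0, 20.0, 30.0])
w = np.array([1.0, 1.0, 2.0])

y_bar_w = np.sum(w * y) / np.sum(w)   # weighted mean = 22.5
deviations = y - y_bar_w              # [-12.5, -2.5, 7.5]
ss_tot = np.sum(w * deviations ** 2)  # 156.25 + 6.25 + 2 * 56.25

print(y_bar_w, ss_tot)
```

Here SS_tot comes out to 275. Any SS_res smaller than that gives a positive R_w², and the smaller the ratio SS_res / SS_tot, the closer the score gets to 1.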

Component Weights

The scoring system gives more weight to some components. This pie chart shows how each one contributes to the final score:

[Figure: Component Weights Pie Chart]
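One way to picture how component weights enter the formula is to attach a weight to every row according to the component it belongs to. The sketch below assumes each scored row is one (image, component) pair. The component names and weight values are hypothetical placeholders, not the competition's real ones; use the values from the chart or the official rules in practice.

```python
import numpy as np

# HYPOTHETICAL component names and weights, for illustration only.
component_weights = {
    "component_a": 0.5,
    "component_b": 0.3,
    "component_c": 0.2,
}

# Assumed layout: one row per (image, component) pair. Two images here.
row_components = np.array([
    "component_a", "component_b", "component_c",
    "component_a", "component_b", "component_c",
])

# Per-row weight w_j, looked up from the row's component.
w = np.array([component_weights[name] for name in row_components])

# Share of the total weight carried by each component.
share = {
    name: float(w[row_components == name].sum() / w.sum())
    for name in component_weights
}
print(share)
```

Under this assumed layout, a component's share of the total weight matches its slice of the pie chart, so errors on heavily weighted components move the score the most.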

What Do These Biomass Values Look Like?

This bar chart shows the average amount of each biomass type in the training data. It helps you understand what is “typical.”

[Figure: Average Biomass Bar Chart]

Fit vs. Mean?

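This chart compares our model’s R² score to a naive baseline that always predicts the mean. A baseline that predicts the weighted mean scores exactly 0 by construction, so any positive score is an improvement over it. The chart shows that our model explains much more of the variation in the data: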

[Figure: R² Improvement Chart]
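The comparison with the mean baseline can be reproduced on synthetic data. Everything below is made up for illustration: the random values, the row weights, the noise level, and the helper name `weighted_r2` are assumptions, not the competition data or our actual model.

```python
import numpy as np

def weighted_r2(y_true, y_pred, weights):
    """Weighted R^2 = 1 - SS_res / SS_tot (illustrative sketch)."""
    y_bar_w = np.sum(weights * y_true) / np.sum(weights)
    ss_res = np.sum(weights * (y_true - y_pred) ** 2)
    ss_tot = np.sum(weights * (y_true - y_bar_w) ** 2)
    return 1.0 - ss_res / ss_tot

rng = np.random.default_rng(0)
y = rng.uniform(0.0, 100.0, size=200)        # synthetic "true" values
w = rng.choice([0.1, 0.2, 0.5], size=200)    # synthetic row weights

# Baseline: predict the weighted mean of the true values for every row.
y_bar_w = np.sum(w * y) / np.sum(w)
baseline_pred = np.full_like(y, y_bar_w)

# Stand-in for a model: the truth plus some random noise.
noisy_pred = y + rng.normal(0.0, 10.0, size=200)

r2_baseline = weighted_r2(y, baseline_pred, w)
r2_noisy = weighted_r2(y, noisy_pred, w)
print(r2_baseline, r2_noisy)
```

For the baseline, SS_res equals SS_tot, so its score is 0. Any prediction with a smaller weighted squared error than the baseline, like the noisy stand-in here, scores above 0.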