B It works but why? (Matching experimental data to a random equation)

AI Thread Summary
The discussion centers on the analysis of potential electric car deliveries, utilizing a formula that approximates daily trip counts based on available trips, a lowest bound, and an overlap factor. The formula, which includes a square root, yields results that closely match experimental data, prompting questions about its validity and reasoning. Participants suggest that the accuracy of predictions is paramount, and it's acceptable to state "because it works" if questioned about the formula's origin. To further validate the square root's appropriateness, plotting data on a log-log scale is recommended, as it can reveal the relationship between variables. Overall, the focus is on ensuring the formula's effectiveness for predicting profits in the electric delivery model.
adriandwor
Messages
1
Reaction score
0
TL;DR Summary
A random equation matches experimental values, but why?
Hey guys,

I've about a week left to submit my final paper for my trade degree in transportation.

The paper is about an analysis of potential implementation of an electric car for direct deliveries in my area where I live.
In part of it, I try to analyze how many possible trips a car like that could have a day and from that deduce potential yearly earnings.
I have about 5000 data points of trips in the area that cover 1 year span.

Data looks something like this:

dvHUnEs.png


Experimental are trips that have been arranged manually according to the data. Calculated is where I used a formula:

Sqrt ((Available trips - Lowest_bound) +Overlap_Factor) + Lowest_bound

It seems to come very close to the actual number of trips an electric car can have on that day but I don't understand why it comes so close and I don't really understand why I used this formula. Maybe someone can explain if it's just luck or there is some reason for using a square root. I'll probably be asked why this formula at the presentation.

Available trips - Total of all available trips on that day in this area.
Lowest bound - Least amount of possible trips taken (I calculated it by having a total distance of driving all available chronologically trips one by one, the pure driving time by amount of trips and add 6 minutes for each trip for loading/unloading time. 6 minutes is the average load/unload for 5000 trips last year. It gives 39 minutes. Time a car is available on that day, divided by 39 minutes gives a lowest_bound number of trips.

Overlap factor - I used excel to count the number of overlapping trips on a particular day with the idea that more trips can be taken as coloads if they happen to be available for pickup within 30 minutes of each other and 15 min driving distance from one another. In that case trip counts as 1 extra in the same time as the other.

I figured some of the overlapping trips + some of the trips not taken in lowest_bound will also be driven. I used a square root for the calculation but I can't explain why other than I've seen an equation some time ago that uses a sqrt.

It seems to me like it works.

Can anyone explain why? I'm a bit worried I'll be asked at my presentation why I chose to do that.

I appreciate your time.
 
Mathematics news on Phys.org
Can I suggest that you attach a full scale screen grab, so we can read the chart?
You don't have to embed it in the post, there's an "Attach files" option (I think it's in the lower right corner of your editor).
 
For a limited set of data you can always find an equation that works but will usually be a mystery.

At Cornell, some researchers built a software tool called Eureqa that discovered the equations of motion for a compound pendulum and really impressed the scientific community. The equations made sense physics wise as we knew how to derive them.

Later the same software was used to find the hidden equations behind some biology data and it did which made the biologists quite happy. However they couldn’t publish their results because the couldn’t explain the equation, it just worked.
 
jedishrfu said:
However they couldn’t publish their results because the couldn’t explain the equation, it just worked.
Funny, when Cauchy did that he didn't just get published, he got an equation named after him (again)! o0)

It seems to me like your goal is to calculate profits. In that case, isn't the accuracy of your predictions the most important thing? If someone asks you "why did you use this formula", what's wrong with saying "because it works"? You can track sum of residuals squared (see wikipedia) to quantify how good your formula is.

If you really want to prove that a square root is the right functional form, then I suggest you plot your experimental data vs. the "available trips" on a log-log scale. If the slope of this data is nearly 1/2 (particularly for large values of "available trips"), then the square root was good call. This is most useful when you have a large spread in the values of "available trips". This is the standard trick when you're trying to fit a power law of the form ##y = Ax^B##.
 
Seemingly by some mathematical coincidence, a hexagon of sides 2,2,7,7, 11, and 11 can be inscribed in a circle of radius 7. The other day I saw a math problem on line, which they said came from a Polish Olympiad, where you compute the length x of the 3rd side which is the same as the radius, so that the sides of length 2,x, and 11 are inscribed on the arc of a semi-circle. The law of cosines applied twice gives the answer for x of exactly 7, but the arithmetic is so complex that the...
Thread 'Unit Circle Double Angle Derivations'
Here I made a terrible mistake of assuming this to be an equilateral triangle and set 2sinx=1 => x=pi/6. Although this did derive the double angle formulas it also led into a terrible mess trying to find all the combinations of sides. I must have been tired and just assumed 6x=180 and 2sinx=1. By that time, I was so mindset that I nearly scolded a person for even saying 90-x. I wonder if this is a case of biased observation that seeks to dis credit me like Jesus of Nazareth since in reality...
Fermat's Last Theorem has long been one of the most famous mathematical problems, and is now one of the most famous theorems. It simply states that the equation $$ a^n+b^n=c^n $$ has no solutions with positive integers if ##n>2.## It was named after Pierre de Fermat (1607-1665). The problem itself stems from the book Arithmetica by Diophantus of Alexandria. It gained popularity because Fermat noted in his copy "Cubum autem in duos cubos, aut quadratoquadratum in duos quadratoquadratos, et...
Back
Top