Notifications
Clear all

New To This - Help With Understanding Chi-Square and Residuals

28 Posts
14 Users
0 Reactions
5 Views
(@steward-souten)
Posts: 32
Registered
Topic starter
 

Hi Community

Completely new to surveying and have virtually no experience in surveying, or network adjustments so my apologies for asking what may seem like simple or silly questions.

I am currently using StarNet to adjust a network of 3 stations, 1 is known & fixed, and the bearing is known and fixed.

I can run the adjustment however the Chi-Square Test failed on the upper bound. See attached of my output.

The fact that it failed is not the point as the data has no value other then to try and teach myself how to adjust a network. I am trying to understand what a Chi-square does.

From my research I get the following definitions:-

Definition 1 -
Chi-Square test is a statistical test that compares the actual measured observations against adjusted observations in terms of how widely they vary with respect to each other.

Definition 2 - The Chi-Square distribution is used to test the sample variance to see if it is in agreement with the population variance.

I think both definitions are saying the same thing but worded differently. Can someone further explain what a Chi-Square is in terms of surveying?

What is bothering me is that during my research the terms sample and population is mentioned over and over again. This bothers me because if you take my example of 3 stations, 1 is known and is fixed to a known bearing, there is no sample just a population. What I mean by that is I have observed angles and distances on all 3 stations which to me means that my data relates to the population - Does anyone agree? Can someone explain what sample and population is in terms of surveying?

The term residual also keeps cropping up along with Error, from my research the terms can be defined as:-

Error -
The error of an observed value is the deviation of the observed value from the (unobservable) true value of a quantity of interest (for example, a population mean).

Residual - The residual of an observed value is the difference between the observed value and the estimated value of the quantity of interest (for example, a sample mean).

Can someone explain what residuals are in terms of surveying?

I would really appreciate it If anyone can shed any light on this as i am struggling with these concepts.

Many thanks

S.Souten

(I have attached a picture of my simple 3 station traverse and output from StarNet showing that Chi-square has failed)

Attached files

 
Posted : May 11, 2017 1:26 pm
(@lee-d)
Posts: 2382
Registered
 

At the risk of getting crushed by more knowledgeable folks on this forum...

In very simple terms, the Chi square test is a ratio of the amount of adjustment applied to a network (the a posteriori error) to the initial estimates that are given for each measurement (the a priori error). What it tells you is if your initial assumptions about your data are valid. Things that contribute to the initial error estimates include the accuracy of the instrument used, estimates of the possible centering error of the instrument and target, and in the case of a GPS baseline the error estimates provided by the baseline processor.

Residuals are the difference between an observed value and an adjusted value, or most probable value. As I understand it, and again to put it in very simple terms, let's assume we measure a distance ten times with a 100' steel chain of approximately 450'. Due to various factors, we are going to come up with variations in our ten distances. Let's assume that our shortest distance is 449.94 and our largest is 450.03. Least squares will test each value in that range and come up with a most probable value; the MPV will be the value that has the smallest residuals. If I remember correctly, the formula for this is the square root of the sum of the squares of the residuals divided by the number of observations minus one - hence the name Least Squares.

Feel free to batter me at your leisure fellow posters...

 
Posted : May 11, 2017 1:53 pm
(@lee-d)
Posts: 2382
Registered
 

BTW, to truly understand least squares you need to have at least a working knowledge of calculus and especially linear algebra. My son started school as a Geomatics major and figured out not too far into it that a dual major in mathematics was going to be of great benefit.

 
Posted : May 11, 2017 1:57 pm
(@kent-mcmillan)
Posts: 11419
 

Steward Souten, post: 427881, member: 12714 wrote:
Can someone explain what residuals are in terms of surveying?

Sure. Survey measurements generally are not error-free quantities. They are best thought of as estimates of the true value of some quantity such as an angle, distance, or height difference. While the sort of estimates that a skilled surveyor can produce are for some practical purposes what many people would consider to be exact determinations of what was measured, they never are truly so perfect and always contain errors of some magnitude.

For example, a distance of nominally 300 ft. measured with a tape may be in error by 0.03 ft. or more and the same distance measured with a good quality EDM may be in error by only 0.003 ft.

The presence of angular errors may be demonstrated by a number of methods, the easiest one to describe being the classical method of surveying closed polygons where the geometric test of comparing the sum of the interior angles of the figure to the exact theoretical value gives a means of detecting some types of errors in the work.

The typical method of adjustment involves making small corrections to measurements so that the geometric tests are met - interior angles sum to (n-2) x 180å¡, for example, where n is the number of sides. Those small corrections added to measured quantities are known as residuals. While the residual is typically treated as being a correction that is added to a measurement , it's worth noting that in some survey literature, the residuals are conceived of as being the errors contained in the measurements and are to be subtracted from the measurements. Star*Net deals with residuals as small corrections to be added to observations (measurements).

 
Posted : May 11, 2017 2:35 pm
(@kent-mcmillan)
Posts: 11419
 

In good quality survey measurements, errors should tend to be random (as likely to be + as -) and to cluster around a mean value of 0.

The random errors commonly found in survey measurements are typically treated as having a pattern such that the likelihood of a random error of a particular magnitude drops off as the error gets larger. In mathematical terms, the errors in survey measurements are commonly treated as having a mound-shaped Gaussian distribution. that is described by a characteristic value known as the standard error.

 
Posted : May 11, 2017 2:40 pm
(@kent-mcmillan)
Posts: 11419
 

The standard error is a value characteristic of a measuring process, such as measuring angles with a particular instrument to targets centered over ground marks, that the errors in approximately 68% of a sufficiently large series of measurements would not be expected to exceed.

In order to compare the magnitues of residuals in examining the outputs of an adjustment such as Star*Net performs, the residuals are standardized by dividing them by the standard errors that were expected of the process that produced the measurement.

 
Posted : May 11, 2017 2:46 pm
(@warren-smith)
Posts: 830
Registered
 

Nice primer, guys!

 
Posted : May 11, 2017 2:55 pm
(@bill93)
Posts: 9834
 

I pretty much agree with everything that's been said, but have the urge to talk about it, too.

In your example, you gave the program some estimates of the standard errors. Those need to be realistic estimates of what your instruments and procedures produce, such that 68% of the time (one standard deviation) your results are within the range of what you gave the program.

The program found that after doing its best adjustment, your overall residuals (the amount that your measurements didn't fit the best estimate it made of what you were measuring) was 1.517 times the standard error (averaged as rms over all types of measurements).

Of course, it also breaks it down so you see that the angle measurements fit pretty close to as good as you told it (1.127), the distance measurements fit much better than you told it (0.131), and the zenith values were not nearly as good as you told it (2.292). So the zenith measurements were the culprit.

The chi-sq test failed because, with the amount of redundancy you have in your collection of measurements, it was reasonable for the residuals to come out no more than 1.210 times your standard errors, but not 1.517 times.

 
Posted : May 11, 2017 3:21 pm
(@mathteacher)
Posts: 2081
Registered
 

One question not addressed was the difference between a sample and the population. If a distance is measured 20 times, then we have a sample of 20 measurements drawn from the population made up of every single measurement of that distance that could be made. Our sample size is 20, but the population is infinite.

 
Posted : May 12, 2017 4:26 am
(@kent-mcmillan)
Posts: 11419
 

MathTeacher, post: 427972, member: 7674 wrote: One question not addressed was the difference between a sample and the population. If a distance is measured 20 times, then we have a sample of 20 measurements drawn from the population made up of every single measurement of that distance that could be made. Our sample size is 20, but the population is infinite.

There are two different approaches that may be used to estimate the standard errors of survey measurements. One is based upon prior evaluation by testing and the other treats the standard errors as unknown quantities to be worked out from analysis of the residuals of the adjustment.

The former method is efficient in that the standard errors used to weight measurements are derived from a sufficiently large sample size that they are well characterized, with small uncertainties in the values of the standard errors. The second method tends to be very inefficient where the standard errors are to be worked out from relatively few redundant measurements, which is equivalent to estimating the standard error of some measuring process from a small number of test measurements.

Some processes, such as angle and distance measurements, taking into account target and instrument centering, are much more easily characterized by testing beforehand. The manufacturers even specify standard errors that their equipment should not perform worse than under normal conditions.

 
Posted : May 12, 2017 5:06 am
(@kevin-samuel)
Posts: 1043
 

The Chi-Square test is a "goodness of fit" test in lay terms. It tells you how well your weighting strategy fits with your actual data.

The chi-square test can be failed by exceeding the bounds on either the upper or lower bounds.

Exceeding the lower bounds may be serious. This may indicate that your weighting strategy is too conservative or it could fail because of a small data set with low redundancy.

Exceeding the upper limit is ALWAYS serious. This may indicate that your weighting strategy is too optimistic, there me be blunders in the data, or there may be uncorrected systematic errors in the data set.

 
Posted : May 12, 2017 5:32 am
(@jim-in-az)
Posts: 3361
Registered
 

Lee D, post: 427890, member: 7971 wrote: BTW, to truly understand least squares you need to have at least a working knowledge of calculus and especially linear algebra. My son started school as a Geomatics major and figured out not too far into it that a dual major in mathematics was going to be of great benefit.

I disagree. A good grasp of Statistics is whats important.

 
Posted : May 12, 2017 5:41 am
(@lee-d)
Posts: 2382
Registered
 

Jim in AZ, post: 427991, member: 249 wrote: I disagree. A good grasp of Statistics is whats important.

Statistics to interpret the results. Linear algebra to perform the adjustment.

 
Posted : May 12, 2017 5:58 am
(@jim-in-az)
Posts: 3361
Registered
 

Lee D, post: 427997, member: 7971 wrote: Statistics to interpret the results. Linear algebra to perform the adjustment.

I'll agree with that, but does anyone actually do this longhand?

 
Posted : May 12, 2017 6:41 am
(@mightymoe)
Posts: 9920
Registered
 

Jim in AZ, post: 428020, member: 249 wrote: I'll agree with that, but does anyone actually do this longhand?

I know surveyors who longhand calculated compass rule adjustments, dmd areas, average end area volumes, geodetic coordinates, solar observations, and on and on (heck I used to do all those, some of them almost everyday),,,,,,,,,
but I've never heard of anyone who sits down and does least squares,,,,,,,,,
It's a black box operation.

 
Posted : May 12, 2017 6:52 am
(@mathteacher)
Posts: 2081
Registered
 

Back to the original questions. Getting good statistical results from the given network design is likely to be difficult and might result from pure chance.

Perhaps adding a point or two would help.

 
Posted : May 12, 2017 6:59 am
(@lee-d)
Posts: 2382
Registered
 

Jim in AZ, post: 428020, member: 249 wrote: I'll agree with that, but does anyone actually do this longhand?

As a practical matter, no. I'm under the influence of my son who just completed two semesters of Measurement Science. For his final he had to devise an LSA for an example traverse and write a program to execute it.

 
Posted : May 12, 2017 7:04 am
(@bill93)
Posts: 9834
 

MathTeacher, post: 428023, member: 7674 wrote: Getting good statistical results from the given network design is likely to be difficult

Why is that? I would think a good combination of angle and distance measurements would work fine.

 
Posted : May 12, 2017 7:10 am
(@jim-in-az)
Posts: 3361
Registered
 

Bill93, post: 428025, member: 87 wrote: Why is that? I would think a good combination of angle and distance measurements would work fine.

That's an awfully linear geometrical figure...

 
Posted : May 12, 2017 7:22 am
(@bill93)
Posts: 9834
 

Jim in AZ, post: 428029, member: 249 wrote: That's an awfully linear geometrical figure

You're probably thinking of angle-only intersections to measure or stake a point, which would have a lot of leverage against you making the point very uncertain in distance from your setup points. But that's no problem if you are measuring distances too. Run an example and look at the ellipses.

 
Posted : May 12, 2017 7:33 am
Page 1 / 2