Reply to topic  [ 16 posts ]  Go to page 1, 2  Next
(very) Basic stats help 
Author Message
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
I trained as a biologist so my knowledge of statistics is minimal :)

I've just managed to get some image analysis for HCS running on our cluster. I'd like to prove that the results for my HPC technique are statistically similar to using their existing software, what form of statistics should I run on it? The data is in the form:
Code:
Well       HPC      current method
A01       12345    12345
A02       56789    56789

_________________
I was nickholway on the old boards.


Sat Feb 06, 2010 4:54 pm
Profile WWW
Spends far too much time on here
User avatar

Joined: Thu Apr 23, 2009 11:36 pm
Posts: 3527
Location: Portsmouth
Reply with quote
I don't think i fully understand the question - but I think you are basially asking how you can prove that the numbers in the column for HPC and those in the column for current method are very similar??

If so, then can't you just find out the difference between the two in every case? Then you can have an average difference, minimum and maximum difference, and even a spread with pretty graphs etc showing every value, giving an idea of the distribution of values.

Please note, I've never studied statistics in my life - but that's how I would do it!

:P

_________________
Image


Sat Feb 06, 2010 10:56 pm
Profile
I haven't seen my friends in so long
User avatar

Joined: Thu Apr 23, 2009 7:10 pm
Posts: 5490
Location: just behind you!
Reply with quote
paired student T-test could be the answer.

_________________
johnwbfc wrote:
I care not which way round it is as long as at some point some sort of semi-naked wrestling is involved.

Amnesia10 wrote:
Yes but the opportunity to legally kill someone with a giant dildo does not happen every day.

Finally joined Flickr


Sat Feb 06, 2010 11:15 pm
Profile
Spends far too much time on here
User avatar

Joined: Thu Apr 23, 2009 11:36 pm
Posts: 3527
Location: Portsmouth
Reply with quote
I was going to suggest that next. :lol: :P

_________________
Image


Sat Feb 06, 2010 11:34 pm
Profile
I haven't seen my friends in so long
User avatar

Joined: Fri Apr 24, 2009 6:37 am
Posts: 6954
Location: Peebo
Reply with quote
Excel can do t-tests if you load the Analysis toolpack.

_________________
When they put teeth in your mouth, they spoiled a perfectly good bum.
-Billy Connolly (to a heckler)


Sun Feb 07, 2010 9:10 am
Profile
I haven't seen my friends in so long
User avatar

Joined: Thu Jun 18, 2009 5:10 pm
Posts: 5836
Reply with quote
If it were me I'd try to prove both difference and similarity.

T-test proves difference
Correlation proves similarity

_________________
Jim

Image


Sun Feb 07, 2010 9:31 am
Profile
I haven't seen my friends in so long
User avatar

Joined: Thu Apr 23, 2009 7:10 pm
Posts: 5490
Location: just behind you!
Reply with quote
davrosG5 wrote:
Excel can do t-tests if you load the Analysis toolpack.

Always best to avoid Excel for stats :D Minitab at the least.


rustybucket wrote:
If it were me I'd try to prove both difference and similarity.

T-test proves difference
Correlation proves similarity

T-test proves if the difference is significant according the parameters set out. You can have sets of results that are perfectly correlated but are different and the OP wants to prove the results are not different.

but i could be wong :lol:

_________________
johnwbfc wrote:
I care not which way round it is as long as at some point some sort of semi-naked wrestling is involved.

Amnesia10 wrote:
Yes but the opportunity to legally kill someone with a giant dildo does not happen every day.

Finally joined Flickr


Sun Feb 07, 2010 10:56 am
Profile
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
Thanks for the help.

Here's a scatterplot for my data:

Image

And the original file in csv format is here (22kb). My technique is image2_PixelIntensityMean and Average Outer Intensity is the existing technique.

The same tool which I'm doing the image analysis on has an inbuilt T test and it comes up with:

Code:
TreatmentMean
  11825.6049632696

ControlMean
  7357.68928203906

TreatmentSD
  3477.07048308103

ControlSD
  3157.57509178113

TreatmentSE
  177.438510067524

ControlSE
  161.134329153865

DifferenceMean
  4467.91568123058

DifferenceSD
  964.113200756248

DifferenceSE
  49.1996957527977

tStatistic
  90.8118559041397

pValue
  0


(For some reason it insists on renaming the fields treatment is my technique.)

Does the above table look sensible? I don't know how many decimal places the P value is to.

_________________
I was nickholway on the old boards.


Last edited by Coref on Sun Feb 07, 2010 11:50 am, edited 1 time in total.



Sun Feb 07, 2010 11:46 am
Profile WWW
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
bobbdobbs wrote:
davrosG5 wrote:
Excel can do t-tests if you load the Analysis toolpack.

Always best to avoid Excel for stats


Telling biologists that their past few year's work maybe a pile of poo because they'd used Excel's own regression functions is a hobby of mine. :)

Quote:
:D Minitab at the least.


I'd go for R myself (if I knew enough stats).

_________________
I was nickholway on the old boards.


Sun Feb 07, 2010 11:48 am
Profile WWW
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
Actually, from what I've read on the t test, it may not work as the data that I'm looking at are not normalised. I'm wondering if carrying out the same statistics as the scientist originally performed (in this case working out the efficacy of various drugs on cells) and then doing a T test on his results and my own.

_________________
I was nickholway on the old boards.


Sun Feb 07, 2010 12:47 pm
Profile WWW
I haven't seen my friends in so long
User avatar

Joined: Thu Apr 23, 2009 7:10 pm
Posts: 5490
Location: just behind you!
Reply with quote
Coref wrote:
Actually, from what I've read on the t test, it may not work as the data that I'm looking at are not normalised. I'm wondering if carrying out the same statistics as the scientist originally performed (in this case working out the efficacy of various drugs on cells) and then doing a T test on his results and my own.


mann whitney?

This is why I like the fact we have a dedicated statistician at work to go and ask :D

_________________
johnwbfc wrote:
I care not which way round it is as long as at some point some sort of semi-naked wrestling is involved.

Amnesia10 wrote:
Yes but the opportunity to legally kill someone with a giant dildo does not happen every day.

Finally joined Flickr


Sun Feb 07, 2010 1:49 pm
Profile
Doesn't have much of a life

Joined: Thu Apr 23, 2009 8:23 pm
Posts: 638
Location: 3959 miles from the centre of the Earth - give or take a bit
Reply with quote
First off, can you clarify what the two columns of figures represent - are they individual results or are the A/B/C blocks replicates?

Second, are you trying to prove that the PixelIntensity method gives comparable results to the AOT method?

_________________
i7 860 @ 3.5GHz, GTX275, 4GB DDR3


Sun Feb 07, 2010 3:26 pm
Profile
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
dogbert10 wrote:
First off, can you clarify what the two columns of figures represent - are they individual results or are the A/B/C blocks replicates?


Each row in the file represents the same image, but with two different methods of analysis one in each column. The other column is actually a coordinate (rows are letters, columns are numbers eg B1 is 2nd from left in the top column).

Quote:
Second, are you trying to prove that the PixelIntensity method gives comparable results to the AOT method?


Exactly.

_________________
I was nickholway on the old boards.


Sun Feb 07, 2010 8:09 pm
Profile WWW
Doesn't have much of a life

Joined: Thu Apr 23, 2009 8:23 pm
Posts: 638
Location: 3959 miles from the centre of the Earth - give or take a bit
Reply with quote
Right.

So, summing all the results for each block (A, B, C etc), it is clear that the pixel method is consistently "over counting", which isn't surprising. However, if you plot the totals against each other, you find that there is an almost perfect linear relationship between the two methods. (AOI = 0.898xPixel - 78289) with an R-squared of 0.9913. For each individual data point, the equation is AOI = 0.8739xPixel - 2976.6 with an R squared of 0.9261.

So, while the two methods give quite different results, there is a very strong relationship between the two. So, if you use the pixel method and apply the above equation, you get reusults that are quite close to those from the AOI method.

Here's block A:

Pixel AOI calculated
14836.56298 10266.68164 9989.072388
13948.07763 9259.987305 9212.625041
6812.057559 1813.310059 2976.457101
8836.744484 4011.632568 4745.831005
5655.597539 1482.571533 1965.826689
15833.87909 10933.84375 10860.62694
5449.987808 1227.18811 1786.144345
15648.57683 10695.43164 10698.69129
6458.702376 2258.579102 2667.660006
15303.77094 10883.83203 10397.36542
9439.19292 4675.243164 5272.310693
13390.46867 10026.29004 8725.330571
11123.52801 5451.89209 6744.251128
14322.2248 9913.797852 9539.592253
12166.87358 6828.822754 7656.030822
15872.66113 11304.67285 10894.51856
13034.07752 8356.905273 8413.880345
13245.9075 9129.931641 8598.998564
13466.1725 8569.248047 8791.488148
13099.45097 8024.505859 8471.010203
15464.76456 9860.850586 10538.05775
12566.94026 8175.209473 8005.649093
15169.36717 10313.85254 10279.90997
6220.324126 6889.153809 2459.341254


Is that what you were after?

_________________
i7 860 @ 3.5GHz, GTX275, 4GB DDR3


Mon Feb 08, 2010 8:30 am
Profile
Occasionally has a life
User avatar

Joined: Mon Apr 27, 2009 6:20 pm
Posts: 446
Location: ~/
Reply with quote
Thanks for that. The correlation is pretty good. I had a quick look at the outliers on my graph. Most of them are caused by some form of artefact in the field. The thresholding picks the artefact up rather than the cells. I need to look at the original technique to see how they get around that.

I'm still trying to get hold of a copy of the original data so I can compare IC50s between techniques.

_________________
I was nickholway on the old boards.


Tue Feb 09, 2010 8:32 pm
Profile WWW
Display posts from previous:  Sort by  
Reply to topic   [ 16 posts ]  Go to page 1, 2  Next

Who is online

Users browsing this forum: No registered users and 33 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
Powered by phpBB® Forum Software © phpBB Group
Designed by ST Software.