{"id":1808,"date":"2011-08-25T14:43:03","date_gmt":"2011-08-25T18:43:03","guid":{"rendered":"http:\/\/blogs.law.harvard.edu\/kotrc\/?p=1808"},"modified":"2011-08-25T14:43:03","modified_gmt":"2011-08-25T18:43:03","slug":"negative-eigenvalues-negative-eigenconfidence","status":"publish","type":"post","link":"https:\/\/archive.blogs.harvard.edu\/kotrc\/2011\/08\/25\/negative-eigenvalues-negative-eigenconfidence\/","title":{"rendered":"Negative Eigenvalues, Negative Eigenconfidence"},"content":{"rendered":"<p>Had another slow start. I&#8217;m a bit more optimistic about getting started earlier for next week, since that&#8217;s when the pool will start opening at its usual, earlier time\u2014which will give me some motivation to get out early, exercise, and settle down to work at a more reasonable hour than, say, the 10:30 start I managed today. In support of that, I spent a bit of time once I had sat down at my desk this morning mapping out the next week, juggling time between exercise, research, and the many career events going on next week. This feels a bit better.<\/p>\n<p>Added some more writing to the methods section of the morphospace write-up I had started on Tuesday, which felt good. Sentence by sentence, the thesis will come together. Once I got stuck\u2014on what justification to present for not including resting stages in my analysis, at which point I realized I don&#8217;t really know anything about resting stages in diatoms\u2014I decided to abandon ship and switch to something more productive. Fortunately, R was open and waiting for me.<\/p>\n<p>I thought the first task I&#8217;d do was to calculate the %age of the total variance in the morphospace data is captured by the two axes chosen to represent it. It&#8217;s not entirely clear what a &#8220;good&#8221; %age is, but in some PCA cases I&#8217;ve seen people like having 80% or so; in Boyce&#8217;s thesis it&#8217;s just over 50% I think. Finding out how to do this has not turned out to be easy. As far as I had understood before, the magnitude of the eigenvalues were an indication of variance, such that the sum of the eigenvalues was equal to (or related through a constant to) the total variance. Supposedly the <em>cmdscale()<\/em>\u00a0function in R supplies a readily-calculated value (called <em>GOF, <\/em>for goodness of fit), i.e. the eigenvalues of the two axes as a proportion of the sum of all of the eigenvalues. However, a little bit of digging in the R documentation and the R forums suggests that things aren&#8217;t so simple. Apparently when using non-Euclidean distance measures (like the one I&#8217;m using), you can run into negative eigenvalues. This may or may not ruin the calculation of the <em>GOF <\/em>values, depending on whether the negative eigenvalues are subtracted out of the sum of the eigenvalues. It is entirely unclear whether this happens in the R version I am using, and therefore whether the values I&#8217;m getting are valid or not. Of course, I completely don&#8217;t understand at all what negative eigenvalues mean, or why the sum of the eigenvalues should equal the variance&#8230;. but never mind.<\/p>\n<p>Even if they are correct, the values provided by the <em>cmdscale() <\/em>function for my data set are absolutely abysmal. The values returned, presumably for axes 1 and 2, are 13% and 17%. That adds up to 30%, so that gives basically no confidence at all that the arrangement of the taxa we&#8217;re seeing in the plot resembles their higher-dimensional arrangement at all. Yech! Disappointment.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Had another slow start. I&#8217;m a bit more optimistic about getting started earlier for next week, since that&#8217;s when the pool will start opening at its usual, earlier time\u2014which will give me some motivation to get out early, exercise, and settle down to work at a more reasonable hour than, say, the 10:30 start I [&hellip;]<\/p>\n","protected":false},"author":2222,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14607,13584],"tags":[16233],"class_list":["post-1808","post","type-post","status-publish","format-standard","hentry","category-research-journal","category-timekeeping","tag-morphospace"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/posts\/1808","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/users\/2222"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/comments?post=1808"}],"version-history":[{"count":3,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/posts\/1808\/revisions"}],"predecessor-version":[{"id":1811,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/posts\/1808\/revisions\/1811"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/media?parent=1808"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/categories?post=1808"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/kotrc\/wp-json\/wp\/v2\/tags?post=1808"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}