Zen and the art of STM publishing (with apologies to Robert Pirsig - 1)
If quality were a measurable parameter, it would not have a normal distribution (in the statistical sense). At best it would be nominal, i.e. categorical. In a recent PLoS ONE article by Liz Allen et al (2), the authors define a four point scale with the quality of a paper being assessed as “landmark”, “major addition to knowledge”, “useful step forward”, or just “for the record”. But this is, as Pirsig would describe, still a romantic view of reality. Allen et al are after all ranking the publications sourced by their own institute and all estimates of quality are very much in the eye of the beholder.
John Ioaniddis and colleagues (3) have argued, most current published research findings are false. For a research finding to be true, as Liz Allen and her colleagues define it, two things must be true. First, the hypothesis underpinning the study must be correct, and secondly the experimental methodology must be powerful enough to provide a conclusive result. Allen et als paper only reviews cases where they believe both of these test are positive. But there are three other possibilities: that the hypothesis is correct but the design is flawed and yields a negative result (“false negatives”), that the hypothesis is wrong but the statistical result is positive (“false positives”), or that neither is true (“noise”). It is likely that in terms of published articles, the “noise” category is the most numerous, followed by the “false positives”. By contrast, “false negatives” will be quite rare, and articles detailing real progress will be fewer still.
Peter Binfield, Managing Editor at PLos argues (4) that most journals make the peer review process unnecessarily complex and time-consuming by trying to assess whether a paper will have “substantial impact” or “significant advance”, rather than just focussing on methodological rigor., and allowing posterity to be the judge of significance. In other words, the classical STM peer-review process supports the view that quality is a continuous parameter, whereas, in reality, with the benefit of sufficient hindsight, the probabilities of either the design of the hypothesis being correct are either 0 or 1.
So article quality doesn’t end in a decimal point, and if it is deemed to be an important factor, then it should be measured from within the research program that funded the work. After all, as a tax payer, scientific progress means wealth, health and a better world for my grandchildren. Impact Factors, Eigenfactors, and Hirsch Indices aren’t really going to fire me up at the next Election…
Notes:
1. Zen and the art of motorcycle maintenance, Robert Pirsig.
http://en.wikipedia.org/wiki/Zen_and_the_Art_of_Motorcycle_Maintenance
2. Looking for Landmarks: The Role of Expert Review and Bibliometric Analysis in Evaluating Scientific Publication Outputs. PLoS ONE 4(6): e5910
http://www.plosone.org/article/info:doi/10.1371/journal.pone.0005910
3. Ioannidis JPA (2005) Why Most Published Research Findings Are False. PLoS Med 2(8): e124.
http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124
4. PLoS ONE: Background, future, development and article-level metrics, Peter Binfield
http://conferences.aepic.it/index.php/elpub/elpub2009/paper/view/114/51
John Ioaniddis and colleagues (3) have argued, most current published research findings are false. For a research finding to be true, as Liz Allen and her colleagues define it, two things must be true. First, the hypothesis underpinning the study must be correct, and secondly the experimental methodology must be powerful enough to provide a conclusive result. Allen et als paper only reviews cases where they believe both of these test are positive. But there are three other possibilities: that the hypothesis is correct but the design is flawed and yields a negative result (“false negatives”), that the hypothesis is wrong but the statistical result is positive (“false positives”), or that neither is true (“noise”). It is likely that in terms of published articles, the “noise” category is the most numerous, followed by the “false positives”. By contrast, “false negatives” will be quite rare, and articles detailing real progress will be fewer still.
Peter Binfield, Managing Editor at PLos argues (4) that most journals make the peer review process unnecessarily complex and time-consuming by trying to assess whether a paper will have “substantial impact” or “significant advance”, rather than just focussing on methodological rigor., and allowing posterity to be the judge of significance. In other words, the classical STM peer-review process supports the view that quality is a continuous parameter, whereas, in reality, with the benefit of sufficient hindsight, the probabilities of either the design of the hypothesis being correct are either 0 or 1.
So article quality doesn’t end in a decimal point, and if it is deemed to be an important factor, then it should be measured from within the research program that funded the work. After all, as a tax payer, scientific progress means wealth, health and a better world for my grandchildren. Impact Factors, Eigenfactors, and Hirsch Indices aren’t really going to fire me up at the next Election…
Notes:
1. Zen and the art of motorcycle maintenance, Robert Pirsig.
http://en.wikipedia.org/wiki/Zen_and_the_Art_of_Motorcycle_Maintenance
2. Looking for Landmarks: The Role of Expert Review and Bibliometric Analysis in Evaluating Scientific Publication Outputs. PLoS ONE 4(6): e5910
http://www.plosone.org/article/info:doi/10.1371/journal.pone.0005910
3. Ioannidis JPA (2005) Why Most Published Research Findings Are False. PLoS Med 2(8): e124.
http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124
4. PLoS ONE: Background, future, development and article-level metrics, Peter Binfield
http://conferences.aepic.it/index.php/elpub/elpub2009/paper/view/114/51
Labels: Impact Factor, PLos ONE, quality

