<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE article SYSTEM "http://www.atmos-chem-phys-discuss.net/inc/acpd/copernicus.dtd">
<article language="en">
	<journal>
		<journal_title>Atmospheric Chemistry and Physics Discussions</journal_title>
		<journal_url>www.atmos-chem-phys-discuss.net</journal_url>
		<issn>1680-7367</issn>
		<eissn>1680-7375</eissn>
		<volume_number>9</volume_number>
		<issue_number>3</issue_number>
		<publication_year>2009</publication_year>
	</journal>
	<doi>10.5194/acpd-9-14141-2009</doi>
	<article_url>http://www.atmos-chem-phys-discuss.net/9/14141/2009/</article_url>
	<abstract_html>http://www.atmos-chem-phys-discuss.net/9/14141/2009/acpd-9-14141-2009.html</abstract_html>
	<fulltext_pdf>http://www.atmos-chem-phys-discuss.net/9/14141/2009/acpd-9-14141-2009.pdf</fulltext_pdf>
	<start_page>14141</start_page>
	<end_page>14164</end_page>
	<publication_date>2009-06-26</publication_date>
	<article_title content_type="html">Comment on &quot;Quantitative performance metrics for stratospheric-resolving chemistry-climate models&quot; by  Waugh and Eyring</article_title>
	<authors>
		<author numeration="1" affiliations="1">
			<name>V. Grewe</name>
			<email>volker.grewe@dlr.de</email>
		</author>
		<author numeration="2" affiliations="1">
			<name>R. Sausen</name>
		</author>
	</authors>
	<affiliations>
		<affiliation numeration="1" content_type="html">Deutsches Zentrum für Luft- und Raumfahrt, Institut für Physik der  Atmosphäre, Oberpfaffenhofen, 82230 Wessling, Germany</affiliation>
	</affiliations>
	<abstract content_type="html">This comment focuses on the statistical limitations of a model grading, as
applied by D. Waugh and V. Eyring (2008) (WE08). The grade &lt;i&gt;g&lt;/i&gt; is calculated
for a specific diagnostic, which basically relates the difference of model
and observational data to the standard deviation in the observational
dataset. Monte Carlo simulations show that this method is not leading to
statistical significant gradings. Moreover, the difference between two models
is hardly significant. The results of the statistical tests performed in WE08
agree with our findings. However, most of those tests are based on special
cases, which implicitely assume that observations are available without any
errors and that the interannual variability of the observational data and the
model data are equal. Without these assumptions the grading becomes basically
insignificant. We further show that the inclusion of confidence intervals
into the grading approach is necessary, since it has the potential to change
the grading results drastically.</abstract>
	<references>
		<reference numeration="1" content_type="text"> Lary, D. J. and Aulov, O.: Space-based measurements of HCl: Intercomparison and historical contex, J Geophys Res., 113, D15S04, doi:10.1029/2007JD008715, 2008. </reference>
		<reference numeration="2" content_type="text"> Waugh, D. W. and Eyring, V.: Quantitative performance metrics for stratospheric-resolving chemistry-climate models, Atmos. Chem. Phys., 8, 5699–5713, 2008. </reference>
		<reference numeration="3" content_type="text"> World Meteorological Organisation (WMO): Scientfic Assessment of Ozone Depletion, Geneva, 2006. </reference>
	</references>
</article>

