<p>The MERLIN corpus has a mulitlayer annotation. The texts are lemmatized and part-of-speech-tagged. Furthermore, in addition to a minimally correct version of the text (target hypothesis), specific features of the learner language have been annotated. Go to <ahref="../de/C_research.php#annotations"target="_blank">MERLIN for research</a> to learn more about whether the single layers result from manual or automatic annotations (NLP). </p>
<p>The MERLIN corpus has a mulitlayer annotation. The texts are lemmatized and part-of-speech-tagged. Furthermore, in addition to a minimally correct version of the text (target hypothesis), specific features of the learner language have been annotated. Go to <ahref="C_research.php#annotations"target="_blank">MERLIN for research</a> to learn more about whether the single layers result from manual or automatic annotations (NLP). </p>
<p>The target hypotheses form the basis for annotations of learner language features (L2 features) . The "minimal target hypothesis"<strong>(TH1)</strong> is a minimally intervening version of the learner text that is orthographically and grammatically correct. Annotations of grammatical and orthographical learner language features refer to them (EA1). </p>
<p> In the explorative, smaller MERLIN core corpus, further L2 features regarding vocabulary, pragmatics, sociolinguistic appropriateness, and intelligibility have been annotated (EA2). Very often, those phenomena are not errors. These pilot annotations a rather explorative nature and should be interpreted with caution. They refer to the "extended target hypothesis" (<strong>TH2</strong>).</p>
<p>All L2 feature annotations have been deduced from various sources and described in detail in the <ahref="../de/C_download.php"target="_blank">annotation scheme</a>. You can review the development and origin of the indicators on which the annotation scheme is based at <ahref="../de/C_research.php#annotations"target="_blank">MERLIN for reserach</a>. The MERLIN annotations followed a strict policy of reliability control. Again, you can read more about this at <ahref="../de/C_research.php#anchor233"target="_blank">MERLIN for research
<p>All L2 feature annotations have been deduced from various sources and described in detail in the <ahref="C_download.php"target="_blank">annotation scheme</a>. You can review the development and origin of the indicators on which the annotation scheme is based at <ahref="C_research.php#annotations"target="_blank">MERLIN for reserach</a>. The MERLIN annotations followed a strict policy of reliability control. Again, you can read more about this at <ahref="C_research.php#anchor233"target="_blank">MERLIN for research
</a>.</p>
<divid="anchor1"></div>
<h3><ahref="#anchor1"onClick="toggle('#content1','#img1')"><imgid="img1"src="../de/img/toggle-expand.png"></a> Excursus: Interpretating „errors“ with target hypotheses </h3>
<h3><ahref="#anchor1"onClick="toggle('#content1','#img1')"><imgid="img1"src="img/toggle-expand.png></a> Excursus: Interpretating „errors“ with target hypotheses </h3>
<div id="content1"class="content">
<p>As learner language (L2) is regarded as an evolving language system in its own right, annotations were not merely based on error coding, but also took into account other linguistic characteristics.</p>
<p>In order to determine whether and to what extent a text deviates incorrectly, there must be a clear idea of what a learner presumably intended to write. In a learner text collection (learner corpus), it is important to make this interpretation explicit to make annotations more easily understandable and to avoid problems of reliability. Therefore, the MERLIN team formulated target hypotheses (TH) that are a corrected version of the learner texts. The team followed the rules developed for the <ahref="http://www.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/forschung/falko"target="_blank"class="reference">FALKO corpus</a> and adapted them to the project needs where necessary (cf. Reznicek/Lüdeling et al. 2012). </p>
<p><strong>Target hypothesis 2 (TH2) = lexically and pragmatically akcetable version of the learner text</strong></p>
<p>The "extended target hypothesis" aims at creating an <strong>acceptable</strong> (for a native speaker) version of the original learner text. <strong>TH2</strong> takes into account more language dimensions that often regard context-dependent phenomena like vocabulary and pragmatics. This assessment could only be made for a smaller part of the MERLIN corpus, the core corpus. It consists of a collection of texts which received either A2 or B2 ratings (for Italian: A2 and B1/B1+). <br/>
</p>
<p>For examples and more details see <ahref="../de/C_research.php#annotations"target="_blank">MERLIN for research</a>.</p>
<p>For examples and more details see <ahref="C_research.php#annotations"target="_blank">MERLIN for research</a>.</p>
</div>
<h2><aname="featurelist"></a>Annotated L2 features with examples</h2>
<p>The following contains lists of L2 features annotated in the MERLIN corpus that are illsutrated by examples from the languages in question.<br/>
@@ -268,7 +268,7 @@ ITA: *[A queste cita di posto?] </td>
<p><spanclass="StilSmall">* [...] tag-relevant extracts of learner language expressions {...} correction of the erroneous learner expression</span></p>
@@ -407,7 +407,7 @@ ITA: *[A queste cita di posto?] </td>
<p><spanclass="StilSmall">* [...] tag-relevant extracts of learner language expressions {...} correction of the erroneous learner expression</span></p>
@@ -584,7 +584,7 @@ Budeš mít narozeniny? Jaký dárek si přejete?</td>
<p><spanclass="StilSmall">* [...] tag-relevant extracts of learner language expressions {...} correction of the erroneous learner expression</span></p>
@@ -626,7 +626,7 @@ Budeš mít narozeniny? Jaký dárek si přejete?</td>
</p>
<p><spanclass="StilSmall">* [...] tag-relevant extracts of learner language expressions {...} correction of the erroneous learner expression</span></p>
</div>
<p><strong>Hint</strong>: A comprehensive overview of the annotated features is provided in the <ahref="../de/C_download.php#annotations"target="_blank">annotation scheme</a>. To learn how to search MERLIN for annotated features go to <ahref="#"onclick="document.forms['glossary'].submit();"class="a.reference"><?phpecho$trans['help_search'][$_SESSION['lang']];?></a>.</p>
<p><strong>Hint</strong>: A comprehensive overview of the annotated features is provided in the <ahref="C_download.php#annotations"target="_blank">annotation scheme</a>. To learn how to search MERLIN for annotated features go to <ahref="#"onclick="document.forms['glossary'].submit();"class="a.reference"><?phpecho$trans['help_search'][$_SESSION['lang']];?></a>.</p>