<!-- help-annis-glossary.php, authored by Karin Schöne -->
<div id="main">
<?php
// side bar
require('F_mainsidebar.php');
?>
<form name="simple" action="index.php" method="post"><input type="hidden" name="curscr" value="F_simplesearch.php"></form>
<form name="advanced" action="index.php" method="post"><input type="hidden" name="curscr" value="F_advancedsearch.php"></form>
<form name="documents" action="index.php" method="post"><input type="hidden" name="curscr" value="F_documentsearch.php"></form>
<form name="statistics" action="index.php" method="post"><input type="hidden" name="curscr" value="F_statistics.php"></form>
<div id="mainpartwrapper">
<div id="mainpart3">
<div id="content-menu3">
<!--INSERT-->
<h1>Help for the search</h1>
<p>You can search the MERLIN corpus with the help of the search and visualisation software <a href="https://www.linguistik.hu-berlin.de/de/institut/professuren/korpuslinguistik/corpus-tools/annis-tutorials/gui-tutorial" target="_blank" class="reference">ANNIS</a>.</p>
<form>
<input class="bt" type="button" value="Search MERLIN in ANNIS" onclick="window.location.href='https://merlin-platform.eu/annis/'"/>
</form>
<p><strong>Getting started:</strong> Open the search. Choose an example search from <strong>↘</strong> <em><strong>Help/Examples</strong></em> to get an impression of how the ANNIS search works. You can then modify the query: choose and copy an annotation (see <a href="#annotations">section 2</a>), or use the <strong>↘</strong> <em><strong>Query Builder</strong></em> as described in <a href="#metadata">section 3</a>, to search for a specific L2 feature.</p>
<h2><a name="infosearch" id="infosearch"></a>1 Explanation of your search output </h2>
<p><img src="img/ANNIS-SCREEN-HELP.png" alt="ANNIS help" width="100%" /></p>
<ol>
<li>Search field displaying your query in the query language</li>
<li>Options: export search results or perform a frequency analysis</li>
<li>Choose L2 corpus </li>
<li>Number of tokens displayed to the left and right of the keyword</li>
<li>Metadata, i.e. information on the learner and the ratings, as well as statistical information</li>
<li>Excerpt from the learner text (L2 text) that contains the word or the feature you are looking for</li>
<li>TH1 = minimally corrected, i.e. orthographically and grammatically acceptable, version of the learner text; TH2 = sociolinguistically acceptable version of the learner text; TH1Diff and TH2Diff = description of the deviation between the L2 text and the respective target hypothesis</li>
<li>Categorical description of the error or the L2 feature [EA_category]</li>
<li>Actual manifestation of the feature/error [_type]</li>
<li>Display of the complete learner text</li>
</ol>
<p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> For automatic annotations displayed under "automatic grid" (POS annotations, lemmas, t-units, sentences) see <a href="C_research.php#anchor234" target="_blank">MERLIN for research</a>.</span><br /></p>
<p> </p>
<h2><a name="annotations" id="annotations"></a>2 Search MERLIN for L2 features</h2>
<p>In the following section, all annotated learner language features are listed according to their categorical description [EA_category] and specific manifestation [_type]. Copy the annotation names (tags) into the ANNIS search window to start a search.</p>
<p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> For concrete examples of annotated learner language features, see <a href="C_annotation.php#featurelist" target="_blank">MERLIN Annotations</a>, and for a detailed description of annotations and their scope (tag span) and annotation rules, see the <a href="C_download.php#annotations" target="_blank">MERLIN Annotation scheme</a>.</span></p>
<div id="anchor1"></div>
<h3><a href="#anchor1" onClick="toggle('#content1','#img1')"><img id="img1" src="img/toggle-expand.png"></a> G_ Grammar</h3>
<div id="content1" class="content">
<p>
<table cellspacing="0" cellpadding="2">
<tr>
<td><strong>EA_category=/G_Agr/</strong></td>
<td>agreement (subject and verb)</td>
</tr>
<tr>
<td><strong>EA_category=/G_Art/</strong></td>
<td>article</td>
</tr>
<tr>
<td><strong>EA_category=/G_Clit/</strong></td>
<td>ITA: clitic</td>
</tr>
<tr>
<td><strong>EA_category=/G_Conj/</strong></td>
<td>conjunction</td>
</tr>
<tr>
<td><strong>EA_category=/G_Inflect_Inexist/</strong></td>
<td>inexistent inflection (nouns, adj, verb)</td>
</tr>
<tr>
<td><strong>EA_category=/G_Morphol_Wrong/</strong></td>
<td>wrong inflection (nouns, pronouns, adj)</td>
</tr>
<tr>
<td><strong>EA_category=/G_Neg/</strong></td>
<td>negation general</td>
</tr>
<tr>
<td>G_Neg_g_neg_type="negdoub"</td>
<td>CZE: double negation</td>
</tr>
<tr>
<td><strong>EA_category=/G_Pos/</strong></td>
<td>part of speech error</td>
</tr>
<tr>
<td><strong>EA_category=/G_Prep/</strong></td>
<td>preposition</td>
</tr>
<tr>
<td><strong>EA_category=/G_Refl_pronrefl/</strong></td>
<td>reflexive pronoun</td>
</tr>
<tr>
<td>G_Refl_type="pronreflposs"</td>
<td>CZE: possessive reflexive pronoun</td>
</tr>
<tr>
<td><strong>EA_category=/G_Valency/</strong></td>
<td>verb valency: number of obligatory arguments</td>
</tr>
<tr>
<td><strong>EA_category=/G_Verb_compl/</strong></td>
<td>verb formation (morphol.)</td>
</tr>
<tr>
<td><strong>EA_category=/G_Verb_main/</strong></td>
<td>main verb</td>
</tr>
<tr>
<td>G_Verb_type="asp"</td>
<td>verb: aspect (CZE+ITA)</td>
</tr>
<tr>
<td>G_Verb_type="md"</td>
<td>verb: mood</td>
</tr>
<tr>
<td>G_Verb_type="tns"</td>
<td>verb: tense</td>
</tr>
<tr>
<td>G_Verb_type="vc"</td>
<td>verb: voice</td>
</tr>
<tr>
<td><strong>EA_category=/G_Wo/</strong></td>
<td>word order general</td>
</tr>
<tr>
<td>G_Wo_type="womaincl"</td>
<td>word order in main clause</td>
</tr>
<tr>
<td>G_Wo_type="wosubcl"</td>
<td>word order in subordinate clause</td>
</tr>
</table>
</div>
<div id="anchor2"></div>
<h3><a href="#anchor2" onClick="toggle('#content2','#img2')"><img id="img2" src="img/toggle-expand.png"></a> O_ Orthography</h3>
<div id="content2" class="content"><p>
<table cellspacing="0" cellpadding="2">
<col width="131">
<col width="216">
<tr>
<td><strong>EA_category=/O_Abbrev/</strong></td>
<td>abbreviation</td>
</tr>
<tr>
<td><strong>EA_category=/O_Apostr/</strong></td>
<td>GER+ITA: apostrophe</td>
</tr>
<tr>
<td><strong>EA_category=/O_Capit/</strong></td>
<td>capitalization</td>
</tr>
<tr>
<td><strong>EA_category=/O_Graph/</strong></td>
<td>general grapheme error</td>
</tr>
<tr>
<td>O_Graph_graphgen_act_type</td>
<td>CZE+ITA: diacritical marks</td>
</tr>
<tr>
<td>O_Graph_type="trans"</td>
<td>grapheme transposition</td>
</tr>
<tr>
<td><strong>EA_category=/O_Punct/</strong></td>
<td>punctuation</td>
</tr>
<tr>
<td><strong>EA_category=/O_Wordbd/</strong></td>
<td>word boundary</td>
</tr>
</table>
</p>
</div>
<div id="anchor3"></div>
<h3><a href="#anchor3" onClick="toggle('#content3','#img3')"><img id="img3" src="img/toggle-expand.png"></a> H_ Intelligibility**</h3>
<div id="content3" class="content"><p>
<table cellspacing="0" cellpadding="2">
<tr>
<td><strong>EA_category=/H_Intelltxt/</strong></td>
<td>intelligibility of text</td>
</tr>
<tr>
<td><strong>EA_category=/H_Intelltxt/H_Intellts/</strong></td>
<td>intelligibility of sentence</td>
</tr>
</table>
</p>
</div>
<div id="anchor4"></div>
<h3><a href="#anchor4" onClick="toggle('#content4','#img4')"><img id="img4" src="img/toggle-expand.png"></a> V_ Vocabulary**</h3>
<div id="content4" class="content"><p>
<table cellspacing="0" cellpadding="2">
<tr>
<td><strong>EA_category=/V_FS/</strong></td>
<td>formulaic sequence</td>
</tr>
<tr>
<td>V_FS_type="colloc"</td>
<td>formulaic sequence: collocation</td>
</tr>
<tr>
<td>V_FS_type="idiom"</td>
<td>formulaic sequence: idiom</td>
</tr>
<tr>
<td>V_FS_type="commphras"</td>
<td>formulaic sequence: communicative phraseologism</td>
</tr>
<tr>
<td><strong>EA_category=/V_Sequence_lexgrammer_inc/</strong></td>
<td>incomprehensible sequence caused by accumulation of lexical/grammatical error(s)</td>
</tr>
<tr>
<td><strong>EA_category=/V_FS_form/</strong></td>
<td>formulaic sequence: form error </td>
</tr>
<tr>
<td>V_form_word_fs_nonexist_range</td>
<td>non-existing form (word or formulaic sequence)</td>
</tr>
<tr>
<td><strong>EA_category=/V_semdenot_word_fs/</strong></td>
<td>semantic error: denotation (word or formulaic sequence)</td>
</tr>
<tr>
<td><strong>EA_category=/V_semconn_at_word_fs/</strong></td>
<td>semantic error: connotation (attitude), (word or formulaic sequence)</td>
</tr>
<tr>
<td><strong>EA_category=/V_semimprec/</strong></td>
<td>semantic error: precision (word or formulaic sequence)</td>
</tr>
<tr>
<td><strong>EA_category=/V_Wordform/</strong></td>
<td>general word formation error</td>
</tr>
<tr>
<td>V_Wordform_type="deriv"</td>
<td>word formation error: derivation</td>
</tr>
<tr>
<td height="23">V_Wordform_type="comp"</td>
<td>word formation error: composition</td>
</tr>
</table>
</p>
</div>
<div id="anchor5"></div>
<h3><a href="#anchor5" onClick="toggle('#content5','#img5')"><img id="img5" src="img/toggle-expand.png"></a> C_ Coherence/Cohesion**</h3>
<div id="content5" class="content"><p>
<table cellspacing="0" cellpadding="2">
<tr>
<td><strong>EA_category=/C_Con_accur/</strong></td>
<td>connector accuracy</td>
</tr>
<tr>
<td><strong>EA_category=/C_Coh_jump/</strong></td>
<td>content jumps</td>
</tr>
<tr>
<td><strong>EA_category=/C_Coh_ref/</strong></td>
<td>reference</td>
</tr>
<tr>
<td><strong>EA_category=/C_Coh_txtstruct/</strong></td>
<td>metacommunicative device</td>
</tr>
</table>
</p>
</div>
<div id="anchor6"></div>
<h3><a href="#anchor6" onClick="toggle('#content6','#img6')"><img id="img6" src="img/toggle-expand.png"></a> S_ Sociolinguistic appropriateness**</h3>
<div id="content6" class="content"><p>
<table cellspacing="0" cellpadding="2">
<tr>
<td>S_Txt_type="grfw"</td>
<td>salutations/complimentary closes</td>
</tr>
<tr>
<td>S_Txt_type="opcl"</td>
<td>opening/closing formulae</td>
</tr>
<tr>
<td>S_Form_type="gen"</td>
<td>inappropriate style (formality)</td>
</tr>
<tr>
<td>S_Form_type="addr"</td>
<td>inappropriate addressing (formality)</td>
</tr>
<tr>
<td>S_Var_type="clit"</td>
<td>ITA: lexicalised clitics (verbi procomplementari)</td>
</tr>
<tr>
<td>S_Var_type="duppron"</td>
<td>ITA: personal pronoun redundancy</td>
</tr>
<tr>
<td>S_Var_type="synstr"</td>
<td>ITA: marked syntactic structures</td>
</tr>
<tr>
<td>S_Var_type="che"</td>
<td>ITA: 'che polivalente'</td>
</tr>
<tr>
<td>S_Var_type="woweil"</td>
<td>GER: main clause word order after 'weil'</td>
</tr>
<tr>
<td>S_Var_type="partik"</td>
<td>GER: modal particles</td>
</tr>
</table>
</p>
</div>
<div id="anchor7"></div>
<h3><a href="#anchor7" onClick="toggle('#content7','#img7')"><img id="img7" src="img/toggle-expand.png"></a> P_ Pragmatics**</h3>
<div id="content7" class="content"><p>
<table cellpadding="2" cellspacing="0">
<col width="131">
<col width="216">
<tr>
<td><strong>EA_category=/P_Pol_dir/</strong></td>
<td width="407">politeness: overly direct language form</td>
</tr>
<tr>
<td><strong>EA_category=/P_Request/</strong></td>
<td>REQUEST general</td>
</tr>
<tr>
<td>P_Request_type="direct"</td>
<td>direct REQUEST</td>
</tr>
<tr>
<td>P_Request_type="indirect"</td>
<td>indirect REQUEST</td>
</tr>
</table>
</p>
</div>
<p><span class="StilSmall">** Note: These error categories are available only for a subset of MERLIN texts. See <a href="C_annotation.php#featurelist" target="_blank">MERLIN Annotations / Annotation structure</a>.</span></p>
<div id="anchor8"></div>
<h3><a href="#anchor8" onClick="toggle('#content8','#img8')"><img id="img8" src="img/toggle-expand.png"></a> Further specification of error categories</h3>
<div id="content8" class="content">
<table cellpadding="2" cellspacing="0">
<tr>
<td width="170"><strong>add</strong></td>
<td>superfluous (added) element</td>
</tr>
<tr>
<td width="170"><strong>ambig</strong></td>
<td>ambiguous (type of error cannot be specified)</td>
</tr>
<tr>
<td width="170"><strong>ch</strong></td>
<td> wrong choice of element </td>
</tr>
<tr>
<td width="170"><strong>merge</strong></td>
<td>elements are wrongly merged</td>
</tr>
<tr>
<td width="170"><strong>o</strong></td>
<td>omitted element </td>
</tr>
<tr>
<td width="170"><strong>pos</strong></td>
<td>wrong position</td>
</tr>
<tr>
<td width="170"><strong>split</strong></td>
<td>elements are wrongly split</td>
</tr>
</table>
</div>
<p> </p>
<h2 dir="ltr"><a name="metadata" id="metadata"></a>3 Narrowing the search using metadata </h2>
<p dir="ltr">Use ANNIS's <strong><em>Query Builder </em></strong>to search for features or a combination of features while narrowing the query based on specific metadata.</p>
<ol>
<li dir="ltr" aria-level="1">
Open the ↘ <em><strong>Query Builder</strong></em> in ANNIS. </li>
<li dir="ltr" aria-level="1">
Choose <em>↘ </em><strong><em>Word sequences and meta information</em></strong>. </li>
<li dir="ltr" aria-level="1">
Select the corresponding feature and its attribute under <strong><em>Linguistic sequence</em></strong> ↘ <strong><em>Initialize</em></strong> ↘ <strong><em>Add</em></strong>.</li>
<li dir="ltr" aria-level="1">
In the <strong><em>Toolbar</em></strong>, click on ↘ <strong><em>Create AQL Query</em></strong> to paste the query into the search field. </li>
</ol>
<p>To restrict the query to a specific group of learners (e.g. by L1 or age) or to a specific CEFR level (fair rating), add a metadata category under ↘ <em><strong>Meta information</strong></em> ↘ <strong><em>Add</em></strong> before creating the query in step 4, and tick the required attribute, e.g.:</p>
<p>
<table>
<tbody>
<tr>
<td><strong>_rating_fair_cefr</strong></td>
<td> CEFR level the test received in the re-rating<br /></td>
</tr>
<tr>
<td><strong>_author_L1</strong></td>
<td>Mother tongue of the learner </td>
</tr>
<tr>
<td><strong>_task_topic</strong></td>
<td>Task preceding the text</td>
</tr>
</tbody>
</table>
</p>
<p><strong>Alternatively</strong>, you can copy the feature you are searching for from the feature list in <a href="#annotations">section 2</a> and paste it into the ANNIS search field. Then add the metadata according to the following scheme to restrict your search to specific texts:</p>
<ul>
<li dir="ltr" aria-level="1">
<em>&amp; meta::_rating_fair_cefr="B1"</em> [A1, A2, B1+, B2]</li>
<li><em>&amp; meta::_author_L1="German"</em> [English, Russian, Arabic, etc.]</li>
</ul>
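<p>As an illustration, a tag from <a href="#annotations">section 2</a> combined with the metadata scheme above could look like the following sketch. The values <em>G_Art</em>, <em>B1</em> and <em>German</em> are chosen only as examples, and chaining two metadata restrictions in one query is an assumption based on general AQL usage; whether a given combination returns hits depends on the selected corpus.</p>
<pre><code>EA_category=/G_Art/ &amp; meta::_rating_fair_cefr="B1"
EA_category=/G_Art/ &amp; meta::_rating_fair_cefr="B1" &amp; meta::_author_L1="German"</code></pre>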
<p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> The <a href="https://korpling.github.io/ANNIS/4.5/user-guide/interface/index.html" target="_blank" class="reference">ANNIS User Guide</a> offers a thorough introduction to using the ANNIS interface.</span></p>
<h2 dir="ltr"><a name="freqanalysis" id="freqanalysis"></a>4 Retrieving statistical information </h2>
<p dir="ltr">To get an indication of the frequency of certain L2 features, use the frequency analysis of the ANNIS search.</p>
<ol>
<li>
Search for specific L2 features as described in section 2 or use <a href="#globEA_category">global error categories</a>.</li>
<li>
Then click on ↘ <em><strong>Frequency Analysis</strong></em> [2] and, on the right, on ↘ <em><strong>Perform Frequency Analysis</strong></em>. You will get a statistical analysis of the annotated features within the category in question.</li>
<li>To restrict the search to a certain CEFR level, e.g. B1, extend your query according to the following scheme: <em><strong>&amp; meta::_rating_fair_cefr="B1"</strong></em>.</li>
</ol>
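<p>A sketch of a complete query for such a frequency analysis, assuming the global grammar category and CEFR level B1 as example values:</p>
<pre><code>EA_category=/G_.*/ &amp; meta::_rating_fair_cefr="B1"</code></pre>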
<p><img src="img/ANNIS-FREQ-ANALYSIS.png" alt="Freq Analysis" width="100%" /></p>
<h4><a name="globEA_category" id="globEA_category"></a>Global error categories <br />
</h4>
<table>
<tbody>
<tr>
<td>EA_category=/G_.*/</td>
<td>phenomena at the grammatical level</td>
</tr>
<tr>
<td>EA_category=/O_.*/</td>
<td>phenomena at the orthographical level </td>
</tr>
<tr>
<td>EA_category=/H_.*/</td>
<td>phenomena at the level of intelligibility</td>
</tr>
<tr>
<td>EA_category=/C_.*/</td>
<td>phenomena at the level of coherence / cohesion </td>
</tr>
<tr>
<td>EA_category=/V_.*/</td>
<td>phenomena at the lexical level</td>
</tr>
<tr>
<td>EA_category=/S_.*/</td>
<td>phenomena at the level of sociolinguistic appropriateness </td>
</tr>
<tr>
<td>EA_category=/P_.*/</td>
<td>phenomena at the pragmatic level</td>
</tr>
</tbody>
</table>
</p>
</div>
<!--INSERT END-->
</div>
</div>
</div>
</div>