<div id="main"> <?php // side bar require('F_mainsidebar.php'); ?> <form name="simple" action="index.php" method="post"><input type="hidden" name="curscr" value="F_simplesearch.php"></form> <form name="advanced" action="index.php" method="post"><input type="hidden" name="curscr" value="F_advancedsearch.php"></form> <form name="documents" action="index.php" method="post"><input type="hidden" name="curscr" value="F_documentsearch.php"></form> <form name="statistics" action="index.php" method="post"><input type="hidden" name="curscr" value="F_statistics.php"></form> <div id="mainpartwrapper"> <div id="mainpart3"> <div id="content-menu3"> <!--INSERT--> <h1>Help for the search</h1> <p>You can search the MERLIN corpus with the help of the search and visualisation software <a href="https://www.linguistik.hu-berlin.de/de/institut/professuren/korpuslinguistik/corpus-tools/annis-tutorials/gui-tutorial" target="_blank" class="reference">ANNIS</a>.</p> <form> <input class="bt" type="button" value="Search MERLIN in ANNIS" onclick="window.location.href='https://merlin-platform.eu/annis/'"/> </form> <p><strong>Getting started:</strong> Open the search. Choose an example search from <strong>↘</strong> <em><strong>Help/Examples</strong></em> to get an impression how the ANNIS search works. Now, you can modify the query. Choose and copy an annotation (see <a href="#annotations">section 2</a>) or use the <strong>↘</strong> <em><strong>Query Builder</strong></em> as described in <a href="#metadata">section 3</a> to search for a specific L2 feature.</p> <h2><a name="infosearch" id="infosearch"></a>1 Explanation of your search output </h2> <p><img src="img/ANNIS-SCREEN-HELP.png" alt="ANNIS help" width="100%" /></p> <ol> <li>Search field displaying your query in the query language</li> <li>Options: export search results or perform a frequency analysis</li> <li>Choose L2 corpus </li> <li>Number of tokens displayed left and right from the key word</li> <li>Metadata, i. e. information on the learner and the ratings, as well as statistical information</li> <li>Detail from the learner text (L2 text) that contains the word or the feature you are looking for </li> <li>TH1 = minimally corrected, i. e. orthographically and grammatically acceptable version of the learner texts; TH2 = sociolinguistically acceptable version of the learner text; TH1Diff and TH2Diff = description of the deviation between the L2 text and the target hypothesis</li> <li>Categorical description of the error or the L2 feature [EA_category] </li> <li>Actual manifestation of the feature/error [_type]</li> <li>displays the complete learner text</li> </ol> <p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> For automatic annotations displayed under "automatic grid" (POS annotations, lemmas, t-units, sentences) see <a href="C_research.php#anchor234" target="_blank">MERLIN for research</a>.</span><br /></p> <p> </p> <h2><a name="annotations" id="annotations"></a>2 Search MERLIN for L2 features</h2> <p>In the following section, all annotated learner language features are listed according to their categorical description [EA_category] and specific manifestation [_type]. Copy the annotation names (tags) into the ANNIS search window to start a search.</p> <p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> For concrete examples of annotated learner language features, see <a href="C_annotation.php#featurelist" target="_blank">MERLIN Annotations</a>, and for a detailed description of annotations and their scope (tag span) and annotation rules, see the <a href="C_download.php#annotations" target="_blank">MERLIN Annotation scheme</a>.</span></p> <div id="anchor1"></div> <h3><a href="#anchor1" onClick="toggle('#content1','#img1')"><img id="img1" src="img/toggle-expand.png"></a> G_ Grammar</h3> <div id="content1" class="content"> <p> <table cellspacing="0" cellpadding="2"> <tr> <td><strong>EA_category=/G_Agr/</strong></td> <td>agreement (subject and verb)</td> </tr> <tr> <td><strong>EA_category=/G_Art/</strong></td> <td>article</td> </tr> <tr> <td><strong>EA_category=/G_Clit/</strong></td> <td>ITA: clitic</td> </tr> <tr> <td><strong>EA_category=/G_Conj/</strong></td> <td>conjunction</td> </tr> <tr> <td><strong>EA_category=/G_Inflect_Inexist/</strong></td> <td>inexistent inflection (nouns, adj, verb)</td> </tr> <tr> <td><strong>EA_category=/G_Morphol_Wrong/</strong></td> <td>wrong inflection (nouns, pronouns, adj)</td> </tr> <tr> <td><strong>EA_category=/G_Neg/</strong></td> <td>negation general</td> </tr> <tr> <td>G_Neg_g_neg_type="negdoub"</td> <td>CZE: double negation</td> </tr> <tr> <td><strong>EA_category=/G_Pos/</strong></td> <td>part of speech error</td> </tr> <tr> <td><strong>EA_category=/G_Prep/</strong></td> <td>preposition</td> </tr> <tr> <td><strong>EA_category=/G_Refl_pronrefl/</strong></td> <td>reflexive pronoun</td> </tr> <tr> <td>G_Refl_type="pronreflposs"</td> <td>CZE: possessive reflexive pronoun</td> </tr> <tr> <td><strong>EA_category=/G_Valency/</strong></td> <td>verb valency: number of obligatory arguments</td> </tr> <tr> <td><strong>EA_category=/G_Verb_compl/</strong></td> <td>verb formation (morphol.)</td> </tr> <tr> <td><strong>EA_category=/G_Verb_main/</strong></td> <td>main verb</td> </tr> <tr> <td>G_Verb_type="asp"</td> <td>verb: aspect (CZE+ITA)</td> </tr> <tr> <td>G_Verb_type="md"</td> <td>verb: mood</td> </tr> <tr> <td>G_Verb_type="tns"</td> <td>verb: tense</td> </tr> <tr> <td>G_Verb_type="vc"</td> <td>verb: voice</td> </tr> <tr> <td><strong>EA_category=/G_Wo/</strong></td> <td>wor order general</td> </tr> <tr> <td>G_Wo_type="womaincl"</td> <td>word order in main clause</td> </tr> <tr> <td>G_Wo_type="wosubcl"</td> <td>word order in subordinate clause</td> </tr> </table> </div> <div id="anchor2"></div> <h3><a href="#anchor2" onClick="toggle('#content2','#img2')"><img id="img2" src="img/toggle-expand.png"></a> O_ Orthography</h3> <div id="content2" class="content"><p> <table cellspacing="0" cellpadding="2"> <col width="131"> <col width="216"> <tr> <td><strong>EA_category=/O_Abbrev/</strong></td> <td>abbreviation</td> </tr> <tr> <td><strong>EA_category=/O_Apostr/</strong></td> <td>GER+ITA: apostrophe</td> </tr> <tr> <td><strong>EA_category=/O_Capit/</strong></td> <td>capitalization</td> </tr> <tr> <td><strong>EA_category=/O_Graph/</strong></td> <td>general grapheme error</td> </tr> <tr> <td>O_Graph_graphgen_act_type</td> <td>CZE+ITA: diacritical marks</td> </tr> <tr> <td>O_Graph_type="trans"</td> <td>grapheme transposition</td> </tr> <tr> <td><strong>EA_category=/O_Punct/</strong></td> <td>punctuation</td> </tr> <tr> <td><strong>EA_category=/O_Wordbd/</strong></td> <td>word boundary</td> </tr> </table> </p> </div> <div id="anchor3"></div> <h3><a href="#anchor3" onClick="toggle('#content3','#img3')"><img id="img3" src="img/toggle-expand.png"></a> G_ Intelligibility**</h3> <div id="content3" class="content"><p> <table cellspacing="0" cellpadding="2"> <tr> <td><strong>EA_category=/H_Intelltxt/</strong></td> <td>intelligibility of text</td> </tr> <tr> <td><strong>EA_category=/H_Intelltxt/H_Intellts/</strong></td> <td>intelligibility of sentence</td> </tr> </table> </p> </div> <div id="anchor4"></div> <h3><a href="#anchor4" onClick="toggle('#content4','#img4')"><img id="img4" src="img/toggle-expand.png"></a> V_ Vocabulary**</h3> <div id="content4" class="content"><p> <table cellspacing="0" cellpadding="2"> <tr> <td><strong>EA_category=/V_FS/</strong></td> <td>formulaic sequence</td> </tr> <tr> <td>V_FS_type="colloc"</td> <td>formulaic sequence: collocation</td> </tr> <tr> <td>V_FS_type="idiom"</td> <td>formulaic sequence: idiom</td> </tr> <tr> <td>V_FS_type="commphras"</td> <td>formulaic sequence: com<span id="page38R_mcid0"><span role="presentation" dir="ltr">municative phraseologism</span></span></td> </tr> <tr> <td><strong>EA_category=/V_Sequence_lexgrammer_inc/</strong></td> <td>incomprehensible sequence caused by accumulation of lexical/grammatical error(s)</td> </tr> <tr> <td><strong>EA_category=/V_FS_form/</strong></td> <td>formulaic sequence: form error </td> </tr> <tr> <td>V_form_word_fs_nonexist_range</td> <td>non-existing form (word or formulaic sequence)</td> </tr> <tr> <td><strong>EA_category=/V_semdenot_word_fs/</strong></td> <td>semantic error: denotation (word or formulaic sequence)</td> </tr> <tr> <td><strong>EA_category=/V_semconn_at_word_fs/</strong></td> <td>semantic error: connotation (attitude), (word or formulaic sequence)</td> </tr> <tr> <td><strong>EA_category=/V_semimprec/</strong></td> <td>semantic error: precision (word or formulaic sequence)</td> </tr> <tr> <td><strong>EA_category=/V_Wordform/</strong></td> <td>general word formation error</td> </tr> <tr> <td>V_Wordform_type="deriv"</td> <td>word formation error: derivation</td> </tr> <tr> <td height="23">V_Wordform_type="comp"</td> <td>word formation error: composition</td> </tr> </table> </p> </div> <div id="anchor5"></div> <h3><a href="#anchor5" onClick="toggle('#content5','#img5')"><img id="img5" src="img/toggle-expand.png"></a> C_ Coherence/Cohesion**</h3> <div id="content5" class="content"><p> <table cellspacing="0" cellpadding="2"> <tr> <td><strong>EA_category=/C_Con_accur/</strong></td> <td>connector accuracy</td> </tr> <tr> <td><strong>EA_category=/C_Coh_jump/</strong></td> <td>content jumps</td> </tr> <tr> <td><strong>EA_category=/C_Coh_ref/</strong></td> <td>reference</td> </tr> <tr> <td><strong>EA_category=/C_Coh_txtstruct/</strong></td> <td>metacommunicative device</td> </tr> </table> </p> </div> <div id="anchor6"></div> <h3><a href="#anchor6" onClick="toggle('#content6','#img6')"><img id="img6" src="img/toggle-expand.png"></a> S_ Sociolinguistic appropriateness**</h3> <div id="content6" class="content"><p> <table cellspacing="0" cellpadding="2"> <tr> <td>S_Txt_type="grfw"</td> <td>salutations/complimentary closes</td> </tr> <tr> <td>S_Txt_type="opcl"</td> <td>opening/closing formulae</td> </tr> <tr> <td>S_Form_type="gen"</td> <td>inappropriate style (formality)</td> </tr> <tr> <td>S_Form_type="addr"</td> <td>inappropriate addressing (formality)</td> </tr> <tr> <td>S_Var_type="clit"</td> <td>ITA: lexicalised clitics (verbi procomplementari)</td> </tr> <tr> <td>S_Var_type="duppron"</td> <td>ITA: personal pronoun redundancy</td> </tr> <tr> <td>S_Var_type="synstr"</td> <td>ITA: marked syntactic structures</td> </tr> <tr> <td>S_Var_type="che"</td> <td>ITA: 'che polivalente'</td> </tr> <tr> <td>S_Var_type="woweil"</td> <td>GER: main clause word order after 'weil'</td> </tr> <tr> <td>S_Var_type="partik"</td> <td>GER: modal particles</td> </tr> </table> </p> </div> <div id="anchor7"></div> <h3><a href="#anchor7" onClick="toggle('#content7','#img7')"><img id="img7" src="img/toggle-expand.png"></a> P_ Pragmatics**</h3> <div id="content7" class="content"><p> <table cellpadding="2" cellspacing="0"> <col width="131"> <col width="216"> <tr> <td><strong>EA_category=/P_Pol_dir/</strong></td> <td width="407">politeness: overly direct language form</td> </tr> <tr> <td><strong>EA_category=/P_Request/</strong></td> <td>REQUEST general</td> </tr> <tr> <td>P_Request_type="direct"</td> <td>direct REQUEST</td> </tr> <tr> <td>P_Request_type="indirect"</td> <td>indirect REQUEST</td> </tr> </table> </p> </div> <p><span class="StilSmall">** Note: these error categories are only accessible for a subset of MERLIN texts. See <a href="C_annotation.php#featurelist" target="_blank">MERLIN Annotations / Annotation structure</a>.</span></p> <div id="anchor8"></div> <h3><a href="#anchor8" onClick="toggle('#content8','#img8')"><img id="img8" src="img/toggle-expand.png"></a> Further specification of error categories</h3> <div id="content8" class="content"> <table cellpadding="2" cellspacing="0"> <tr> <td width="170"><strong>add</strong></td> <td>superfluous (added) element</td> </tr> <tr> <td width="170"><strong>ambig</strong></td> <td>ambigues - type of error can't be specified</td> </tr> <tr> <td width="170"><strong>ch</strong></td> <td> wrong choice of element </td> </tr> <tr> <td width="170"><strong>merge</strong></td> <td>elements are wrongly merged</td> </tr> <tr> <td width="170"><strong>o</strong></td> <td>omitted element </td> </tr> <tr> <td width="170"><strong>pos</strong></td> <td>wrong position</td> </tr> <tr> <td width="170"><strong>split</strong></td> <td>elements are wrongly split</td> </tr> </table> </div> <p> </p> <h2 dir="ltr"><a name="metadata" id="metadata"></a>3 Narrowing the search using metadata </h2> <p dir="ltr">Use ANNIS's <strong><em>Query Builder </em></strong>to search for features or a combination of features while narrowing the query based on specific metadata.</p> <ol> <li dir="ltr" aria-level="1"> Open the ↘ <em><strong>Query Builder</strong></em> in ANNIS. </li> <li dir="ltr" aria-level="1"> Choose <em>↘ </em><strong><em>Word sequences and meta information</em></strong>. </li> <li dir="ltr" aria-level="1"> Select the corresponding feature and its attribute under <strong><em>Linguistic sequence</em></strong> ↘ <strong><em>Initialize</em></strong> ↘ <strong><em>Add</em></strong>.</li> <li dir="ltr" aria-level="1"> In the <strong><em>Toolbar</em></strong>, click on ↘ <strong><em>Create AQL Query</em></strong> to paste the query into the search field. </li> </ol> <p>To restrict the query to a specific group of learners (e. g. by L1 or age) or a specific CEFR level (fair rating), select a metadata category before pasting the query into the search field (step 4) under ↘ <em><strong>Meta information</strong></em> ↘ <strong><em>Add</em></strong> and tick the required attribute, e. g.: </p> <p> <table> <tbody> <tr> <td><strong>_rating_fair_cefr</strong></td> <td> CEFR level the test received in the re-rating<br /></td> </tr> <tr> <td><strong>_author_L1</strong></td> <td>Mother tongue of the learner </td> </tr> <tr> <td><strong>_task_topic</strong></td> <td>Task preceding the text</td> </tr> </tbody> </table> </p> <p><strong> Alternatively</strong> you can copy the feature you are searching for from the feature list under <a href="#annotations">section 2</a> and paste it into the ANNIS search field. Then, add the metadata using the following scheme, to restrict your search to specific texts:</p> <ul> <li dir="ltr" aria-level="1"> <em>& meta::_rating_fair_cefr="B1"</em> [A1, A2, B1+, B2] </li> <li><em>& meta::_author_L1="German" </em>[English, Russian, Arabic, etc.]</li> </ul> <p><img src="img/hint_bulb.png" alt="hint bulb" /><span class="StilSmall"> The <a href="https://korpling.github.io/ANNIS/4.5/user-guide/interface/index.html" target="_blank" class="reference">ANNIS User Guide</a> offers a thorough introduction to using the ANNIS interface.</span></p> <h2 dir="ltr"><a name="freqanalysis" id="freqanalysis"></a>4 Retrieve statistical informationen </h2> <p dir="ltr">To get an indication of the frequency of certain L2 features use the ANNIS search. </p> <ol> <li> Search for specific L2 features as described in section 2 or use <a href="#globEA_category">global error categories</a>.</li> <li> Then, click on ↘ <em><strong>Frequency Analysis </strong></em>[2] and subsequently on the right on ↘ <em><strong>Perform Frequency Analysis</strong></em>. You will retrieve a statistical analysis of the annotated features within the category in question. </li> <li>Amend your query according to the following scheme, to restrict the search to a certain CEFR level, e. g. B1: <em><strong>& meta::_rating_fair_cefr="B1"</strong>. </em></li> </ol> <p><img src="img/ANNIS-FREQ-ANALYSIS.png" alt="Freq Analysis" width="100%" /></p> <h4><a name="globEA_category" id="globEA_category"></a>Global error categories <br /> </h4> <table> <tbody> <tr> <td>EA_category=/G_.*/</td> <td>phenomena at the grammatical level</td> </tr> <tr> <td>EA_category=/O_.*/</td> <td>phenomena at the orthographical level </td> </tr> <tr> <td>EA_category=/H_.*/</td> <td>phenomena at the level of intellegibility</td> </tr> <tr> <td>EA_category=/C_.*/</td> <td>phenomena at the level of coherence / cohesion </td> </tr> <tr> <td>EA_category=/V_.*/</td> <td>phenomena at the lexical level</td> </tr> <tr> <td>EA_category=/S_.*/</td> <td>phenomena at the level of sociolinguistic appropriateness </td> </tr> <tr> <td>EA_category=/P_.*/</td> <td>phenomena at the pragmatic level</td> </tr> </tbody> </table> </p> </div> <!--INSERT END--> </div> </div> </div> </div>