documentation-platform:data-processing [2024/01/12 14:19] – removed - external edit (Unknown date) 127.0.0.1
documentation-platform:data-processing [2024/04/10 10:17] (current) Seraina Nadig
 +<WRAP twothirds column>
 +====== Data processing and analysis ======
 +</WRAP>
 +
 +<WRAP colsmall><WRAP rightalign>[[documentation-platform:start|Back to the overview]]</WRAP></WRAP>
 +<WRAP clear/>
 +
 +As every field has its own ways of analysing data, best practices for data processing depend heavily on the methods you choose for your research. However, some principles apply to all kinds of research:
 +<WRAP center round box 80%>
 +  * **Keep several copies of your data:** \\ It is important to have both physical and virtual copies of your research data as backups. It is also advisable to work with a systematic versioning system.
 +  * **Ensure the integrity of your data:** \\ Take measures to make sure your data is accurate, consistent and complete, e.g. by using automation to prevent mistakes that arise from manual data entry. Chapter 3 of the CESSDA Data Management Expert Guide contains a detailed guide on this topic: [[https://dmeg.cessda.eu/Data-Management-Expert-Guide/3.-Process/Data-entry-and-integrity|Data entry and integrity]]
 +  * **Choose interoperable file formats:** \\ When processing data, you may have to decide on file formats for the output of your analysis. Make sure to use file formats that have high compatibility and are widely used (see [[documentation-platform:standard-data-formats]]).
 +  * **Be careful with personal/sensitive data:** \\ If your data contains personal information, use anonymization / de-identification procedures before carrying out data analysis (see [[documentation-platform:data-protection]]).
 +  * **Implement data security measures:** \\ Make sure your data is stored securely and can only be accessed by authorized users (see [[documentation-platform:data-security]]).
 +
 +</WRAP>
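
The first two points above (keeping several copies and ensuring integrity) can be combined in a small automated check. The following is a minimal sketch in Python, not a recommended or official tool: it assumes that a backup mirrors the folder structure of the original data, and the function names ''sha256_of'' and ''find_mismatches'' are hypothetical names chosen for illustration. It compares SHA-256 checksums to report files that are missing from the backup or differ from it.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_mismatches(original_dir: Path, backup_dir: Path) -> list[str]:
    """List relative paths that are missing from the backup or differ from it.

    Assumption: the backup mirrors the folder structure of the original.
    """
    mismatches = []
    for source in sorted(original_dir.rglob("*")):
        if not source.is_file():
            continue
        relative = source.relative_to(original_dir)
        copy = backup_dir / relative
        if not copy.is_file() or sha256_of(source) != sha256_of(copy):
            mismatches.append(relative.as_posix())
    return mismatches
```

Calling ''find_mismatches(Path("data"), Path("backup/data"))'' (both paths are placeholders) returns an empty list when every file in the original folder has an identical copy in the backup. The check only runs in one direction: files that exist only in the backup are not reported.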
 +
 +----
 +
 +===== Resources =====
 +We recommend familiarizing yourself with the tools that could be useful for processing your research data:
 +
 +{{:documentation-platform:sshoc-logo.png?nolink&200|}} <wrap button> [[https://marketplace.sshopencloud.eu/|SSH Open Marketplace]]</wrap>\\ The SSH Open Marketplace is a European discovery platform for resources from the Social Sciences and Humanities (SSH). It offers not only language resources but also workflows that are carefully described in step-by-step guides. For example, the Marketplace includes a workflow on linguistic annotation of corpora.
 +
 +----
 +
 +{{:clarin_europe.png?nolink&200|}} <wrap button>[[https://www.clarin.eu/content/tools|CLARIN Tools]]</wrap>
 +
 +CLARIN centers offer a wide variety of tools that help researchers explore and analyse language data. A single interface brings these tools together:
 +
 +The [[https://switchboard.clarin.eu/|CLARIN Language Resource Switchboard]] is a tool that helps you to find a matching language processing web application for your data. After uploading a file or entering a URL, you can select which task to perform. The Switchboard will then provide you with a list of available CLARIN tools to analyse the input.
 +
 +**Have you developed your own tool which could be useful for other researchers?** You can add it to the Switchboard Tool Registry. Find out more about sharing your tools [[documentation-platform:data-sharing|here]].
 +
 +----
 +
 +{{:documentation-platform:forschungsdaten.info-weiss.png?nolink&200|}} <wrap button>[[https://forschungsdaten.info/themen/organisieren-und-aufbereiten/|Forschungsdaten.info]]</wrap> \\ This website, designed for researchers from the DACH countries (Germany, Austria and Switzerland), covers many research data management topics in great detail. You might find specific information that is relevant for your research project, for example here:
 +
 +[[https://forschungsdaten.info/praxis-kompakt/tools/|Useful tools for research data management]]\\
 +[[https://forschungsdaten.info/themen/organisieren-und-aufbereiten/bearbeiten-und-analysieren-grosser-daten/|Working with large amounts of data]]\\
 +[[https://forschungsdaten.info/themen/organisieren-und-aufbereiten/datenvisualisierung/|Visualizing data]]\\
 +[[https://forschungsdaten.info/themen/organisieren-und-aufbereiten/datenuebertragung/|Data transfer when working with sensitive data]] \\ (see also the [[working-groups:sensitive-personal-data|CLARIN-CH working group]] on this topic)
 +
 +/* TO DO: find other tools
 +    * Tools for processing/converting data:
 +          * https://corpus-tools.org/home/ \\ Collection of tools (ANNIS, Hexatomic, Pepper, Salt) for facilitating interoperability of corpora, created by the Humboldt-Universität zu Berlin */
 +
 +
  
documentation-platform/data-processing.txt · Last modified: 2024/04/10 10:17 (external edit)