METHODOLOGICAL PRINCIPLES FOR CONSTRUCTING LANGUAGE CORPORA BASED ON MEDIA DISCOURSE
Keywords:
media discourse, corpus design, representativeness, metadata enrichmentAbstract
The digital revolution has transformed media into a primary source of linguistic data. However, the transient and heterogeneous nature of media texts, namely, spanning news reports, social media posts, and multimedia broadcasts, requires a structured methodological approach. This article explores the core principles of corpus design, focusing on representativeness, sampling, metadata enrichment, and ethical considerations. By adhering to these principles, researchers can create robust datasets capable of supporting diachronic and synchronic linguistic analysis.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.