Persian Verbal Module for NooJ
--------------------------------

Author: Marzieh RABIEI
CRIT, Le Centre de Recherches Interdisciplinaires et Transculturelles, Université de Franche-Comté, Besançon, France
Date: 2026-01

############################################################################
MODULE CONTENT
(Dictionaries, morphological grammars, properties and corpus)

############################################################################
DICTIONARIES (files .dic / .nod – folder: Dictionaries)

This module contains lexical resources for contemporary Persian verbs, including
both simple verbs and compound verbs (light verb constructions).

- VERBS_V_1.dic
  Main dictionary of Persian verbs.
  It includes:
    • simple verbs
    • compound verbs of the form N + kardan, N + dashtan, etc.

Each lexical entry is associated with an inflectional paradigm defined
in the morphological grammar files (.nof), for example:
  FLX=KARDAN
  FLX=KARDAN_LVC
  FLX=DASHTAN_LVC

The compiled dictionary (.nod) is generated automatically by NooJ.

############################################################################
MORPHOLOGICAL GRAMMARS (files .nof – folder: Grammars)

- verbs.nof

This file defines the inflectional paradigms for Persian verbs.
It covers:
  • simple verbal inflection
  • compound verbs with light verbs
  • a specific distinction between simple verbs and verbs used
    in compositional contexts (LVC)

For compound verbs, dedicated paradigms (e.g. KARDAN_LVC) are used
to ensure that inflection applies exclusively to the light verb,
while preserving the compositional structure.

############################################################################
PROPERTIES FILE (properties.def – folder: Properties)

- properties.def

This file defines the linguistic features used in the module, including:
  • verbal category (V)
  • person, number, tense and mood features
  • syntactic valency (tr / intr)


############################################################################
CORPUS (files .not / .txt – folder: Corpus)

The module includes a Persian corpus used for testing and validation.
The corpus is extracted from a contemporary Persian literary work,
namely the novel *Savushun* by Simin Daneshvar, first published by
Kharazmi Publishing, Tehran (1969).

This corpus is used to verify:
  • verb recognition
  • inflectional correctness
  • the behavior of compound verbs in context

############################################################################
USAGE NOTES

To use this module in NooJ:
1. Load the dictionaries and grammars via Preferences → Lexical Analysis.
2. Open a Persian text and apply Lexical Analysis.
3. Verb forms (simple and compound) should be correctly recognized
   and annotated.

############################################################################
REMARKS

This module focuses on verbal morphology.
Syntactic disambiguation grammars are not included at this stage.

Any feedback or remarks are welcome.

