Package org.getalp.dbnary.languages
Class OntolexBasedRDFDataHandler
- java.lang.Object
-
- org.getalp.dbnary.model.DbnaryModel
-
- org.getalp.dbnary.languages.OntolexBasedRDFDataHandler
-
- All Implemented Interfaces:
IWiktionaryDataHandler
- Direct Known Subclasses:
PostTranslationDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
,WiktionaryDataHandler
public class OntolexBasedRDFDataHandler extends DbnaryModel implements IWiktionaryDataHandler
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description protected static class
OntolexBasedRDFDataHandler.PosAndType
-
Field Summary
Fields Modifier and Type Field Description protected org.apache.jena.rdf.model.Model
aBox
Deprecated.use getFeatureBox()protected org.apache.jena.rdf.model.Resource
currentCanonicalForm
protected String
currentEncodedLexicalEntryName
protected org.apache.jena.rdf.model.Resource
currentLexEntry
Deprecated.protected LexicalEntry
currentLexicalEntry
protected CounterSet
currentLexieCount
Deprecated.protected org.apache.jena.rdf.model.Resource
currentMainLexEntry
Deprecated.should rely on currentPage (not a Resource anymoreprotected Page
currentPage
protected org.apache.jena.rdf.model.Resource
currentSense
protected HierarchicalSenseNumber
currentSenseNumber
protected Set<PronunciationPair>
currentSharedPronunciations
protected AbstractGlossFilter
glossFilter
protected org.apache.jena.rdf.model.Resource
lexvoExtractedLanguage
protected org.apache.jena.rdf.model.Resource
lexvoSectionLanguage
protected String
longEditionLanguageCode
protected String
longSectionLanguageCode
protected int
nbEntries
protected static HashMap<String,OntolexBasedRDFDataHandler.PosAndType>
posAndTypeValueMap
protected CounterSet
reifiedNymCount
protected String
shortEditionLanguageCode
protected String
shortSectionLanguageCode
protected CounterSet
translationCount
protected String
WIKT
-
Fields inherited from class org.getalp.dbnary.model.DbnaryModel
DBNARY_NS_PREFIX, LEXVO, tBox
-
-
Constructor Summary
Constructors Constructor Description OntolexBasedRDFDataHandler(String longEditionLanguageCode, String tdbDir)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected void
addOtherFormPropertiesToLexicalEntry(org.apache.jena.rdf.model.Resource lexEntry, HashSet<PropertyObjectPair> properties)
org.apache.jena.rdf.model.Resource
addTo(org.apache.jena.rdf.model.Resource target, Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> pv)
org.apache.jena.rdf.model.Resource
addToCurrentWordSense(Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> pv)
void
buildDatacubeObservations(String l, TranslationGlossesStat tgs, EvaluationStats.Stat es, String dumpFileVersion)
void
closeDataset()
close the dataset that eventually backs up the different feature boxes.protected String
computeNymId(String nym)
protected String
computeNymId(String nym, String pagename)
protected String
computeOtherFormResourceName(org.apache.jena.rdf.model.Resource lexEntry, HashSet<PropertyObjectPair> properties)
protected String
computeSenseNum()
void
computeStatistics(org.apache.jena.rdf.model.Model statsModel, org.apache.jena.rdf.model.Model sourceModel, String dumpVersion)
static void
createGeneralStatisticsObservation(org.apache.jena.rdf.model.Model statsBox, String dumpVersion, String prefix, String lang, long translationCount, long pageCount, long entryCount, long senseCount)
org.apache.jena.rdf.model.Resource
createGlossResource(String gloss)
org.apache.jena.rdf.model.Resource
createGlossResource(String gloss, int rank)
org.apache.jena.rdf.model.Resource
createGlossResource(StructuredGloss gloss)
org.apache.jena.rdf.model.Resource
createGlossResource(StructuredGloss gloss, int rank)
static void
createNymRelationObservation(org.apache.jena.rdf.model.Model box, String dumpVersion, String prefix, String lang, NymRelation nym, long cr)
static void
createTranslationObservation(org.apache.jena.rdf.model.Model statsBox, String dumpVersion, String prefix, String sourceLanguage, String targetLanguage, long c)
org.apache.jena.rdf.model.Resource
currentLexinfoPos()
String
currentPagename()
String
currentWiktionaryPos()
protected OntolexBasedRDFDataHandler.PosAndType
decodePartOfSpeech(String pos)
void
dump(org.apache.jena.rdf.model.Model box, OutputStream out, String format)
Write a serialized represention of this model in a specified language.void
dumpAllFeaturesAsHDT(OutputStream ostream, boolean isExolex)
void
enableEndolexFeatures(ExtractionFeature f)
Enable the extraction of morphological data in a second Model if available.void
enableExolexFeatures(ExtractionFeature f)
void
finalizeLanguageSection()
void
finalizePageExtraction()
String
getCurrentEntryLanguage()
returns the language of the current EntryString
getEncodedPageName(String pageName, String pos, int defNumber)
org.apache.jena.rdf.model.Model
getEndolexFeatureBox(ExtractionFeature f)
org.apache.jena.rdf.model.Model
getExolexFeatureBox(ExtractionFeature f)
String
getExtractedLanguage()
returns the short (2 letter code) id of the language of the language editionorg.apache.jena.rdf.model.Model
getFeatureBox(ExtractionFeature f)
AbstractGlossFilter
getGlossFilter()
protected String
getGlossResourceName(StructuredGloss gloss)
org.apache.jena.rdf.model.Resource
getLexEntry(String languageCode, String pageName, String pos, int defNumber)
org.apache.jena.rdf.model.Resource
getLexEntry(String encodedPageName, org.apache.jena.rdf.model.Resource typeResource)
org.apache.jena.rdf.model.Resource
getPageResource(String page)
org.apache.jena.rdf.model.Resource
getPageResource(String vocable, boolean dontLinkWithType)
String
getPageResourceIRI(String vocable)
String
getPrefix()
void
initializeLanguageSection(String language)
void
initializeLexicalEntry(String pos)
protected org.apache.jena.rdf.model.Resource
initializeLexicalEntry(String pos, org.apache.jena.rdf.model.Resource lexinfoPOS, org.apache.jena.rdf.model.Resource type)
org.apache.jena.rdf.model.Resource
initializeLexicalEntry__noModel(String originalPOS, org.apache.jena.rdf.model.Resource normalizedPOS, org.apache.jena.rdf.model.Resource normalizedType)
void
initializePageExtraction(String wiktionaryPageName)
boolean
isDisabled(ExtractionFeature f)
protected boolean
isResourceCompatible(org.apache.jena.rdf.model.Resource r, HashSet<PropertyObjectPair> properties)
protected void
mergePropertiesIntoResource(HashSet<PropertyObjectPair> properties, org.apache.jena.rdf.model.Resource res)
int
nbEntries()
void
populateMetadata(org.apache.jena.rdf.model.Model metadataModel, org.apache.jena.rdf.model.Model sourceModel, String dumpFilename, String extractorVersion, boolean isExolex)
org.apache.jena.rdf.model.Resource
posResource(String pos)
org.apache.jena.rdf.model.Resource
posResource(OntolexBasedRDFDataHandler.PosAndType pat)
void
registerAlternateSpelling(String alt)
org.apache.jena.rdf.model.Resource
registerExample(String ex, Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> context)
Register example ex for the current lexical sense.org.apache.jena.rdf.model.Resource
registerExample(Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> context)
void
registerInflection(String languageCode, String pos, String inflection, String canonicalForm, int defNumber, HashSet<PropertyObjectPair> props)
void
registerInflection(String languageCode, String pos, String inflection, String canonicalForm, int defNumber, HashSet<PropertyObjectPair> props, HashSet<PronunciationPair> pronunciations)
void
registerInflection(InflectionData key, Set<String> value)
org.apache.jena.rdf.model.Resource
registerNewDefinition(String def)
Register definition def for the current lexical entry.org.apache.jena.rdf.model.Resource
registerNewDefinition(String def, int lvl)
Register definition def for the current lexical entry.org.apache.jena.rdf.model.Resource
registerNewDefinition(String def, String senseNumber)
Register a definition with the given sense number.void
registerNymRelation(String target, String synRelation)
void
registerNymRelation(String target, String synRelation, org.apache.jena.rdf.model.Resource gloss, String usage)
void
registerNymRelationOnCurrentSense(String target, String synRelation, org.apache.jena.rdf.model.Resource gloss, String usage)
void
registerNymRelationToEntity(String target, String synRelation, org.apache.jena.rdf.model.Resource entity)
void
registerNymRelationToEntity(String target, String nymRelation, org.apache.jena.rdf.model.Resource entity, org.apache.jena.rdf.model.Resource gloss, String usage)
void
registerPronunciation(String pron, String lang)
protected void
registerPronunciation(org.apache.jena.rdf.model.Resource writtenRepresentation, String pron, String lang)
void
registerPropertyOnCanonicalForm(org.apache.jena.rdf.model.Property p, org.apache.jena.rdf.model.RDFNode r)
void
registerPropertyOnLexicalEntry(org.apache.jena.rdf.model.Property p, org.apache.jena.rdf.model.RDFNode r)
void
registerTranslation(String lang, org.apache.jena.rdf.model.Resource currentGloss, String usage, String word)
protected org.apache.jena.rdf.model.Resource
registerTranslationToEntity(org.apache.jena.rdf.model.Resource entity, String lang, org.apache.jena.rdf.model.Resource currentGloss, String usage, String word)
org.apache.jena.rdf.model.Resource
typeResource(String pos)
org.apache.jena.rdf.model.Resource
typeResource(OntolexBasedRDFDataHandler.PosAndType pat)
-
Methods inherited from class org.getalp.dbnary.model.DbnaryModel
setGlobalDbnaryPrefix, uriEncode, uriEncode, uriEncode
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.getalp.dbnary.api.IWiktionaryDataHandler
registerNymRelationOnCurrentSense
-
-
-
-
Field Detail
-
aBox
@Deprecated protected org.apache.jena.rdf.model.Model aBox
Deprecated.use getFeatureBox()
-
currentLexEntry
@Deprecated protected org.apache.jena.rdf.model.Resource currentLexEntry
Deprecated.
-
currentLexicalEntry
protected LexicalEntry currentLexicalEntry
-
currentSense
protected org.apache.jena.rdf.model.Resource currentSense
-
currentSenseNumber
protected HierarchicalSenseNumber currentSenseNumber
-
translationCount
protected CounterSet translationCount
-
reifiedNymCount
protected CounterSet reifiedNymCount
-
shortEditionLanguageCode
protected final String shortEditionLanguageCode
-
longEditionLanguageCode
protected final String longEditionLanguageCode
-
shortSectionLanguageCode
protected String shortSectionLanguageCode
-
longSectionLanguageCode
protected String longSectionLanguageCode
-
lexvoExtractedLanguage
protected org.apache.jena.rdf.model.Resource lexvoExtractedLanguage
-
lexvoSectionLanguage
protected org.apache.jena.rdf.model.Resource lexvoSectionLanguage
-
nbEntries
protected int nbEntries
-
WIKT
protected String WIKT
-
currentEncodedLexicalEntryName
protected String currentEncodedLexicalEntryName
-
currentLexieCount
@Deprecated protected CounterSet currentLexieCount
Deprecated.
-
currentMainLexEntry
@Deprecated protected org.apache.jena.rdf.model.Resource currentMainLexEntry
Deprecated.should rely on currentPage (not a Resource anymore
-
currentPage
protected Page currentPage
-
currentCanonicalForm
protected org.apache.jena.rdf.model.Resource currentCanonicalForm
-
currentSharedPronunciations
protected Set<PronunciationPair> currentSharedPronunciations
-
posAndTypeValueMap
protected static HashMap<String,OntolexBasedRDFDataHandler.PosAndType> posAndTypeValueMap
-
glossFilter
protected final AbstractGlossFilter glossFilter
-
-
Method Detail
-
getCurrentEntryLanguage
public String getCurrentEntryLanguage()
returns the language of the current Entry- Specified by:
getCurrentEntryLanguage
in interfaceIWiktionaryDataHandler
- Returns:
- a language code
-
getExtractedLanguage
public String getExtractedLanguage()
Description copied from interface:IWiktionaryDataHandler
returns the short (2 letter code) id of the language of the language edition- Specified by:
getExtractedLanguage
in interfaceIWiktionaryDataHandler
- Returns:
- wiktionary edition short language code
-
closeDataset
public void closeDataset()
Description copied from interface:IWiktionaryDataHandler
close the dataset that eventually backs up the different feature boxes.Does nothing when there is no dataset backing up the boxes.
- Specified by:
closeDataset
in interfaceIWiktionaryDataHandler
-
enableEndolexFeatures
public void enableEndolexFeatures(ExtractionFeature f)
Description copied from interface:IWiktionaryDataHandler
Enable the extraction of morphological data in a second Model if available.- Specified by:
enableEndolexFeatures
in interfaceIWiktionaryDataHandler
- Parameters:
f
- Feature
-
getFeatureBox
public org.apache.jena.rdf.model.Model getFeatureBox(ExtractionFeature f)
- Specified by:
getFeatureBox
in interfaceIWiktionaryDataHandler
-
getEndolexFeatureBox
public org.apache.jena.rdf.model.Model getEndolexFeatureBox(ExtractionFeature f)
- Specified by:
getEndolexFeatureBox
in interfaceIWiktionaryDataHandler
-
getExolexFeatureBox
public org.apache.jena.rdf.model.Model getExolexFeatureBox(ExtractionFeature f)
- Specified by:
getExolexFeatureBox
in interfaceIWiktionaryDataHandler
-
enableExolexFeatures
public void enableExolexFeatures(ExtractionFeature f)
- Specified by:
enableExolexFeatures
in interfaceIWiktionaryDataHandler
-
isDisabled
public boolean isDisabled(ExtractionFeature f)
- Specified by:
isDisabled
in interfaceIWiktionaryDataHandler
-
initializePageExtraction
public void initializePageExtraction(String wiktionaryPageName)
- Specified by:
initializePageExtraction
in interfaceIWiktionaryDataHandler
-
finalizePageExtraction
public void finalizePageExtraction()
- Specified by:
finalizePageExtraction
in interfaceIWiktionaryDataHandler
-
initializeLanguageSection
public void initializeLanguageSection(String language)
- Specified by:
initializeLanguageSection
in interfaceIWiktionaryDataHandler
-
finalizeLanguageSection
public void finalizeLanguageSection()
- Specified by:
finalizeLanguageSection
in interfaceIWiktionaryDataHandler
-
getLexEntry
public org.apache.jena.rdf.model.Resource getLexEntry(String languageCode, String pageName, String pos, int defNumber)
-
getLexEntry
public org.apache.jena.rdf.model.Resource getLexEntry(String encodedPageName, org.apache.jena.rdf.model.Resource typeResource)
-
currentWiktionaryPos
public String currentWiktionaryPos()
- Specified by:
currentWiktionaryPos
in interfaceIWiktionaryDataHandler
-
currentLexinfoPos
public org.apache.jena.rdf.model.Resource currentLexinfoPos()
- Specified by:
currentLexinfoPos
in interfaceIWiktionaryDataHandler
-
posResource
public org.apache.jena.rdf.model.Resource posResource(OntolexBasedRDFDataHandler.PosAndType pat)
-
typeResource
public org.apache.jena.rdf.model.Resource typeResource(OntolexBasedRDFDataHandler.PosAndType pat)
-
posResource
public org.apache.jena.rdf.model.Resource posResource(String pos)
-
typeResource
public org.apache.jena.rdf.model.Resource typeResource(String pos)
-
decodePartOfSpeech
protected OntolexBasedRDFDataHandler.PosAndType decodePartOfSpeech(String pos)
-
initializeLexicalEntry
public void initializeLexicalEntry(String pos)
- Specified by:
initializeLexicalEntry
in interfaceIWiktionaryDataHandler
-
initializeLexicalEntry
protected org.apache.jena.rdf.model.Resource initializeLexicalEntry(String pos, org.apache.jena.rdf.model.Resource lexinfoPOS, org.apache.jena.rdf.model.Resource type)
-
initializeLexicalEntry__noModel
public org.apache.jena.rdf.model.Resource initializeLexicalEntry__noModel(String originalPOS, org.apache.jena.rdf.model.Resource normalizedPOS, org.apache.jena.rdf.model.Resource normalizedType)
-
registerPropertyOnCanonicalForm
public void registerPropertyOnCanonicalForm(org.apache.jena.rdf.model.Property p, org.apache.jena.rdf.model.RDFNode r)
- Specified by:
registerPropertyOnCanonicalForm
in interfaceIWiktionaryDataHandler
-
registerPropertyOnLexicalEntry
public void registerPropertyOnLexicalEntry(org.apache.jena.rdf.model.Property p, org.apache.jena.rdf.model.RDFNode r)
- Specified by:
registerPropertyOnLexicalEntry
in interfaceIWiktionaryDataHandler
-
registerAlternateSpelling
public void registerAlternateSpelling(String alt)
- Specified by:
registerAlternateSpelling
in interfaceIWiktionaryDataHandler
-
registerNewDefinition
public org.apache.jena.rdf.model.Resource registerNewDefinition(String def)
Description copied from interface:IWiktionaryDataHandler
Register definition def for the current lexical entry.This method will compute a sense number based on the rank of the definition in the entry.
It is equivalent to registerNewDefinition(def, 1);
- Specified by:
registerNewDefinition
in interfaceIWiktionaryDataHandler
- Parameters:
def
- a string- Returns:
-
registerNewDefinition
public org.apache.jena.rdf.model.Resource registerNewDefinition(String def, int lvl)
Description copied from interface:IWiktionaryDataHandler
Register definition def for the current lexical entry.This method will compute a sense number based on the rank of the definition in the entry, taking into account the level of the definition. 1, 1a, 1b, 1c, 2, etc.
- Specified by:
registerNewDefinition
in interfaceIWiktionaryDataHandler
- Parameters:
def
- the definition stringlvl
- an integer giving the level of the definition (1 or 2).- Returns:
-
registerNewDefinition
public org.apache.jena.rdf.model.Resource registerNewDefinition(String def, String senseNumber)
Register a definition with the given sense number.- Specified by:
registerNewDefinition
in interfaceIWiktionaryDataHandler
- Parameters:
def
- the definition stringsenseNumber
- a string giving the sense number of the definition.- Returns:
-
computeSenseNum
protected String computeSenseNum()
-
registerTranslationToEntity
protected org.apache.jena.rdf.model.Resource registerTranslationToEntity(org.apache.jena.rdf.model.Resource entity, String lang, org.apache.jena.rdf.model.Resource currentGloss, String usage, String word)
-
registerTranslation
public void registerTranslation(String lang, org.apache.jena.rdf.model.Resource currentGloss, String usage, String word)
- Specified by:
registerTranslation
in interfaceIWiktionaryDataHandler
-
getPageResource
public org.apache.jena.rdf.model.Resource getPageResource(String vocable, boolean dontLinkWithType)
-
getPageResource
public org.apache.jena.rdf.model.Resource getPageResource(String page)
-
mergePropertiesIntoResource
protected void mergePropertiesIntoResource(HashSet<PropertyObjectPair> properties, org.apache.jena.rdf.model.Resource res)
-
isResourceCompatible
protected boolean isResourceCompatible(org.apache.jena.rdf.model.Resource r, HashSet<PropertyObjectPair> properties)
-
addOtherFormPropertiesToLexicalEntry
protected void addOtherFormPropertiesToLexicalEntry(org.apache.jena.rdf.model.Resource lexEntry, HashSet<PropertyObjectPair> properties)
-
computeOtherFormResourceName
protected String computeOtherFormResourceName(org.apache.jena.rdf.model.Resource lexEntry, HashSet<PropertyObjectPair> properties)
-
registerInflection
public void registerInflection(String languageCode, String pos, String inflection, String canonicalForm, int defNumber, HashSet<PropertyObjectPair> props, HashSet<PronunciationPair> pronunciations)
- Specified by:
registerInflection
in interfaceIWiktionaryDataHandler
-
registerInflection
public void registerInflection(String languageCode, String pos, String inflection, String canonicalForm, int defNumber, HashSet<PropertyObjectPair> props)
- Specified by:
registerInflection
in interfaceIWiktionaryDataHandler
- Parameters:
languageCode
- the language code of the inflectionpos
- the part of speech of the inflected forminflection
- inflected formcanonicalForm
- canonical formdefNumber
- definition number of the word sense associated to the formprops
- morpho syntactic properties to be registered with the inflected form
-
registerInflection
public void registerInflection(InflectionData key, Set<String> value)
- Specified by:
registerInflection
in interfaceIWiktionaryDataHandler
-
registerNymRelationToEntity
public void registerNymRelationToEntity(String target, String synRelation, org.apache.jena.rdf.model.Resource entity)
-
registerNymRelationToEntity
public void registerNymRelationToEntity(String target, String nymRelation, org.apache.jena.rdf.model.Resource entity, org.apache.jena.rdf.model.Resource gloss, String usage)
-
registerNymRelation
public void registerNymRelation(String target, String synRelation)
- Specified by:
registerNymRelation
in interfaceIWiktionaryDataHandler
-
getGlossFilter
public AbstractGlossFilter getGlossFilter()
- Specified by:
getGlossFilter
in interfaceIWiktionaryDataHandler
-
createGlossResource
public org.apache.jena.rdf.model.Resource createGlossResource(String gloss)
- Specified by:
createGlossResource
in interfaceIWiktionaryDataHandler
-
createGlossResource
public org.apache.jena.rdf.model.Resource createGlossResource(StructuredGloss gloss)
- Specified by:
createGlossResource
in interfaceIWiktionaryDataHandler
-
createGlossResource
public org.apache.jena.rdf.model.Resource createGlossResource(String gloss, int rank)
- Specified by:
createGlossResource
in interfaceIWiktionaryDataHandler
-
createGlossResource
public org.apache.jena.rdf.model.Resource createGlossResource(StructuredGloss gloss, int rank)
- Specified by:
createGlossResource
in interfaceIWiktionaryDataHandler
-
getGlossResourceName
protected String getGlossResourceName(StructuredGloss gloss)
-
registerNymRelation
public void registerNymRelation(String target, String synRelation, org.apache.jena.rdf.model.Resource gloss, String usage)
- Specified by:
registerNymRelation
in interfaceIWiktionaryDataHandler
-
registerNymRelationOnCurrentSense
public void registerNymRelationOnCurrentSense(String target, String synRelation, org.apache.jena.rdf.model.Resource gloss, String usage)
- Specified by:
registerNymRelationOnCurrentSense
in interfaceIWiktionaryDataHandler
-
registerPronunciation
public void registerPronunciation(String pron, String lang)
- Specified by:
registerPronunciation
in interfaceIWiktionaryDataHandler
-
registerPronunciation
protected void registerPronunciation(org.apache.jena.rdf.model.Resource writtenRepresentation, String pron, String lang)
-
dump
public void dump(org.apache.jena.rdf.model.Model box, OutputStream out, String format)
Description copied from interface:IWiktionaryDataHandler
Write a serialized represention of this model in a specified language. The language in which to write the model is specified by the lang argument. Predefined values are "RDF/XML", "RDF/XML-ABBREV", "N-TRIPLE", "TURTLE", (and "TTL") and "N3". The default value, represented by null, is "RDF/XML".- Specified by:
dump
in interfaceIWiktionaryDataHandler
- Parameters:
box
- the Model to be dumpedout
- an OutputStreamformat
- a String
-
dumpAllFeaturesAsHDT
public void dumpAllFeaturesAsHDT(OutputStream ostream, boolean isExolex)
- Specified by:
dumpAllFeaturesAsHDT
in interfaceIWiktionaryDataHandler
-
nbEntries
public int nbEntries()
- Specified by:
nbEntries
in interfaceIWiktionaryDataHandler
-
currentPagename
public String currentPagename()
- Specified by:
currentPagename
in interfaceIWiktionaryDataHandler
-
getPrefix
public String getPrefix()
-
addTo
public org.apache.jena.rdf.model.Resource addTo(org.apache.jena.rdf.model.Resource target, Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> pv)
-
addToCurrentWordSense
public org.apache.jena.rdf.model.Resource addToCurrentWordSense(Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> pv)
- Specified by:
addToCurrentWordSense
in interfaceIWiktionaryDataHandler
-
registerExample
public org.apache.jena.rdf.model.Resource registerExample(String ex, Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> context)
Description copied from interface:IWiktionaryDataHandler
Register example ex for the current lexical sense.- Specified by:
registerExample
in interfaceIWiktionaryDataHandler
- Parameters:
ex
- the example stringcontext
- map of property + RDFNode that are to be attached to the example object.- Returns:
- a Resource
-
registerExample
public org.apache.jena.rdf.model.Resource registerExample(Set<org.apache.commons.lang3.tuple.Pair<org.apache.jena.rdf.model.Property,org.apache.jena.rdf.model.RDFNode>> context)
-
computeStatistics
public void computeStatistics(org.apache.jena.rdf.model.Model statsModel, org.apache.jena.rdf.model.Model sourceModel, String dumpVersion)
- Specified by:
computeStatistics
in interfaceIWiktionaryDataHandler
-
createGeneralStatisticsObservation
public static void createGeneralStatisticsObservation(org.apache.jena.rdf.model.Model statsBox, String dumpVersion, String prefix, String lang, long translationCount, long pageCount, long entryCount, long senseCount)
-
createNymRelationObservation
public static void createNymRelationObservation(org.apache.jena.rdf.model.Model box, String dumpVersion, String prefix, String lang, NymRelation nym, long cr)
-
createTranslationObservation
public static void createTranslationObservation(org.apache.jena.rdf.model.Model statsBox, String dumpVersion, String prefix, String sourceLanguage, String targetLanguage, long c)
-
populateMetadata
public void populateMetadata(org.apache.jena.rdf.model.Model metadataModel, org.apache.jena.rdf.model.Model sourceModel, String dumpFilename, String extractorVersion, boolean isExolex)
- Specified by:
populateMetadata
in interfaceIWiktionaryDataHandler
-
buildDatacubeObservations
public void buildDatacubeObservations(String l, TranslationGlossesStat tgs, EvaluationStats.Stat es, String dumpFileVersion)
- Specified by:
buildDatacubeObservations
in interfaceIWiktionaryDataHandler
-
-