Comments (2)
I was now able to check this, and at least with English texts it works out of the box:
library(udpipe)
library(koRpus.lang.en)

# download and load the English udpipe model
model <- udpipe_download_model(language="english")
ud_en <- udpipe_load_model(model$file_model)

# annotate the text with udpipe
x <- udpipe_annotate(ud_en, x="This is my sample text.")

# import the annotation into koRpus, mapping udpipe's columns to token, tag and lemma
x_kRp <- readTagged(
  as.data.frame(x),
  lang="en",
  tagger="manual",
  doc_id=as.data.frame(x)[1,"doc_id"],
  mtx_cols=c(
    token="token",
    tag="xpos",
    lemma="lemma"
  )
)
x_kRp can now be used with koRpus.
hi,
this is currently not supported out of the box. looks like a worthwhile feature request ;)
the tagged data frame would have to be converted into a compatible set of columns and the tagset used would have to be checked for compatibility with koRpus' language packages.
in theory, importing the result with koRpus::readTagged() should be straightforward, then.
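The two preparation steps described above (mapping the columns, then checking the tagset) could be sketched roughly as follows. This is only an illustration in base R: the data frame is a made-up stand-in for the output of as.data.frame(udpipe_annotate(...)), and known_tags, to_tagged_cols() and unknown_tags() are hypothetical names, not part of koRpus or udpipe. In practice the list of valid tags would have to come from the koRpus language package in use.

```r
# Toy stand-in for an annotated data frame; the values are made up for illustration
annotated <- data.frame(
  doc_id = "doc1",
  token  = c("This", "is", "my", "sample", "text", "."),
  lemma  = c("this", "be", "my", "sample", "text", "."),
  xpos   = c("DT", "VBZ", "PRP$", "NN", "NN", "."),
  stringsAsFactors = FALSE
)

# Hypothetical tag list, deliberately incomplete so the check has something to report.
# It makes no statement about what koRpus.lang.en actually supports.
known_tags <- c("DT", "VBZ", "NN", "JJ", ".")

# Names are the target columns, values are the source columns
col_map <- c(token = "token", tag = "xpos", lemma = "lemma")

# Select the source columns and rename them to the target names
to_tagged_cols <- function(df, col_map) {
  missing_cols <- setdiff(col_map, names(df))
  if (length(missing_cols) > 0) {
    stop("missing columns: ", paste(missing_cols, collapse = ", "))
  }
  out <- df[, unname(col_map), drop = FALSE]
  names(out) <- names(col_map)
  out
}

# Return every tag that occurs in the data but is not in the known tag list
unknown_tags <- function(tags, known) {
  setdiff(unique(tags), known)
}

tagged  <- to_tagged_cols(annotated, col_map)
unknown <- unknown_tags(tagged$tag, known_tags)

if (length(unknown) > 0) {
  message("tags not in the known tagset: ", paste(unknown, collapse = ", "))
}
```

If a check like this reports unknown tags, they would need to be mapped to tags the language package knows, or the tagset extended, before the import can be relied on.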