Comments (5)
Hi @klaapbakken ! This functionality is indeed supported via the ModelSpec
class. You can access the ModelSpec
instance associated with a model matrix generated by get_model_matrix
, e.g.:
from formulaic import model_matrix
df = ....
mm = model_matrix('y ~ x', df)
ms = mm.model_spec
You can then use that model spec to generate a new model matrix from new data using:
mm2 = ms.get_model_matrix(df2)
Hope that helps! In lieu of better documentation (still coming!), you can view the ModelSpec
class here: https://github.com/matthewwardrop/formulaic/blob/master/formulaic/model_spec.py
from formulaic.
I'll leave this open until this has been properly integrated into the docs.
from formulaic.
Hi Matthew. Great work !
I'm using the suggested approach to save/restore the state of ModelSpec within our ML framework with success, so I would suggest just a small sugar addition to catch the context when using the recovered ModelSpec: in my work I found very useful to include this utility under sugar.py
def get_model_matrix(model_spec, data, *, context=0, **kwargs):
if isinstance(context, int):
if hasattr(sys, "_getframe"):
frame = sys._getframe(context + 1)
context = LayeredMapping(frame.f_locals, frame.f_globals)
else:
context = None # pragma: no cover
return model_spec.get_model_matrix(data, context=context, **kwargs)
the body is the same as model_matrix, but with different Materializer introduced by the model_spec param.
In the previous example ms2 could be used like here:
mm2 = formulaic.get_model_matrix(mm2, df2)
with the effect of catching custom transformers referred by the formula
from formulaic.
Excellent idea @fabio-rosa-ors ! I've incorporated this into the model_matrix
sugar method, so now you can just do:
model_matrix(<old model matrix>, <new data>)
Will be in Formulaic 0.3
from formulaic.
Great !
from formulaic.
Related Issues (20)
- drop both columns in dependent variable and design matrix when missings occur HOT 5
- DOC: Explicitly mention support for multiple variables on the left hand side HOT 3
- Terms not being evaluated in get_model_matrix() HOT 2
- 17 tests fail: ModuleNotFoundError: No module named 'interface_meta' HOT 2
- How can the encoding choices for one dataset be reused for another? HOT 3
- Intercept term breaks when RHS formula begins with a parentheses HOT 2
- How do I set the reference level for a categorical term? HOT 4
- Support for sympy >= 1.10 HOT 3
- ENH: Preserve variable order as they appear in formulas HOT 5
- 2 tests fail HOT 1
- Interaction between two categorical covariates sometimes switches order, causing error HOT 3
- Intercept is not added after being removed HOT 4
- Proposal: support columns representing multiple features HOT 3
- Formulaic struggles with NAs and `poly()` syntax HOT 3
- Escaped variables and functions HOT 3
- How to include structural zeros? HOT 1
- Retain Column Names for sparse model matrices HOT 4
- Formulaic not raising an exception when required fields are missing in the dataset HOT 2
- Allow formatting the categorical encoded variables HOT 4
- Throw error when formula has parameters that are not available HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from formulaic.