Comments (3)
This is an issue related to CSV.jl, not DataFrames.jl, so I transferred it.
What you are requesting is handled by the `missingstring` keyword argument. You can read about it here: https://csv.juliadata.org/stable/reading.html#missingstring.
If this is clear and sufficient for you, I propose closing this issue.
from csv.jl.
To be clear, I did NOT want anything to change in DataFrames.jl. In fact, I was preemptively requesting that you not change the default behavior. (I'm not sure what a good way to do that is. Maybe I should have posted on Discourse instead.) The default behavior right now is `missingstring=""`. That is how it should be. In Pandas, the default behavior is the equivalent of `missingstring=["NA", "NAN", "NULL"]`, which is very annoying.
I am not sure right now if importing and then exporting a csv using CSV.jl changes the data. If it does, then yes it would be an issue worth addressing.
I will close this issue.
(Also, I'm new to Julia and it's been so great to see your discussions, videos, and your work generally, @bkamins. Thank you.)
from csv.jl.
I tested this CSV file with CSV.jl and it seems to work as I would expect. It treats `""`, or two commas one after another, as `missing`. `"NA"`, `NA`, and similar values are treated as strings.
string,value1,value2,value3,value4,value5,value6,mis1,mis2,num
a,1,1,1,1,1,1,,"",1.1
b,NA,"NA",N/A,"N/A",NULL,"NULL",,"",1.2
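For anyone who wants to reproduce this, here is a minimal sketch. It assumes a recent CSV.jl release in which `missingstring` accepts either a single string or a vector of strings, and it reads the sample from an in-memory `IOBuffer` rather than a file. The names `data`, `df_default`, and `df_broad` are only illustrative, and the broader list in the second call is an example, not Pandas' exact default list.

```julia
using CSV, DataFrames

# The sample file from this comment, held in memory instead of on disk.
data = """
string,value1,value2,value3,value4,value5,value6,mis1,mis2,num
a,1,1,1,1,1,1,,"",1.1
b,NA,"NA",N/A,"N/A",NULL,"NULL",,"",1.2
"""

# Default behavior (missingstring=""): only empty fields become missing,
# so NA, N/A and NULL are kept as strings.
df_default = CSV.read(IOBuffer(data), DataFrame)

# Opt-in, Pandas-like behavior: also treat a few sentinel strings as missing.
# The list below is illustrative only.
df_broad = CSV.read(IOBuffer(data), DataFrame;
                    missingstring=["", "NA", "N/A", "NULL"])

println(df_default)
println(df_broad)
```

Comparing the two printed tables should show which cells change once the extra sentinel strings are passed explicitly.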
from csv.jl.