What's with UNIX and its CSV obsession?
What's with UNIX and its CSV obsession?
Oh boy, this is gonna be fun.
I need to parse #CSV in #Java. Parsing it into String[][] is easy enough, but here's the hard part: I also need to parse the numbers in the CSV file (into a double, BigDecimal, or the like), and those numbers could be formatted in any of the myriad number/currency formats that Excel might produce.
Web searching isn't getting me very far, and Excel itself is of course closed source so I can't look there.
Any advice would be much appreciated!
#Vivaldi も #ReadingList の #CSV でのインポート・エクスポートに対応してるから、サービス終了する #Pocket からエクスポートしたデータをインポートできるかもしれない。
CSV のフィールドとかを揃えるために事前にごにょごにょやる必要はあるかも
A solid 24-hour sprint later -- I had to wrangle data from tables in PDFs to get a solid baseline to work with (#csv here: https://pastebin.com/JGLxQMbT) -- and I've got all of the polling stations for tomorrow's #election in Tarrant county mapped out in #osm #OpenStreetMap :D
https://overpass-turbo.eu/s/23ok
I experimented a bit within the polling_station:* namespace. Thinking I might be settling into a pattern, at least for #Texas. I dunno if I want to build up explicit relations between polling places and their (typical) associated buildings/areas or not. So for now, I chucked the relation data in a tag instead.
Ideally, future elections won't be such a pain to get bootstrapped; most polling stations tend to stay the same across elections.
@0xF21D it is a #TSV file, but aside from out-commented lines, all entries are just one after the other, thus should just import fine.
I don't use #CSV as I want to be able to write full sentences in data fields and standardized on this since some fields may need #commas and #spaces, whereas #tabs work perfectly fine as seperators...
What are some of your favorite #Open #Document formats? Do you use #csv, #ical, #html and #css, odt, xml, json, #wav, #flac regularly? What #Free formats do you rely on? You can also use plain text files for a variety of functions from telling stories and taking notes to creating #todo.txt or similar files or #emacs org mode files. Have a favorite use for plain text files? Share it with us. Join us in celebrating #DocumentFreedomDay on March 26, 2025. #DFF #FLOSS
@falcennial it's helpful for #sysadmins and anyone who is running their own #firewall (regardpess if WAF, hosted/managed or DIY/bare metal), as they can just pull that Feed-URL to (un)block stuff automatically...
git
just to diff&merge stuff because they only offer "add" and "replace" when it.comes to #CSV / #TSV input.@krutonium also #Tabs don't fuck with other peoples' #IDE and correlate with #indentation depth easily.
https://www.youtube.com/watch?v=V7PLxL8jIl8
@Chigaze @rmondello @chrisgervais also good #PasswordManagers will support exporting & importing #CSV / #TSV files...
My recommendations are #Enpass and #KeePassXC / #KeePassDX ...
After some thoughts on the matter I've decided to do nothing about the #CSV deprecation in #PHP8.4 with league/csv here's why
https://nyamsprod.com/blog/csv-and-php8-4/
@Girgias I believe in french we say ... passer la patate chaude
Example #164: Piechart SVG generation/visualization of CSV data using https://thi.ng/geom & https://thi.ng/csv
(Like the previous/recent hexgrid example, this one too relies on still unreleased features & API updates of the geom package. If you want to try it out yourself, currently you'll need to clone & build the repo yourself, see instructions in the readme/wiki...)
Demo:
https://demo.thi.ng/umbrella/geom-csv-piechart/
Source code:
https://github.com/thi-ng/umbrella/blob/develop/examples/geom-csv-piechart/src/index.ts
@rmflight
I just took a look at this based on your recommendation. It's kind of pretty. But I would never use it because it is not scriptable. Scriptable data manipulations (such as header cleanup, column and row filtering) are QA-able, repeatable, and easily modified for similar data sets.
[#archives] C'est discret MAIS les données de l'inventaire de la sous-série 2U #Courdassises du #Morbihan 1811-1940 sont décrites au dossier ET téléchargeables en #CSV ! https://rechercher.patrimoines-archives.morbihan.fr/archive/fonds/FRAD056_0000002U2/inventaire/n:16?RECH_S=ploemeur&RECH_eadid=FRAD056_0000002U2&Archives.RECH_Valid=&type=inventaire (2e icône inventaire pdf)
De quoi :
- faire des stats faciles (plus de 5200 individus jusqu'en 1921, avec des vols, des faux, meurtres ou encore infanticides, attentats à la pudeur)
- prémacher le travail pour croiser avec #généalogie
@chessert @VoxofGod @CassandraZeroCovid @null
Weird, cuz TSV also works on https://infosec.space , https://mstdn.social and https://social.tchncs.de ...
After all, I merely put one domain per line and use # to put comments in entire lines. So the only " failing " entries are the comment lines...
The difference between #TSV and #CSV is the usa of #Tabs instead of #Commas and in all tests I could.perform this doesn't make a difference as it's basically #ASCII-compatible #UTF8-formatted #Text without even any indentations whatsoever...
#HowToThing #008 — CSV parsing & filtering into structured data via https://thi.ng/csv and creating a multi-plot data visualization via https://thi.ng/viz (along with a range of other helpful packages for various side aspects).
The attached visualization shows a lin-log plot of new COVID cases between March 2020 - Dec 2021:
- Daily world total as line plot
- UK (red) and USA (blue) cases as interleaved bar plots
(All data from: https://ourworldindata.org/coronavirus)
Full source code:
https://gist.github.com/postspectacular/6a379a2bb8cd46e242163b9c9563522f