shakedown.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A community for live music fans with roots in the jam scene. Shakedown Social is run by a team of volunteers (led by @clifff and @sethadam1) and funded by donations.

Administered by:

Server stats:

255
active users

#text

3 posts3 participants0 posts today

Recently I've combined various functions which I've been using in other projects (e.g. my personal PKM toolchain) and published them as new library thi.ng/text-analysis for better re-use:

- customizable, composable & extensible tokenization (transducer based)
- ngram generation
- Porter-stemming & stopword removal
- vocabulary (bi-directional index) creation
- dense & sparse multi-hot vector encoding/decoding
- histograms (incl. sorted versions)
- tf-idf (term frequency & inverse document frequency), multiple strategies
- k-means clustering (with k-means++ initialization & customizable distance metrics)
- similarity/distance functions (dense & sparse versions)
- central terms extraction

The attached code example (also in the project readme) uses this package to creeate a clustering of all ~210 #ThingUmbrella packages, based on their assigned tags/keywords...

The library is not intended to be a full-blown NLP solution, but I keep on finding myself running into these functions/concepts quite often, and maybe you'll find them useful too...

He reunido las herramientas que he hecho en un solo lanzador de mis "Pachi-apps" jeje, el resultado es un "MicroOS" (Que de OS no tiene nada jajaja pero me gusta cómo suena) tiene las herramienta que he hecho y que suelo utilizar. Pronto disponible en mi GitHub y Codeberg (Que ya estoy migrando a este último)

Replied in thread

Not gonna lie, I read "they had to call an ambulance for him" and assumed this was going to be a request for financial support. Like, "my kid had an ambulance called, so now we're $8k in the hole" kind of thing.

Pleasantly surprised that's not the case, but I wish I wasn't!


#text #healthcare