Skip to main content
- ..
- Theresa O'Connor shared a link to the group: USS Loma Prieta, NCC-26848. (Facebook) ()
- @evacide absolutely! ()
- @torgo hell yeah! ()
- Late January link roundup ()
- @castastrophe yay, welcome! ()
- Don’t judge a book by its cover ()
- Boost (Mastodon):
He told them, "This show is about diversity and working together in concert. The two people who most represent that diversity are Nichelle Nichols and George Takei, and if they can't be a part of this, then I'm not interested either." He was willing to walk away from a job—a rare act of principle in this industry—because he understood that Star Trek's promise of infinite diversity in infinite combinations was not just a slogan, but a responsibility.
— George Takei :verified: 🏳️🌈🖖🏽 (@georgetakei@universeodon.com) ()
- @torgo yay! @erynofwales and I have been watching it with EJ; it’s her first time. ()
- A week of links ()
- Rikki’s!! (Threads) ()
- Boost (Mastodon):
A PERSON POSTED A THOUGHTFUL COMMENT ON MY BLOG AND I FEEL LIKE IT’S 2011 AGAIN. BRING BACK LONG FORM EXCHANGES IN PLACED WE CONTROL.
— willowbl00 (@willowbl00@indieweb.social) ()
- Il Duce Arancione ()
- Wikipedia turned twenty five today! ()
- 38 ()
- Theresa O'Connor wrote on Eryn Wells's profile. (Facebook) ()
- Some illogical links on World Logic Day ()
- Boost (Mastodon):
This weekend, I wrote about the state murder of Renee Good, and the fascist claim that her murderer enjoys absolute immunity, and a common-sense reversal of qualified immunity that protects civilians from authorities, rather than the other way around.
Alignment with supremacy is a vile choice. Raising the natural costs for those who make that choice by putting first things first. Absolute Immunity by JuliusGoat
— A.R. Moxon, Verified Duck 🦆 (@JuliusGoat@mastodon.social) ()
- (Threads) ()
- Boost (Mastodon):
if you want stable and reliable software, give developers stable and reliable employment
— Zach Leatherman (@zachleat@zachleat.com) ()
- @PavelASamsonov the original: https:// pbfcomics.com/comics/book-worl d/ ()
- The Hague for the lot of them. #america #authoritarianism #fuckice ()
- Boost (Mastodon):
Also, bureaucracy was created to be sure everyone is treated as equal. I love it when it’s self improving instead of papered-over (like our legal system!).
J. Nathan Matias 🦣 (@natematias@social.coop)
I'm kinda grateful no one told me, growing up, that "trust but verify" is an anagram for b u r e a u c r a c y
— willowbl00 (@willowbl00@indieweb.social) ()
- [citation needed] ()
- Fresh links for the new year ()
- Boost (Mastodon):
All we have are Mussolini cosplayers who can’t make the trains run on time, Rockefeller LARPers who can’t build a refinery, and Nikola Tesla tribute acts who’ve never shipped hardware. History will remember this as the age of the cover band. We are ruled by guys who watched a YouTube documentary about Napoleon and said “I could do that” but meant the hat.
— JA Westenberg (@Daojoan@mastodon.social) ()
- Boost (Mastodon):
All that the Turing Test proves is that human are much, much stupider than Alan Turing ever suspected.
— evacide (@evacide@hachyderm.io) ()
- Little Women ()
- Boost (Mastodon):
Why #Markdown 's emphasis syntax (`**`) fails outside of Western languages: A deep dive into #CommonMark 's “delimiter run” flaws and their impact on #CJK users. A must-read for anyone interested in #internationalization and the future of Markdown:
As Markdown has become the standard for LLM outputs, we are now forced to witness a common and unsightly mess where Markdown emphasis markers (**) remain unrendered and exposed, as seen in the image. This is a chronic issue with the CommonMark specification---one that I once reported about ten years ago---but it has been left neglected without any solution to this day.
The technical details of the problem are as follows: In an effort to limit parsing complexity during the standardization process, CommonMark introduced the concept of "delimiter runs." These runs are assigned properties of being "left-flanking" or "right-flanking" (or both, or neither) depending on their position. According to these rules, a bolded segment must start with a left-flanking delimiter run and end with a right-flanking one. The crucial point is that whether a run is left- or right-flanking is determined solely by the immediate surrounding characters, without any consideration of the broader context. For instance, a left-flanking delimiter must be in the form of **<ordinary character>, <whitespace>**<punctuation>, or <punctuation>**<punctuation>. (Here, "ordinary character" refers to any character that is not whitespace or punctuation.) The first case is presumably intended to allow markers embedded within a word, like **마크다운**은, while the latter cases are meant to provide limited support for markers placed before punctuation, such as in 이 **"마크다운"** 형식은. The rules for right-flanking are identical, just in the opposite direction.
However, when you try to parse a string like **마크다운(Markdown)**은 using these rules, it fails because the closing ** is preceded by punctuation (a parenthesis) and it must be followed by whitespace or another punctuation mark to be considered right-flanking. Since it is followed by an ordinary letter (은), it is not recognized as right-flanking and thus fails to close the emphasis.
As explained in the CommonMark spec, the original intent of this rule was to support nested emphasis, like **this **way** of nesting**. Since users typically don't insert spaces inside emphasis markers (e.g., **word **), the spec attempts to resolve ambiguity by declaring that markers adjacent to whitespace can only function in a specific direction. However, in CJK (Chinese, Japanese, Korean) environments, either spaces are completly absent or (as in Korean) punctuations are commonly used within a word. Consequently, there are clear limits to inferring whether a delimiter is left or right-flanking based on these rules. Even if we were to allow <ordinary character>**<punctuation> to be interpreted as left-flanking to accommodate cases like **마크다운(Markdown)**은, how would we handle something like このような**[状況](...)は**?
In my view, the utility of nested emphasis is marginal at best, while the frustration it causes in CJK environments is significant. Furthermore, because LLMs generate Markdown based on how people would actually use it---rather than strictly following the design intent of CommonMark---this latent inconvenience that users have long felt is now being brought directly to the surface.
As Markdown has become the standard for LLM outputs, we are now forced to witness a common and unsightly mess where Markdown emphasis markers (**) remain unrendered and exposed, as seen in the image. This is a chronic issue with the CommonMark specification---one that I once reported about ten years ago---but it has been left neglected without any solution to this day.
The technical details of the problem are as follows: In an effort to limit parsing complexity during the standardization process, CommonMark introduced the concept of "delimiter runs." These runs are assigned properties of being "left-flanking" or "right-flanking" (or both, or neither) depending on their position. According to these rules, a bolded segment must start with a left-flanking delimiter run and end with a right-flanking one. The crucial point is that whether a run is left- or right-flanking is determined solely by the immediate surrounding characters, without any consideration of the broader context. For instance, a left-flanking delimiter must be in the form of **<ordinary character>, <whitespace>**<punctuation>, or <punctuation>**<punctuation>. (Here, "ordinary character" refers to any character that is not whitespace or punctuation.) The first case is presumably intended to allow markers embedded within a word, like **마크다운**은, while the latter cases are meant to provide limited support for markers placed before punctuation, such as in 이 **"마크다운"** 형식은. The rules for right-flanking are identical, just in the opposite direction.
However, when you try to parse a string like **마크다운(Markdown)**은 using these rules, it fails because the closing ** is preceded by punctuation (a parenthesis) and it must be followed by whitespace or another punctuation mark to be considered right-flanking. Since it is followed by an ordinary letter (은), it is not recognized as right-flanking and thus fails to close the emphasis.
As explained in the CommonMark spec, the original intent of this rule was to support nested emphasis, like **this **way** of nesting**. Since users typically don't insert spaces inside emphasis markers (e.g., **word **), the spec attempts to resolve ambiguity by declaring that markers adjacent to whitespace can only function in a specific direction. However, in CJK (Chinese, Japanese, Korean) environments, either spaces are completly absent or (as in Korean) punctuations are commonly used within a word. Consequently, there are clear limits to inferring whether a delimiter is left or right-flanking based on these rules. Even if we were to allow <ordinary character>**<punctuation> to be interpreted as left-flanking to accommodate cases like **마크다운(Markdown)**은, how would we handle something like このような**[状況](...)は**?
In my view, the utility of nested emphasis is marginal at best, while the frustration it causes in CJK environments is significant. Furthermore, because LLMs generate Markdown based on how people would actually use it---rather than strictly following the design intent of CommonMark---this latent inconvenience that users have long felt is now being brought directly to the surface. by yurume@hackers.pub
— Why #Markdown's emphasis synta… ()
- Boost (Mastodon): https://hackers.pub/ap/notes… ()
- Tā Tā For Now ()
- @timcappalli go Bruins! ()
- Batteries included: turning CSV into JSON in Python ()
- Happy New Year! ()
- Ecma ()