I’m a contributor – I did Kafka’s The Castle, Agatha Christie’s Giant’s Bread, and Stella Benson’s The Faraway Bride for this launch – and I’m happy to answer any questions about Standard Ebooks.
Do you think the things that makes an edition special goes missing while converting to e.g. Standard Ebooks. I remember both the The Castle and Das Schloss like they had typesetting that helped me in perceiving the feel of the book. Is there anyway to preserve that feeling and still keep within the bounds of standardisation you adhere to? (I did a quick look through my copy and it does not seem to be much that makes it unique really, just the size of the book, and the chapter heading graphics..)
Do you know if the project try to look at other languages at all?
Nothing particularly in The Castle, from my production of it. As this was not previously PD there wasn’t any Gutenberg (or other) transcription available, so I did my own from the OCR of the original scans. A large part of the feel of the work, to me at least, comes from the extreme sentence / paragraph lengths though.
We do have a default typography across all our works (the “Standard” in “Standard Ebooks” refers to a standard imprint; think Penguin) but we usually retain specific famous things where possible in a reflowable format. For example, the Mouse’s Tail in Alice’s Adventures in Wonderland,[1] or the letter in E. A. Poe’s “Thou Art the Man”.[2]
We don’t take on other languages, no. Our tooling[3] and style guides[4] are tailored specifically to English. Absolutely nothing stopping another project from forking the codebase (it’s GPL-3) and giving it a go.
It seems you may be making assumptions that the formatting and typesetting of any particular edition were intentional or even deliberate on the part of the author, not any number of people, from editors to printers, who could and would have influenced those things for various reasons.
Something I am rather familiar with is brought out by your mention of the German edition/title; that the continental market seems to generally produce books that are far more densely formatted, i.e., smaller font and typesetting, thinner pages, and leading to overall tighter book formats. I actually appreciate it when, e.g., a book is 1/2 the size and weight, and usually also made far more durably; but it will invariably compromise any author intention related to the arrangement of the lettering.
Maybe you can confirm that based on what seems to be your English and German editions of the same novel.
I found it amusing, considering all those memes about German words with 35 letters each.
And, as I get older, I began to consider letter size relevant to choose a book edition. Gave up buying new books and went for used, older editions with bigger letters.
Well that depends, there are obviously authors that care about these things. I have no idea what Brods intentions were with the book, and if he cared about layout.
The German and Swedish editions I read were similarly typeset, and the first scan I found in English felt similar. What I wanted to know was if there was some thought into it, because the website is nicely designed so striving for a unique typesetting strategy could be a goal.
Probably a dumb question, but how do you guys decide (and source) the book covers? I love how they look, but as a philistine can't put into words why.
Also thanks for doing this, I've read a bunch of stuff (GK Chesterton, Dorothy Sayers, Dashiell Hammett) that I wouldn't have otherwise if it weren't for this service.
The historical criteria is fine-art style oil painting. These days we’re starting to use first-edition cover art a bunch for more modern productions if it’s good quality. We also tend to use abstract oil paintings for sci-fi.[1] Obviously, all art is sourced from the public domain too. We’ve also started a database of confirmed-US-PD artwork that we can use for future productions.[2]
I don't know their reasons but PDF is a rather problematic format so I suspect that's why.
You can run their EPUB through Pandoc to convert yourself, or put some effort in and setup your own Calibre instance which will do something similar when you ask it to.
The title makes it look like Public Domain is universal, while the article does mention that this list is only about the USA.
> On January 1, 2026, books published in 1930 enter the U.S. public domain.
The Copyright laws are different in each country, and it's a non-sense in the modern world.
A few years ago, I was searching for books written by Alexandra David-Neel. I found them on a Canadian (IIRC) website, but downloads were filtered by geo-IP, since what was in the public domain there was not yet public in France. One of the books I wanted was written before 1900, and not in print since then. Yet the author died in 1969, aged 100, so the French Public Domain for her works will start in 2040.
Another example: "As I lay dying" by William Faulkner is now Public Domain in the USA. It was Public Domain in Canada from 2013 to 2023. Then the law changed, and the copyright was extended by 20 years, and reinstated for this book until 2032 — which is 70 years after the author's death in 1962.
It would be interesting to see a combined list taking into account both US "1930 or older" rule and more common internationally "life+70" rule, to see what works have finally escaped both of those and make works a bit less unsafe to make use of, but I have not seen any list like that?
I have a hypothesis that we're getting closer to a cultural inflection point (maybe half a decade out). With every year, more important and very high-quality cultural artifacts enter the public domain, while at the same time, many low quality artefacts are produced (... AI slop). It'll be increasingly difficult to choose a good cultural artefict for consumption (e.g., which book to read next or which movie to watch). A very good indicator for quality is time and thus a useful filter.
In some years we could have the following: a netflix-like (legal variant of popcorntime) software system (p2p) that serves high-quality public domain movies, for those who like it, even with AI upscaling or post processing.
The same would also work for books, with this pipeline: Project Gutenberg -> Standard Ebooks. At the inflection point, there would be a steady stream of high-quality formats of high-quality content, enough to satisfy the demand of cultural consumption. You wouldn't need the latest book/movie anymore, except for interest in contemporary stuff.
Do you know if the project try to look at other languages at all?
We do have a default typography across all our works (the “Standard” in “Standard Ebooks” refers to a standard imprint; think Penguin) but we usually retain specific famous things where possible in a reflowable format. For example, the Mouse’s Tail in Alice’s Adventures in Wonderland,[1] or the letter in E. A. Poe’s “Thou Art the Man”.[2]
We don’t take on other languages, no. Our tooling[3] and style guides[4] are tailored specifically to English. Absolutely nothing stopping another project from forking the codebase (it’s GPL-3) and giving it a go.
[1] https://standardebooks.org/ebooks/lewis-carroll/alices-adven...
[2] https://standardebooks.org/ebooks/edgar-allan-poe/short-fict...
[3] https://github.com/standardebooks/tools
[4] https://standardebooks.org/manual/
Something I am rather familiar with is brought out by your mention of the German edition/title; that the continental market seems to generally produce books that are far more densely formatted, i.e., smaller font and typesetting, thinner pages, and leading to overall tighter book formats. I actually appreciate it when, e.g., a book is 1/2 the size and weight, and usually also made far more durably; but it will invariably compromise any author intention related to the arrangement of the lettering.
Maybe you can confirm that based on what seems to be your English and German editions of the same novel.
I found it amusing, considering all those memes about German words with 35 letters each.
And, as I get older, I began to consider letter size relevant to choose a book edition. Gave up buying new books and went for used, older editions with bigger letters.
The German and Swedish editions I read were similarly typeset, and the first scan I found in English felt similar. What I wanted to know was if there was some thought into it, because the website is nicely designed so striving for a unique typesetting strategy could be a goal.
Also thanks for doing this, I've read a bunch of stuff (GK Chesterton, Dorothy Sayers, Dashiell Hammett) that I wouldn't have otherwise if it weren't for this service.
[1] https://standardebooks.org/subjects/science-fiction
[2] https://standardebooks.org/artworks
One of the SE editors experimented with turning SE ebooks into PDFs, though. See more about that here: https://groups.google.com/g/standardebooks/c/Xy2bwiexLeM/m/f...
You can run their EPUB through Pandoc to convert yourself, or put some effort in and setup your own Calibre instance which will do something similar when you ask it to.
> On January 1, 2026, books published in 1930 enter the U.S. public domain.
The Copyright laws are different in each country, and it's a non-sense in the modern world.
A few years ago, I was searching for books written by Alexandra David-Neel. I found them on a Canadian (IIRC) website, but downloads were filtered by geo-IP, since what was in the public domain there was not yet public in France. One of the books I wanted was written before 1900, and not in print since then. Yet the author died in 1969, aged 100, so the French Public Domain for her works will start in 2040.
Another example: "As I lay dying" by William Faulkner is now Public Domain in the USA. It was Public Domain in Canada from 2013 to 2023. Then the law changed, and the copyright was extended by 20 years, and reinstated for this book until 2032 — which is 70 years after the author's death in 1962.
I have a hypothesis that we're getting closer to a cultural inflection point (maybe half a decade out). With every year, more important and very high-quality cultural artifacts enter the public domain, while at the same time, many low quality artefacts are produced (... AI slop). It'll be increasingly difficult to choose a good cultural artefict for consumption (e.g., which book to read next or which movie to watch). A very good indicator for quality is time and thus a useful filter.
In some years we could have the following: a netflix-like (legal variant of popcorntime) software system (p2p) that serves high-quality public domain movies, for those who like it, even with AI upscaling or post processing.
The same would also work for books, with this pipeline: Project Gutenberg -> Standard Ebooks. At the inflection point, there would be a steady stream of high-quality formats of high-quality content, enough to satisfy the demand of cultural consumption. You wouldn't need the latest book/movie anymore, except for interest in contemporary stuff.
Was referring to the Bogart version.