On YouTube rants…

I have noticed that video is increasingly becoming the preferred communication medium on the web, especially for the younger generations. This is most noticeable amongst the newly formed gator/puppy set which spawned in the August that never ended, but it is not limited to just them. Any time these folks get some thought in their head that they feel is worth sharing with the world, they turn on their webcam, ramble off the cuff for anywhere from 15 minutes to 3 hours and then promptly upload the whole thing to YouTube without editing.

Back in my day (only a decade ago, but that’s like a million internet years) we might have called this “vlogging” but I haven’t really seen that word used in ages. Personally, I always thought of vlogs as prepared essays with visual components. To me the whole point of doing a video is to show viewers examples of the stuff you are talking about. What Anita Sarkeesian is doing is a good example: well researched, well edited, succinct, to the point visual essays with concrete examples of game-play and game dialogs. On the other hand, someone just talking “off the cuff” into the camera for twenty minutes in a single unbroken cut is…

Well, to me it just seems lazy. Here is the thing: I can read faster than you can talk. Therefore, if you have a message you want to get out there, the most efficient way of doing this is via text. Text can be absorbed very rapidly, even if it is an unstructured stream of consciousness jumble. I can skim long articles pretty quickly without losing too much information, but there is simply no way to skim a video. You can skip around, but that’s not the same. Skipping feels lossy. When I skim I can still look at the length and shape of the paragraph, check the opening and closing lines, scan for relevant keywords within and so on. The best YouTube can do for me at the moment in this respect is to show me still thumbnails of what I can expect to see on the screen when I skip to that point. Which, if I’m watching a 20 minute unbroken rant, is always going to be your face.

Writing things down takes some effort. The very process of arranging words into sentences, sentences into paragraphs and so on forces you to think about structure and flow. You can’t just vomit words in the exact order they pop into your mind. The written word has rules, and ignoring them yields an unreadable and confusing mess. But if you use video, you can just ramble, talk in circles, get tongue tied, correct yourself and go on tangents without losing too much coherence. Our brains are pretty good at making sense of unorganized, jumbled speech, because that’s how we communicate on a daily basis. So you can talk to a camera the way you would talk to your friend, and chances are most of your viewers will at least get the gist of what you’re saying. But the fact people can comprehend what you’re saying doesn’t mean you are coherent, or that you are not wasting their time. Because you are.

Yep, that’s a dude who didn’t even bother getting out of the bathtub to share his brilliant insights on ethics in video game journalism.

If you turn on a webcam, and hurl words at it for an hour without at least an outline, and without at least some basic editing to remove filler words (umm, err) and stuttering, you are saving yourself time while wasting mine. Considering that, according to some estimates, over 20% of the sounds we make during regular, conversational speech are non-lexical vocables, false starts and corrections, this is rather inconsiderate. This is why you don’t usually see people speaking this way on TV or in movies (save for maybe, you know, mumblecore stuff, which consciously mimics “natural” conversation patterns): for the most part it’s just noise. Useless, pointless interference that is not conducive to getting your message across.

So if you have some thoughts you want to share, write them down, kinda like I’m doing here. Put these words on Medium, or Twitlonger, or one of the other five million sites designed to facilitate exactly that. Ranting into a camera is just lazy.

Then again, maybe I’m just getting old. Perhaps there is a generational shift away from textual communication happening right now. And why not? It has never been easier to publish video online, and with ubiquitous broadband and storage we don’t have to aggressively edit for size, like we used to. So people are taking advantage of this.

There is this vision of the future that worries me quite a bit: one in which text is dead. In this future we input all data using touch and speech, and all output is visual and verbal. Humanity is mostly illiterate (save for a handful of historians and archivists who study old text) but not uneducated. Poets and writers simply dictate their books to machines, because we perfected speech processing algorithms, and we have them read to us by descendants of Siri, who have perfect cadence and inhumanly soothing voices. Scientists and engineers dictate their papers and equations. Math is done in silico…

Have you noticed how no one ever types in Spike Jonze’s Her?

But would that even work? Can you read and write scientific papers without the ability to skim? Can you write good code, without actually… Writing? Up until now, education and literacy were inseparable: one depended on the other. But can technology disentangle the two? Can it help to create a society of highly educated analphabets, and would that even be a desirable thing? I’m inclined to think that this future simply won’t happen, because text is just too fast, efficient and convenient. It compresses insanely well, can be searched and indexed with frightening speed and efficiency, it can be absorbed much faster than audio and it can be translated without artifacts and side effects (such as lip movement being out of sync with dubbed speech on video). I just don’t see us ever giving up all the benefits of text, without getting anything in exchange. Because even if we get perfect speech recognition software, and machines can interpret our commands with flawless accuracy, talking is still slower, less accurate and less focused than writing. It just would not make any sense to abandon it.

But Spike Jonze’s movie Her does provide a vision of the future in which no one ever types anymore, but people still do read. And that is potentially something that could happen one day. And that’s my worst nightmare, because I can only ever properly organize my thoughts when I write. Which is one of the reasons I never felt compelled to make these sorts of stream of consciousness type videos. Vocalizing my thoughts adds another layer of abstraction and takes me that much farther away from my message. I feel that dictation is nowhere near as flexible as typing. For example, have you ever tried to tell someone how you want them to re-format a document?

Can you copy that sentence… No that’s too much… No, actually I meant this sentence, and the short one afterwards. Now cut them out, and put them… Wait, scroll up a bit. No too much. Lower. Third paragraph… Sorry, I guess technically that’s fourth if you count that single word over there as a paragraph. So we put it here, but now we have to change it up to fix the flow…

It usually takes five minutes to explain to a human something you could do yourself in five seconds. Now imagine parsing all of this in an unambiguous way that can be understood by a machine. Editing text with speech would be a nightmare. In fact, editing anything with speech seems like an uphill battle. I think we would literally have to invent new, unambiguous sub-dialects just to efficiently interface with machines. Or maybe learn Lojban.

I think what we’re seeing here is just laziness, and not some generational paradigm shift.

Then again, I have been wrong on things like these in the past. If this is the way of the future, I will have to adapt to that new, nightmarishly inefficient world. I don’t want to be the bitter old man who doesn’t get the new technology and refuses to get with the times. And at the very least, this strange future without reading and writing would result in more engaging and visually pleasing PowerPoint presentations without bullet points…

6 Responses to On YouTube rants…

IceBrain says:

May 16, 2015 at 5:49 am

Nah, I don’t think text is dead. I’ll bet you that below that video of the guy in the bathtub, there are hundreds, if not thousands, of written comments. Text is still the low effort option for small posts and comments, not to mention the private option – can you picture teenagers dictating their messages when near their parents or teachers? I don’t think it’ll disappear until we get something better than dictation to replace it.

I do think most people won’t write long-form posts, but then again, they already didn’t. A person with a blog, or who writes a Medium post, is and has always been an exception.

Video blogs are like IRC and the early social web; it seems shocking that so many people can’t write, but the technology didn’t cause that inability, it just exposed what was previously hidden from the public.

That said, I don’t agree that editing text with speech must be a nightmare. Or actually, editing text just with speech might be a nightmare, but a better combination can be found with touch screens + speech, without having to type. You just select, drag and delete using your fingers, and then use speech to replace or add content.
Right now, the Android UI is terrible for that (I don’t know if iOS is any better), but that’s just a software issue. It’s likely that there’s already some decent app for editing text.

Luke Maciak says:

May 16, 2015 at 3:11 pm

@ IceBrain:

Very true. The privacy angle is an especially good point. And it’s not just for teenagers. The whole idea of PINs and passwords is that we can enter them silently in public without revealing them to the world. Without keyboards the mainstay of our digital security simply vanishes and we would have to settle for much less secure physical tokens (which can be stolen) and biometrics (which can likely be sampled and duplicated from the stolen token).

I think in Her they had little touch tablets for “finger editing” though they rarely used them. You can actually see them in the picture above, when the protagonist’s friend shows him her game. So I think you’re on to something here.

But yeah, editing with fingers is also clumsy on iOS. It’s the combination of “fat fingers” and small text requiring precise movements. Most of the time you have to kinda rock your finger back and forth to actually get the cursor where it needs to go. You can tap to select a word or a paragraph, and then iOS gives you handles you can use to narrow down the selection from both sides, which works but is nowhere near as practical as mouse highlighting.

Then of course keyboard and mouse interfaces are also flawed, which is why we have things like vim that allow for very efficient keyboard-only navigation and manipulation.

And of course the vim interface is flawed because the text objects, which include characters, words, sentences and sentence chunks, are always too granular or not granular enough to grab the exact amount of text you need right now in a single command. :P

Max says:

May 17, 2015 at 5:13 pm

Maybe the YouTube rant people just mostly use phones and tablets and can’t type quickly on a keyboard? Still, if you want to get your thoughts out there, you should probably learn to use it.

And I’m imagining a speech-interface for vim now:
“double-yoo double-yoo dee two double-yoo…”

Kamil says:

May 19, 2015 at 2:44 pm

A few months ago Jacek Dukaj published a very interesting text about the future of reading and writing in the Gazeta Książki magazine. You should check it out. It’s quite in tune with some of the things you’re saying here. On an unrelated note, are you going to review The Old Axolotl? It’s Dukaj’s first novel available in English translation.

Gabriel Chavez says:

May 21, 2015 at 4:45 am

I couldn’t agree more with the article. I also strongly believe text is the way to go most of the time.
Video, audio, every medium has its own strengths and weaknesses but text should rule them all on the web. Unfortunately, reading requires focus that demands few to no distractions. See the post on why we can’t read anymore.

Ranting is special because anyone can rant for hours on a particular topic but if you want to write about it you can’t do it for too long. I learned this when I was listening to TotalBiscuit rant about a particular subject for around 20 minutes, but his core message wasn’t long or complicated.

Text is the upper limit of human expression: when done well, it provides fast, succinct and direct information. Sadly, it also requires focus and alone time, two commodities that are becoming more and more scarce.

Andrew Zimmerman says:

October 15, 2015 at 11:23 am

“I think what we’re seeing here is just laziness, and not some generational paradigm shift.”

Absolutely. I mean, people will probably speak to text before they switch to video. People don’t want to watch video just to get a single shred of knowledge they’re looking for. It’s amusing how long most of the videos are on YouTube, with the creator just taking FOREVER to get to the grain of truth.
