Entries for 2023

@onusoz · /2023/12/13· 07:08 PM

@xkcd1963 @togethercompute https://t.co/HoXS8BCLuH

@onusoz · /2023/10/27· 11:06 AM

“the QIPS Exchange -- the marketplace where processing power was bought and sold. The connection to JSN had passed through the Exchange, transparently; her terminal was programmed to bid at the market rate automatically, up to a certain ceiling.” - Permutation City


@onusoz · /2023/06/19· 04:23 PM

Created a wordcloud version of the Cognitive Bias Codex by @jm3 and @buster. Font size is proportional to Google search result count, which roughly measures each term's popularity. Read more: https://t.co/HWs9wqgPyh


@onusoz · /2023/06/17· 01:24 PM

Had lots of fun shipping this feature ✌️


Onur Solmaz · Post · /2023/06/17

Cognitive Biases Ranked by Popularity

If you have spent some time on rationalist forums, you might have come across images that try to visualize cognitive biases that humans are prone to:

This specific one was created by John Manoogian III and Buster Benson, who compiled the list of biases from Wikipedia.

It is a great way to get a sense of the sheer number of biases that exist, but it doesn’t tell you much about how much of the popular mindshare each bias has. All the biases having the same size implies that they are all equally important, but that is obviously not the case. Arguably, for someone who has just started to learn about cognitive biases, confirmation bias should be more important than, say, the Peltzman effect.

To measure and visualize the popularity of each bias, I…

ran a Google search with the format "<insert cognitive bias here>" cognitive bias using a SERP API,
got the number of search results for each term,
created a wordcloud using the wordcloud Python package,
used logarithms of the search count for better scaling,
used the same colors as the Cognitive Bias Codex for consistency,
used a shape mask of a brain to make it look cool.
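
The log-scaling step in the list above can be sketched as follows. This is a minimal illustration and not the code linked from this post: the function name log_scale is made up for the example, and only a few counts from the top-10 table further down are used. Weights like these could then be handed to the wordcloud package, for instance through WordCloud.generate_from_frequencies.

```python
import math

# A few search result counts copied from the top-10 table in this post.
search_counts = {
    "Prejudice": 8_560_000,
    "Anchoring": 1_100_000,
    "Confirmation bias": 992_000,
    "Social desirability bias": 319_000,
}


def log_scale(counts):
    """Map raw counts to log10 weights so one huge term does not dwarf the rest.

    Hypothetical helper for illustration; the original code may scale differently.
    """
    return {term: math.log10(count) for term, count in counts.items()}


weights = log_scale(search_counts)

# Compare how far apart the largest and smallest terms are before and after scaling.
raw_ratio = search_counts["Prejudice"] / search_counts["Social desirability bias"]
log_ratio = weights["Prejudice"] / weights["Social desirability bias"]
print(f"raw ratio: {raw_ratio:.1f}, log ratio: {log_ratio:.2f}")
```

The ordering of the terms is unchanged by the logarithm. Only the spread between the largest and smallest weights shrinks, which keeps the less popular biases legible in the image.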

Here is the result:

The bigger the font, the more Google search results there are for that bias, the assumption being Google search results are a good measure of popularity.

Why should you care about the popularity of biases? The more popular or common a bias is, the more likely you are to be affected by it. So it makes sense to study them in decreasing order of popularity, to maximize the benefit to your own thinking. However, this is all statistics—you could still be impacted more by a bias that is smaller in the wordcloud. For example, there was a time when I was very prone to the sunk cost fallacy, even though it doesn’t show up so large in the wordcloud.

Below is a version of the image without the shape mask:

Below are the top 10 biases ranked by Google search result count:

Cognitive bias	Search result count
Prejudice	8,560,000
Anchoring	1,100,000
Stereotyping	1,080,000
Confirmation bias	992,000
Conservatism	610,000
Essentialism	436,000
Loss aversion	426,000
Attentional bias	374,000
Curse of knowledge	373,000
Social desirability bias	319,000

Click here to see the search result counts for each of the 188 biases included above.

I have also computed the average search result count for each category of biases, by dividing the total search result count for each category by the number of biases in that category:

Category	Average count
We discard specifics to form generalities	1,494,378
We notice when something has changed	237,141
We fill in characteristics from stereotypes, generalities, and prior histories	160,170
We are drawn to details that confirm our own existing beliefs	93,350
We think we know what other people are thinking	81,555
To act, we must be confident we can make an impact and feel what we do is important	72,435
We notice things already primed in memory or repeated often	70,835
To get things done, we tend to complete things we’ve invested time and energy in	65,822
To avoid mistakes, we aim to preserve autonomy and group status, and avoid irreversible decisions	65,750
We edit and reinforce some memories after the fact	59,503
We favor simple-looking options and complete information over complex, ambiguous options	52,491
We tend to find stories and patterns even when looking at sparse data	46,375
To stay focused, we favor the immediate, relatable thing in front of us	37,940
Bizarre, funny, visually striking, or anthropomorphic things stick out more than non-bizarre/unfunny things	37,081
We imagine things and people we’re familiar with or fond of as better	34,379
We simplify probabilities and numbers to make them easier to think about	33,881
We notice flaws in others more easily than we notice flaws in ourselves	31,390
We project our current mindset and assumptions onto the past and future	29,418
We reduce events and lists to their key elements	27,638
We store memories differently based on how they were experienced	20,440

Notice that the top few biases such as prejudice and anchoring highly skew the ranking.
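
The per-category average described above (total count divided by the number of biases in the category) can be sketched like this. The rows are placeholders invented for the example and are not the real data; the function name average_by_category is also hypothetical. The sketch is meant to show how a single very large count pulls up the average of its whole category.

```python
from collections import defaultdict

# Placeholder (category, bias, count) rows, invented for illustration only.
rows = [
    ("category A", "bias 1", 8_000_000),
    ("category A", "bias 2", 40_000),
    ("category A", "bias 3", 60_000),
    ("category B", "bias 4", 300_000),
    ("category B", "bias 5", 500_000),
]


def average_by_category(rows):
    """Return {category: total count / number of biases in that category}."""
    totals = defaultdict(int)
    sizes = defaultdict(int)
    for category, _bias, count in rows:
        totals[category] += count
        sizes[category] += 1
    return {category: totals[category] / sizes[category] for category in totals}


averages = average_by_category(rows)
for category, average in sorted(averages.items(), key=lambda item: -item[1]):
    print(f"{category}\t{average:,.0f}")
```

In this toy data, category A ranks first even though two of its three biases have the smallest counts, which is the same kind of skew mentioned above.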

Similarly, I have computed the average search result count for each top category of biases:

Top category	Average count
What Should We Remember?	316,297
Too Much Information	101,842
Need To Act Fast	64,568
Not Enough Meaning	64,134

You can see the code I used to create the figure here.

I will not try to reason as to why some biases are more popular than others, and instead leave that for another post.

@onusoz · /2023/05/18· 04:45 PM

If you are interested in using Manim Voiceover, auto-translating your videos into other languages, or any other cool stuff, hit me up in a DM!

@onusoz · /2023/05/18· 04:45 PM

I've just published *Code-Driven Videos*, my long term vision behind Manim Voiceover plugin. I will try to summarize it on this thread 👇🧵 cc @manim_community https://t.co/AXpOMTZKha

Onur Solmaz · Post · /2023/05/11· HN

Code-Driven Videos

tl;dr I created Manim Voiceover, a plugin for the Python math animation library Manim that lets you add voiceovers to your Manim videos directly in Python, with either AI voices or actual recordings.

This makes it possible to create “fully code-driven” educational videos in pure Python. Videos can be developed like software, taking advantage of version-controlled, git-based workflows (i.e. no more Final.final.final.mp4 :).

It also makes it possible to use AI to automate all sorts of things. For example, I have created a pipeline for translating videos into other languages automatically with i18n (gettext) and machine translation (DeepL).

Follow my Twitter to get updates on Manim Voiceover.

A little background

For those who are not familiar, Manim is a Python library that lets you create animations programmatically, created by Grant Sanderson, a.k.a. 3blue1brown. His visual explainers are highly acclaimed and breathtakingly good (to see an example, click here for his introduction to neural networks).

Manim was originally built for animating math, but you can already see it being used in other domains such as physics, chemistry, computer science, and so on.

Creating any video is a very time-consuming process. Creating an explainer that needs to be mathematically exact is even more so, because the visuals often need to be precise to convey knowledge efficiently. That is why Manim was created: to automate the animation process. It turns out programming mathematical structures is easier than trying to animate them in a video editor.

However, this results in a workflow that is spent partly in the text editor (writing Python code) and partly in the video editor (editing the final video), with a lot of back and forth in between. The main reason is that the animation needs to be synced with voiceovers, which are recorded separately.

In this post, I will try to demonstrate how we can take this even further by making voiceovers a part of the code itself with Manim Voiceover, and why this is so powerful.

The traditional workflow

Creating a video with Manim is very tedious. The steps involved are usually as follows:

Plan: Come up with a script and a screenplay.
Record: Record the voiceover with a microphone.
Animate: Write the Python code for each scene, which will generate the animation videos.
Edit: Overlay and synchronize the voiceover and animations in a video editor, such as Adobe Premiere.

The workflow is often not linear. The average video requires you to rewrite, re-record, re-animate and re-sync multiple scenes:

The less experience you have making videos, the more takes you will need. Creating such an explainer has a very steep learning curve. It can take up to 1 month for a beginner to create their first few minutes of video.

Enter Manim Voiceover

I am a developer by trade, and when I first tried to create a video with the traditional workflow, I found it harder than it should be. We developers are spoiled, because we get to enjoy automating our work. Imagine that you had to manually compile your code using a hex editor every time you made a change. That is what it felt like to create a video using a video editor. The smallest change in the script meant that I had to re-animate, re-record and re-sync parts of the video, the main culprit being the voiceover.

To overcome this, I thought of a simple idea: Create an API that lets one add voiceovers directly in Python. Manim Voiceover does exactly that and provides a comprehensive framework for automating voiceovers. Once the entire production can be done in Python, editing in the video editor becomes mostly unnecessary. The workflow becomes:

Plan: Same as before.
Animate: Develop the video with an AI-generated voiceover, all in Python.
Record: When the final revision is ready, record the actual voiceover with Manim Voiceover’s recorder utility. The audio is transcribed with timestamps and inserted at the right times automatically.

A little demo. Here is what a video would look like at the end of step (2):

And watch below to see what it would look like at the end of step (3), with my own voice:

I explain why this is so powerful below:

Zero-cost revisions

In the previous method, making modifications to the script has a cost, because you need to re-record the voiceover and readjust the scenes in the video editor. Here, making modifications is as easy as renaming a variable, since the AI voiceover is generated from code automatically. This saves a lot of time in the production process:

This lets videos created with Manim be “fully code-driven” and take advantage of open source, collaborative, git-based workflows. No manual video editing needed, and no need to pay for overpriced video editing software:

(Or at least drastically reduced need for them)

Increased production speed

From personal experience and talking to others who have used it, Manim Voiceover increases production speed by a factor of at least 2, compared to manual recording and editing.

Note: The current major bottlenecks are developing the scene itself and waiting for the render. Regarding render speed: Manim CE’s Cairo renderer is much slower than ManimGL’s OpenGL renderer. Manim Voiceover currently only supports Manim CE, but it is on my roadmap to add support for ManimGL.

The API in a nutshell

This all sounds great, but what does it look like in practice? Let’s take a look at the API. Here is a “Hello World” example for Manim, drawing a circle:

from manim import *

class Example(Scene):
    def construct(self):
        circle = Circle()
        self.play(Create(circle))

Here is the same scene, with a voiceover that uses Google Translate’s free text-to-speech service:

from manim import *
from manim_voiceover import VoiceoverScene
from manim_voiceover.services.gtts import GTTSService

class VoiceoverExample(VoiceoverScene):
    def construct(self):
        self.set_speech_service(GTTSService(lang="en"))

        circle = Circle()
        with self.voiceover(text="This circle is drawn as I speak."):
            self.play(Create(circle))

Notice the with statement. You can chain such blocks back to back, and Manim will vocalize them in sequence:

with self.voiceover(text="This circle is drawn as I speak."):
    self.play(Create(circle))

with self.voiceover(text="Let's shift it to the left 2 units."):
    self.play(circle.animate.shift(2 * LEFT))

The code for videos made with Manim Voiceover generally looks cleaner, since it is compartmentalized into blocks with voiceovers acting as annotations on top of each block.

See how this is rendered:

Record

To record an actual voiceover, you simply change a single line of code:

# Needs this import at the top: from manim_voiceover.services.recorder import RecorderService
# self.set_speech_service(GTTSService(lang="en")) # Comment this out
self.set_speech_service(RecorderService())        # Add this line

Currently, rendering with RecorderService starts up a voice recorder implemented as a command line utility. The recorder prompts you to record each voiceover in the scene one by one and inserts audio at appropriate times. In the future, a web app could make this process even more seamless.

Check out the documentation for more examples and the API specification.

Auto-translating videos

Having a machine readable source for voiceovers unlocks another superpower: automatic translation. Manim Voiceover can automatically translate your videos to any language, and even generate subtitles in that language. This will let educational content creators reach a much wider audience.

Here is an example of the demo translated to Turkish and rendered with my own voice:

To create this video, I followed these steps:

I wrapped translatable strings in my demo inside _() per gettext convention. For example, I changed text="Hey Manim Community!" to text=_("Hey Manim Community!").
I ran manim_translate blog-translation-demo.py -s en -t tr -d blog-translation-demo, which created the locale folder, called DeepL’s API to translate the strings, and saved them under locale/tr/LC_MESSAGES/blog-translation-demo.po.
- Here, -s stands for source language,
- -t stands for target language,
- and -d stands for the gettext domain.
I edited the .po file manually, because the translation was still a bit off.
I ran manim_render_translation blog-translation-demo.py -s BlogTranslationDemo -d blog-translation-demo -l tr -qh, which rendered the final video.

Check out the translation page in the docs for more details. You can also find the source code for this demo here.

Here is a Japanese translation, created the same way but with an AI voiceover:

Note that I have very little knowledge of Japanese, so the translation might be off, but I was still able to create it with services that are freely available online. This is to foreshadow how communities could create and translate educational videos in the future:

Video is created using Manim/Manim Voiceover and is open-sourced.
The repo is connected to a CI/CD service that tracks the latest changes, re-renders and deploys the video to a permalink.
When a translation in a language is requested, said service automatically generates it using AI translation and voiceover.
The community can then review the translation and voiceover, make changes if necessary, and record a human voiceover if they want to.
All the different versions and translations of the video are seamlessly deployed, similar to how ReadTheDocs deploys software documentation.

That is the main idea of my next project, GitMovie. If this excites you, leave your email address on the form on the website to get notified when it launches.

Conclusion

While using Manim Voiceover might seem tedious to some who are already using Manim with a video editor, I guarantee that it is overall more convenient than using a video editor when it comes to adding voiceovers to scenes. Feel free to create an issue if you have a use case that is currently not covered by Manim Voiceover.

What is even more interesting, Manim Voiceover can provide AI models such as GPT-4 with a convenient way to generate mathematically precise videos. Khan Academy has recently debuted a private release of Khanmigo, their GPT-4 based AI teacher. Imagine that Khanmigo could create a 3blue1brown-level explainer in a matter of minutes, for any question you ask! (I already tried to make GPT-4 output Manim code, but it is not quite there yet.)

To see why this is powerful, check out my video rendering of Euclid’s Elements using Manim Voiceover (part 1):

This video itself is pedagogically not very effective because books do not necessarily translate into good video scripts. But it serves as preparation for the point that I wanted to make with this post:

Having a machine-readable source and being able to program voiceovers allowed me to generate over 10 hours of video in less than a few days. In a few years, AI models will make such approaches 1000 times easier, faster and cheaper for everyone.

Imagine being able to auto-generate the “perfect explainer” for every article on Wikipedia, every paper on arXiv, every technical specification that would otherwise be too dense. In every language, available instantly around the globe. Universal knowledge, accessible by anyone who is willing to learn. Thanks to 3blue1brown, Manim and similar open source projects, all of this will be just a click away!

@onusoz · /2023/01/22· 10:44 AM

You can now translate voiceovers in your Manim scenes into other languages using @DeepLcom Blog post with examples coming soon @manim_community https://t.co/eNNlfvQgdf

@onusoz · /2023/01/17· 10:39 AM

Revamp complete https://t.co/pYqLgcro93 @TextCortex