What’s inside the leaked Yandex source code?

Daniel Bunte
5 min readJan 28, 2023
A computer screen showing a software interface to write programs in
Photo by Riku Lu on Unsplash

Yandex, a Russian tech giant, was a recent victim of hackers who published over 40 GiB of the company’s source code. The company denied being hacked.

Whether or not they were hacked, the interesting question for many is “What’s inside?”

The leaked sources are packaged in individual components, such as analytics, cloud, portal, and so forth. Keeping in mind computer programs only contain text files, seeing over 40 GiB would make up quite a few Lines of Code (LoC). I actually asked ChatGPT for an answer:

How many lines of code make up 40 GiB?

It is difficult to determine the exact number of lines of code that make up 40 GiB without more information about the specific programming languages and coding practices used. However, it is worth noting that the number of lines of code does not directly correspond to the size of a program or file, as the size can be affected by factors such as code comments, white space, and data storage. Additionally, 1 Gb is equivalent to 1000MB, and 40 Gb is 40,000 MB. If we assume that an average line of code is around 150 characters, 40,000MB is approximately to 40,000,000,000 characters. So 40,000,000,000/150 is approximately to 266,666,666 lines of code.

That’s ~266 million LoC. Even for a tech giant, this is a whole lot of code. For…

--

--

Daniel Bunte

Leader, self-taught developer with interests in Architecture/Automation/Security.