Digital Information & File Formats

01

Digital Information: Bits, Bytes & Beyond

At the heart of every computer, smartphone, and digital device lies a remarkably simple concept: information expressed as a sequence of ones and zeros. Computers work by translating physical signals — such as electrical charges — into this binary form. Each individual one or zero is called a bit, the smallest possible unit of digital information.^[1]

Eight bits grouped together form a byte. A single byte can represent any value from 0 (eight zeros) to 255 (eight ones), giving 256 possible states. From there, we use the metric system to describe larger quantities of data.

Table 1 — Digital information units
Unit	Size	Common use
Bit	1 binary digit (0 or 1)	Smallest data unit; used in throughput measurements
Byte	8 bits	Single character of text; smallest addressable memory unit
Kilobyte (KB)	~1,000 bytes (1,024 in storage)	Small text files, short emails
Megabyte (MB)	~1 million bytes	Photos, songs, small documents
Gigabyte (GB)	~1 billion bytes	HD videos, app installs
Terabyte (TB)	~1 trillion bytes	Hard drives, large backups

Note on storage vs. networking: You will often see two different definitions. In storage contexts, 1 kilobyte = 1,024 bytes (powers of 2). In networking and marketing contexts, 1 kilobyte = 1,000 bytes (powers of 10). Both usages are common — context matters.^[1]

How Computers Encode Numbers and Text

Integers are stored using a portion of the available bits to represent the sign. In a 32-bit integer, the first bit indicates positive (0) or negative (1), and the remaining 31 bits encode the magnitude — yielding a range of roughly negative 2 billion to positive 2 billion.^[1]

Processing Speed

In a 64-bit processor, the CPU can read or write eight bytes at a time. Each such read/write is called a cycle, measured in hertz (Hz). Modern CPUs operate in the gigahertz range — meaning billions of cycles per second.^[1]

System Memory (RAM)

For processing, computers rely on Random Access Memory (RAM) — a fast, temporary workspace that clears when the machine powers off. The two main types are:

Dynamic RAM (DRAM) stores each bit as a charge in a tiny capacitor. Because capacitors discharge naturally, the system must refresh them every few milliseconds. Synchronous DRAM (SDRAM) is synchronized to the system bus for faster access. Physical RAM modules for desktops are called DIMMs; laptops use the smaller SO-DIMM form factor.^[1]

02

Encoding Text: ASCII & Unicode

Representing human-readable text in binary requires an agreed-upon mapping between characters and numbers. Two major systems have shaped how computers handle text worldwide.

Table 2 — Text encoding schemes
Scheme	Year	Capacity & notes
ASCII	1963	7-bit code; 128 characters. Covers the English alphabet, digits 0–9, and common punctuation. Codes 128–255 were later adapted for other Western languages, but international transfers often caused character corruption — accented characters would display as unrelated symbols on systems using different regional extensions.^[1]
Unicode	1992	16-bit+ encoding; capable of representing millions of characters across every human language, plus emoji. Most commonly implemented as UTF-8 or UTF-16. Web pages typically use UTF-8.^[1]

03

Digital Sound & Video Fundamentals

Sound and video were once recorded exclusively in analog form — physically represented on tape, film, or vinyl. Today they are captured digitally as sequences of binary values. Understanding a few core concepts helps explain why some files sound or look better than others.

Table 3 — Key terms for digital sound and video
Term	Definition
Sample	A single measurement of a sound wave's amplitude at a specific moment in time, converted to a numeric value.
Bit depth	How many bits are used to store each sample. 8-bit audio uses 1 byte per sample; 16-bit uses 2 bytes. Higher bit depth = greater dynamic range and sound quality.^[1]
Sampling rate	The number of samples captured per second, measured in hertz. Most modern recordings use 44.1 kHz (44,100 samples/second).^[1]
Frame rate	The number of still images (frames) shown per second in a video. Most digital video runs at approximately 30 frames per second.^[1]
Resolution	The pixel dimensions (width × height) of each video frame. Higher resolution produces a clearer image.^[1]
Color depth	How much memory is used to describe the color of each pixel. 8-bit = 256 colors per pixel; 24-bit ("true color") = over 16 million colors.^[1]

Display Technology

Computer monitors use an RGB display — each pixel is a set of three tiny lights: red, green, and blue. By varying the brightness of each light independently, a monitor can produce any color in the visible spectrum. When all three shine at full brightness, the eye perceives white; all three off produces black.^[1]

Resolution standards have their own naming conventions, from SVGA (800 × 600) up through 4K UHD (3840 × 2160). The aspect ratio — the proportion of width to height — also matters. The widescreen 16:9 ratio is now standard for HDTVs and most computer monitors, while older 4:3 content was designed for analog televisions.^[1]

04

Digital Image Formats

Digital images come in two fundamental types. Raster (bitmap) images store the color of every individual pixel — excellent for photographs, but the file size grows with resolution, and enlarging the image causes pixelation. Vector images store mathematical descriptions of shapes (points, curves, fills), allowing infinite scaling without quality loss; they are ideal for logos and icons but require specialized software to create.^[1]

Common image file formats

.jpg / .jpeg

Lossy

Joint Photographic Experts Group format. Compresses images at roughly 20:1 with minimal visible quality loss. Supports 24-bit color. The dominant format for web photos and social media.^[1]

.png

Lossless

Portable Network Graphics. Lossless compression with support for an alpha (transparency) channel — enabling transparent backgrounds. Ideal for web design and UI graphics.^[1]

.gif

Lossless

Graphics Interchange Format (1987). Limited to 256 colors, making it poor for photos but effective for simple icons and short looping animations.^[1]

.tiff

Uncompressed

Tagged Image File Format (1986). Uncompressed raster format optimized for high-quality print. Not natively supported by modern web browsers.^[1]

05

Digital Audio Formats

Before selecting an audio format, it helps to understand bitrate — the kilobits of audio data processed each second. Higher bitrate means better quality but larger files. Lossy formats reduce the bitrate by discarding audio data the ear is unlikely to notice, while lossless and uncompressed formats preserve every bit.^[1]

.wav / .aiff

Uncompressed

WAV (Microsoft/IBM, 1991) and AIFF (Apple, 1988) are uncompressed audio formats. Preferred by broadcast professionals for editing and archiving, but large file sizes make them impractical for mobile use.^[1]

.mp3

Lossy

MPEG Layer 3 audio. Developed by the Moving Pictures Experts Group (founded 1988), patented 1996. Uses auditory masking to reduce file size by 75–95% while maintaining perceived quality. Sparked a digital music revolution in the mid-1990s.^[1]

.aac

Lossy

Advanced Audio Coding (1997). Similar to MP3 but uses more sophisticated compression algorithms for better quality at the same file size. Used by YouTube, iOS, Android, and gaming platforms.^[1]

.wma

Lossy

Windows Media Audio (Microsoft, 1999). Developed to avoid MP3 licensing issues. Similar quality-to-size ratio as MP3 but primarily Windows-centric and less widely supported on other platforms.^[1]

.midi

Protocol

Musical Instrument Digital Interface. Unlike other audio formats, MIDI contains no actual audio waveforms — only instructions (which notes to play, when, and for how long). Easily editable and widely used with synthesizers and music production software.^[1]

06

Digital Video Formats & Codecs

Video files are among the largest in digital computing. To make them manageable, developers created video codecs — software that compresses and decompresses digital media. Without codecs, only the most powerful computers could handle raw video data.^[1]

How Codecs Work

Most codecs have two components: an encoder that compresses data when saving, and a decoder that decompresses it during playback. Some codecs include only one of the two.

Raw video data

→

Encoder
(compresses)

→

Compressed file
stored on disk

→

Decoder
(decompresses)

→

Playback
on screen

The codec encode/decode pipeline. Most consumer video formats use this two-step process.^[1]

Common codecs

MPEG-4 is one of the most widely used codecs, developed through several generations (MPEG-1, MPEG-2, MPEG-4). Its advanced compression algorithm can reduce a standard 120-minute film to approximately 300 MB.^[1]

DivX is a movie-encoding codec built on a variant of MPEG-4, often used to compress full films onto a single DVD. A high-definition variant, DivX HD, applies the same approach to HD content.^[1]

Common Video File Formats

.mp4

Lossy

MPEG-4 video. The most common internet video format due to small file size. Can also store audio, subtitles, and still images within the same container. Natively supported on virtually all devices and operating systems.^[1]

.avi

Uncompressed

Audio Video Interleave (Microsoft, 1992). An uncompressed container storing both audio and video. Files can be very large. Runs natively only on Windows and lacks many features of modern formats.^[1]

.mov

Lossy

Apple's QuickTime format. Similar capabilities to MP4 — supports video, audio, text tracks, and timecodes. Easier to edit than MP4 but has slightly less universal support. Playable on Windows with VLC or similar players.^[1]

.wmv

Lossy

Windows Media Video (Microsoft). Compressed format based on Microsoft's Advanced Systems Format (ASF), encoded with proprietary codecs. Requires additional software to play on macOS.^[1]

07

File Compression

As the volume of digital files created and transmitted each day has grown, storage and bandwidth have become constant concerns. File compression reduces file sizes to make them easier to store and share.^[1] There are two fundamentally different approaches.

Table 4 — Compression types compared
Type	How it works	Common formats
Lossless	Reduces file size without discarding any data. Decompressing a lossless file restores it bit-for-bit to its original state.^[1]	GIF, PNG, PDF, ZIP
Lossy	Achieves greater compression by permanently discarding data the human eye or ear is unlikely to detect. Ideal for images, video, and audio.^[1]	JPG, MP3, MP4
Run-Length Encoding (RLE)	A lossless algorithm that records repeated patterns rather than each individual bit. For example, thirty consecutive zeros are stored as "30, 0" rather than thirty separate values.^[1]	Used internally in many formats

Compression artifact: When a video or image is compressed too aggressively, patches of pixels can appear that have less detail than the surrounding image. These are called compression artifacts. They are common in highly compressed streaming video or educational content where file size is prioritized over visual sharpness.^[1]

08

3D Models & Rendering

3D graphics underpin film, gaming, product design, scientific visualization, and 3D printing. All 3D graphics are based on vectors — meaning they can be scaled to any size without pixelation — and are built through a multi-step process.^[1]

Step 1: Wireframing

Wireframing defines the shape using only vertices (points) and edges (lines connecting them), without any surfaces, textures, or lighting. These "wires" form the skeleton of the model. More polygons in the wireframe mean a more complex and detailed final result. Wireframes render quickly and are easy to edit, which is why animators use them before committing to the more computationally expensive rendering stage.^[1]

Step 2: Rendering

Rendering converts the 3D wireframe model into a 2D image, adding lighting, shadows, color shading, textures, and visual effects such as motion blur. The process is analogous to photography: the artist stages a scene with lighting and positioning to achieve a desired look, but every element must be constructed digitally. Rendering complex scenes can take significant time depending on scene complexity and available processing power.^[1]

Step 3: Ray Tracing

Ray tracing is an advanced rendering technique that simulates every ray of light produced by each light source in a scene, tracking how those rays interact with surfaces in real time. The result is highly realistic reflections, shadows, and lighting. Ray tracing is used in major films and modern video games, and can blend computer-generated objects seamlessly into live-action footage. It requires specialized, powerful hardware to execute efficiently.^[1]

3D printing: 3D graphic models can be converted into physical solid objects via 3D printing, making 3D modeling skills directly applicable to manufacturing and prototyping workflows.^[1]

Digital Information: Bits, Bytes & Beyond

How Computers Encode Numbers and Text

Processing Speed

System Memory (RAM)

Encoding Text: ASCII & Unicode

Digital Sound & Video Fundamentals

Display Technology

Digital Image Formats

Common image file formats

Digital Audio Formats

Digital Video Formats & Codecs

How Codecs Work

Common codecs

Common Video File Formats

File Compression

3D Models & Rendering

Step 1: Wireframing

Step 2: Rendering

Step 3: Ray Tracing

References & Sources