Toggle light / dark theme

Exploring a space-based, scalable AI infrastructure system design

Artificial intelligence (AI) is a foundational technology that could reshape our world, driving new scientific discoveries and helping us tackle humanity’s greatest challenges. Now, we’re asking where we can go to unlock its fullest potential.

The Sun is the ultimate energy source in our solar system, emitting more power than 100 trillion times humanity’s total electricity production. In the right orbit, a solar panel can be up to 8 times more productive than on earth, and produce power nearly continuously, reducing the need for batteries. In the future, space may be the best place to scale AI compute. Working backwards from there, our new research moonshot, Project Suncatcher, envisions compact constellations of solar-powered satellites, carrying Google TPUs and connected by free-space optical links. This approach would have tremendous potential for scale, and also minimizes impact on terrestrial resources.

We’re excited about this growing area of exploration, and our early research, shared today in “Towards a future space-based, highly scalable AI infrastructure system design,” a preprint paper, which describes our progress toward tackling the foundational challenges of this ambitious endeavor — including high-bandwidth communication between satellites, orbital dynamics, and radiation effects on computing. By focusing on a modular design of smaller, interconnected satellites, we are laying the groundwork for a highly scalable, future space-based AI infrastructure.

Scientists smash record in stacking semiconductor transistors for large-area electronics

King Abdullah University of Science and Technology (KAUST; Saudi Arabia) researchers have set a record in microchip design, achieving the first six-stack hybrid CMOS (complementary metal-oxide semiconductor) for large-area electronics. With no other reported hybrid CMOS exceeding two stacks, the feat marks a new benchmark in integration density and efficiency, opening possibilities in electronic miniaturization and performance.

A paper detailing the team’s research appears in Nature Electronics.

Among microchip technologies, CMOS microchips are found in nearly all electronics, from phones and televisions to satellites and medical devices. Compared with conventional silicon chips, hybrid CMOS microchips hold greater promise for large-area electronics. Electronic miniaturization is crucial for flexible electronics, smart health, and the Internet of Things, but current design approaches are reaching their limits.

Swarm reveals growing weak spot in Earth’s magnetic field

Using 11 years of magnetic field measurements from the European Space Agency’s Swarm satellite constellation, scientists have discovered that the weak region in Earth’s magnetic field over the South Atlantic—known as the South Atlantic Anomaly—has expanded by an area nearly half the size of continental Europe since 2014.

Earth’s magnetic field is vital to life on our planet. It is a complex and dynamic force that protects us from cosmic radiation and charged particles from the sun.

It is largely generated by a global ocean of molten, swirling liquid iron that makes up the around 3,000 km beneath our feet. Acting like a spinning conductor in a bicycle dynamo, it creates electrical currents, which in turn, generate our continuously changing —but in reality the processes that generate the field are far more complex.

Magnetic ‘switchback’ detected near Earth for the first time

In recent years, NASA’s Parker Solar Probe has given us a close-up look at the sun. Among the probe’s revelations was the presence of numerous kinks, or “switchbacks,” in magnetic field lines in the sun’s outer atmosphere. These switchbacks are thought to form when solar magnetic field lines that point in opposite directions break and then snap together, or “reconnect,” in a new arrangement, leaving telltale zigzag kinks in the reconfigured lines.

In their article published in the Journal of Geophysical Research: Space Physics, E. O. McDougall and M. R. Argall now report observations of a switchback-shaped structure in Earth’s magnetic field, suggesting that switchbacks can also form near planets.

The researchers discovered the switchback while analyzing data from NASA’s Magnetospheric Multiscale mission, which uses four Earth-orbiting satellites to study Earth’s magnetic field. They detected a twisting disturbance in the outer part of Earth’s magnetosphere—the bubble of space surrounding our planet where a cocktail of charged particles known as is pushed and pulled along Earth’s .

Radical Plan to Beam ‘Sunlight on Demand’ at Night Sparks Concerns

A proposed constellation of satellites has astronomers very worried. Unlike satellites that reflect sunlight and produce light pollution as an unfortunate byproduct, the ones by US startup Reflect Orbital would produce light pollution by design.

The company promises to produce “sunlight on demand” with mirrors that beam sunlight down to Earth so solar farms can operate after sunset.

It plans to start with an 18-metre test satellite named Earendil-1 which the company has applied to launch in 2026. It would eventually be followed by about 4,000 satellites in orbit by 2030, according to the latest reports.

Topsicle: a method for estimating telomere length from whole genome long-read sequencing data

Long read sequencing technology (advanced by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (Nanopore)) is revolutionizing the genomics field [43] and it has major potential to be a powerful computational tool for investigating the telomere length variation within populations and between species. Read length from long read sequencing platforms is orders of magnitude longer than short read sequencing platforms (tens of kilobase pairs versus 100–300 bp). These long reads have greatly aided in resolving the complex and highly repetitive regions of the genome [44], and near gapless genome assemblies (also known as telomere-to-telomere assembly) are generated for multiple organisms [45, 46]. The long read sequences can also be used for estimating telomere length, since whole genome sequencing using a long read sequencing platform would contain reads that span the entire telomere and subtelomere region. Computational methods can then be developed to determine the telomere–subtelomere boundary and use it to estimate the telomere length. As an example, telomere-to-telomere assemblies have been used for estimating telomere length by analyzing the sequences at the start and end of the gapless chromosome assembly [47,48,49,50]. But generating gapless genome assemblies is resource intensive and cannot be used for estimating the telomeres of multiple individuals. Alternatively, methods such as TLD [51], Telogator [52], and TeloNum [53] analyze raw long read sequences to estimate telomere lengths. These methods require a known telomere repeat sequence but this can be determined through k-mer based analysis [54]. Specialized methods have also been developed to concentrate long reads originating from chromosome ends. These methods involve attaching sequencing adapters that are complementary to the single-stranded 3′ G-overhang of the telomere, which can subsequently be used for selectively amplifying the chromosome ends for long read sequencing [55,56,57,58]. While these methods can enrich telomeric long reads, they require optimization of the protocol (e.g., designing the adapter sequence to target the G-overhang) and organisms with naturally blunt-ended telomeres [59, 60] would have difficulty implementing the methods.

An explosion of long read sequencing data has been generated for many organisms across the animal and plant kingdom [61, 62]. A computational method that can use this abundant long read sequencing data and estimate telomere length with minimal requirements can be a powerful toolkit for investigating the biology of telomere length variation. But so far, such a method is not available, and implementing one would require addressing two major algorithmic considerations before it can be widely used across many different organisms. The first algorithmic consideration is the ability to analyze the diverse telomere sequence variation across the tree of life. All vertebrates have an identical telomere repeat motif TTAGGG [63] and most previous long read sequencing based computational methods were largely designed for analyzing human genomic datasets where the algorithms are optimized on the TTAGGG telomere motif. But the telomere repeat motif is highly diverse across the animal and plant kingdom [64,65,66,67], and there are even species in fungi and plants that utilize a mix of repeat motifs, resulting in a sequence complex telomere structure [64, 68, 69]. A new computational method would need to accommodate the diverse telomere repeat motifs, especially across the inherently noisy and error-prone long read sequencing data [70]. With recent improvements in sequencing chemistry and technology (HiFi sequencing for PacBio and Q20 + Chemistry kit for Nanopore) error rates have been substantially reduced to 1% [71, 72]. But even with this low error rate, a telomeric region that is several kilobase pairs long can harbor substantial erroneous sequences across the read [73] and hinder the identification of the correct telomere–subtelomere boundary. In addition, long read sequencers are especially error-prone to repetitive homopolymer sequences [74,75,76], and the GT-rich microsatellite telomere sequences are predicted to be an especially erroneous region for long read sequencing. A second algorithmic consideration relates to identifying the telomere–subtelomere boundary. Prior long read sequencing based methods [51, 52] have used sliding windows to calculate summary statistics and a threshold to determine the boundary between the telomere and subtelomere. Sliding window and threshold based analyses are commonly used in genome analysis, but they place the burden on the user to determine the appropriate cutoff, which for telomere length measuring computational methods may differ depending on the sequenced organism. In addition, threshold based sliding window scans can inflate both false positive and false negative results [77,78,79,80,81,82] if the cutoff is improperly determined.

Here, we introduce Topsicle, a computational method that uses a novel strategy to estimate telomere lengths from raw long read sequences from the entire whole genome sequencing library. Methodologically, Topsicle iterates through different substring sizes of the telomere repeat sequence (i.e., telomere k-mer) and different phases of the telomere k-mer are used to summarize the telomere repeat content of each sequencing read. The k-mer based summary statistics of telomere repeats are then used for selecting long reads originating from telomeric regions. Topsicle uses those putative reads from the telomere region to estimate the telomere length by determining the telomere–subtelomere boundary through a binary segmentation change point detection analysis [83]. We demonstrate the high accuracy of Topsicle through simulations and apply our new method on long read sequencing datasets from three evolutionarily diverse plant species (A. thaliana, maize, and Mimulus) and human cancer cell lines. We believe using Topsicle will enable high-resolution explorations of telomere length for more species and achieve a broad understanding of the genetics and evolution underlying telomere length variation.

The Sun’s Poles Hold the Key to Its Three Greatest Mysteries

The Sun’s poles may hold answers to long-standing mysteries about magnetic cycles, solar wind, and space weather. The polar regions of the Sun remain one of the least explored areas in solar science. Although satellites and ground-based observatories have captured remarkable details of the Sun’s su

/* */