On Thursday a rocket failed. Three humans remain on the ISS. What’s next?

Posted on October 15, 2018 by mea

NASA’s strong preference is to keep astronauts aboard the station. But Todd said NASA does have procedures for operating the station without crew on board. “That’s something that we’re always prepared for,” he said. “I feel very confident that we could fly for a significant period of time.”

Source: On Thursday a rocket failed. Three humans remain on the ISS. What’s next? | Ars Technica

Kepler Spacecraft in Emergency Mode

Posted on April 12, 2016 by mea

The last regular contact with the spacecraft was on April. 4. The spacecraft was in good health and operating as expected.

Kepler completed its prime mission in 2012, detecting nearly 5,000 exoplanets, of which, more than 1,000 have been confirmed. In 2014 the Kepler spacecraft began a new mission called K2. In this extended mission, K2 continues the search for exoplanets while introducing new research opportunities to study young stars, supernovae, and many other astronomical objects.

Source: Mission Manager Update: Kepler Spacecraft in Emergency Mode | NASA

Also From: Kepler Reaction Wheel Failure Cripples Spacecraft, but Mission Thrives

To save on bandwidth, Kepler only downlinks data from the pixels associated with 156,000 target stars out of the millions of stars in the Kepler field. Data from an “aperture” of pixels around each target star are downlinked to Earth, and computer programs on Earth measure the brightness of the star based on the light that hit the pixels in the aperture. If the telescope pointing is not good enough to keep the target stars in their respective apertures on the pixels, it is impossible to measure the brightness of those stars with a precision of 20 parts per million.

Update From: Kepler telescope readies for new mission after communications scare

Once the spacecraft checks out, Kepler will kick off its latest effort, looking toward the galactic center for planets whose gravity distorts the light from far more distant stars. This technique, known as gravitational microlensing, has been used with ground-based telescopes to discover about 46 planets, some of them orphaned from their parent stars. But the method is a first for Kepler, which searches for dips in starlight caused by planets crossing in front of their suns.

A400M probe focuses on impact of accidental data wipe

Posted on June 11, 2015 by mea

Computers operating each engine cannot work if this data, which is unique to each of the turboprops, is missing.

Source: Exclusive: A400M probe focuses on impact of accidental data wipe | Reuters

Under the A400M’s design, the first warning pilots would receive of the engine data problem would be when the plane was 400 feet (120 meters) in the air, according to a safety document seen by Reuters. On the ground, there is no cockpit alert.

Sounds like these data files became a single point of failure.

How a dumb software glitch kept thousands from reaching 911

Posted on October 22, 2014 by mea

At first, Intrado thought that the complaints arising from various PSAPs around the country were just isolated, unconnected events — even though alarm bells were going off an hour into the breakdown. Nobody noticed the warnings until it was too late; the server taking note of the alerts categorized them as “low level” incidents and were never flagged for a human, according to the FCC report.

via How a dumb software glitch kept thousands from reaching 911 – The Washington Post.

PSAP = Poor Sucker At Phone

Creating a Centralized Syslog Server

Posted on April 18, 2013 by mea

For this article, I’ll be focusing on syslog-ng as this is more up to date, and if the reader wishes, can be ‘supported’ via the company that owns the syslog-ng software by going with their enterprise edition version at a later date.

via Creating a Centralized Syslog Server | Linux Journal.

This is a good tutorial to get going with syslog-ng. Monitoring events being logged into syslog can provide ample warning when a server is about to die.

Mars Rover Curiosity in Safe Mode After Computer Glitch

Posted on March 4, 2013 by mea

The issue cropped up Wednesday (Feb. 27), when the spacecraft failed to send its recorded data back to Earth and did not switch into its daily sleep mode as planned. After looking into the issue, engineers decided to switch the Curiosity rover from its primary “A-side” computer to its “B-side” backup on Thursday at 5:30 p.m. EST (22:30 GMT). [Curiosity Rover’s Latest Amazing Mars Photos]

via Mars Rover Curiosity in Safe Mode After Computer Glitch | Space.com.

SpaceX overcomes thruster problems with cargo ship

Posted on March 1, 2013 by mea

Six-and-a-half hours after launch, follwoing extensive troubleshooting and analysis, it appeared company engineers had resolved the problem, bringing all four sets of thrusters on line and setting the stage for a delayed rendezvous with the space station.

via SpaceX overcomes thruster problems with cargo ship | Cutting Edge – CNET News.

Netflix Gives Data Center Tools to Fail

Posted on November 27, 2012 by mea

Netflix has released Hystrix, a library designed for managing interactions between distributed systems, complete with “fallback” options for when those systems inevitably fail.

The code for Hystrix—which Netflix tested on its own systems—can be downloaded at Github, with documentation available here, in addition a getting-started guide and operations examples, among others.

via Netflix Gives Data Center Tools to Fail.

Netflix will also release the real-time dashboard it uses for monitoring Hystrix. That dashboard relies on a traffic-light system to display service dependencies for the last ten seconds, with colors measuring latency and the size of the circles showing traffic.

That smooth SpaceX launch? Turns out one of the engines came apart

Posted on October 8, 2012 by mea

The Falcon 9, as its name implies, has nine engines, and is designed to go to orbit if one of them fails. On-board computers will detect engine failure, cut the fuel supply, and then distribute the unused propellant to the remaining engines, allowing them to burn longer. This seems to be the case where that was required, and the computers came through. The engines are also built with protection to limit the damage in cases where a neighboring engine explodes, which appears to be the case here.

via That smooth SpaceX launch? Turns out one of the engines came apart | Ars Technica.

Bucktown Bell

Cut to the chase.

Tag Archives: fault management