How We Automatically Analyzed 1,000 Crash Dumps in Ten Minutes

BrightSource’s production environment is disconnected from the main network, and there's no easy way to perform remote debugging or get live monitoring data from the field. By configuring Windows to automatically generate crash dumps, we obtained hundreds of dump files from multiple servers while working on our latest release. There was no way we could analyze these dumps by hand, triage the bugs, and assign them to the relevant developers.

That's when I discovered ClrMD – a Microsoft NuGet package that provides a dump debugging API. In this talk I'll explain how we built a tool with ClrMD to automatically analyze dumps and obtain exception information, thread call stacks, and memory usage.

We also integrated this tool with Redmine, a popular project and work item management tool, to file bugs automatically to the relevant developer based on the tool's initial analysis.

About the speaker - Dina Goldshtein

Dina Goldshtein is an Embedded Software Team Leader at BrightSource Energy, a solar energy company.

Dina leads a team of embedded software engineers building systems for real-time processors, develops infrastructure components in C#, C++, and JavaScript, and spends a lot of time improving processes, tools, and operations.

Event Leaders

GigaSpaces Technologies

Platinum Sponsors

AOL On Monitis

Gold Sponsors

Cloudify by GigaSpaces JFrog Chef Stratoscale BigPanda Wix Engineering Outbrain SimilarWeb

Silver Sponsors

Librato Metrics Rounds PagerDuty Kenshoo Anodot