Blog

All things data operations in the machine learning age
Meet Nexla at Strata + Hadoop World, March 14 – 16

We’re happy to announce that Nexla will be presenting at the Startup Showcase next week at Strata+Hadoop World in San Jose! It’s an honor to be selected along with other up-and-coming big data companies. The pitch-off takes place Tuesday, March 14 at 6:30pm.

And don’t miss our CEO Saket Saurabh’s talk on Wednesday at 2:30pm in the Solutions Showcase. Learn “How to Automate Data Operations so You Can Build Machine Learning and Advanced Analytics.” Come swing by our booth in the Innovator Pavilion.

How to Easily Archive Vulnerable Environmental Data

The Internet Archive, through its Wayback Machine, has been capturing snapshots of the World Wide Web for over 20 years. But a new effort to save a specific type of Internet content, scientific data, has emerged among some scientists and professors. Groups like the Penn Program in Environmental Humanities and its offshoot Data Refuge are committed to preserving “…the facts we need at a time of ongoing climate change.”

These groups encourage others to download and store public scientific data. They’ve acted as a catalyst for groups around the country to host data meetups. At Nexla, we are committed to building tools that make it easier to collaborate with data. In that spirit, we’d like to share two of the methods we’ve found for archiving important data sets.

A first-principles analysis of the immigration EO

When faced with a difficult question, mathematicians like to solve from first principles. Engineers get down to nuts and bolts. Most of us call it “going back to basics.” Letting go of assumptions and relying on fundamental concepts is a great way to find a clean, high-quality solution to a difficult problem. As an engineer, I followed that approach in my work. As an entrepreneur, I did the same: one of the first things the Nexla founding team did was codify the core cultural values of our company. Those cultural values are our first principles, our nuts and bolts.

Data APIs are going the way of the dodo: What you need to know

At Nexla, we think of APIs as belonging to one of two categories: service or data. A service API is a building block for a developer, a way to hook into another application’s functionality. It’s how developers can build apps in Slack or add Google Maps to a web app. Data APIs, on the other hand, are a bit more limited. They allow developers to pull data from a source and then use that data however they see fit. They don’t offer any additional functionality or services. And that’s why they’re facing extinction.

Data Operations: Or How I Learned to Stop Data Wrangling and Love Machine Learning

Film buffs will remember the above scene from Stanley Kubrick’s iconic Dr. Strangelove, in which President Muffley says, “Gentlemen, you can’t fight in here! This is the War Room!” The absurd, satirical line is oddly applicable to what’s happening today on machine learning teams. The machine learning folks shouldn’t be data wrangling; they should be focused on machine learning. But because we don’t often receive our data in a usable format, valuable time is spent transforming, moving, and cleaning data.
