Chris Mattmann is a Principal Data Scientist and Chief Architect at NASA’s Jet Propulsion Laboratory in the Instrument and Data Systems section. He helped develop the third generation of the Apache Object Oriented Data Technology data processing and information integration system. Throughout his career, he has contributed to several cutting-edge projects, including the NASA Orbiting Carbon Observatory and the Soil Moisture Active Passive missions. Chris also serves as an Adjunct Associate Professor in the Computer Science Department at the University of Southern California.
We caught up with Chris to learn more about his work with NASA, his responsibilities as Principal Data Scientist, and what gets him excited about the future of data science.
1. The Square Kilometre Array (SKA), which will generate 700 terabytes of data per second once it is turned on. It’s the next-generation ground-based radio astronomy instrument. How are we going to process and store the data? What data is important to keep around? What data can we let go without analyzing it? I talked about this at QCon in 2014.
2. The DARPA Memex Project – the next generation of search engines and deep search. I would liken Memex to the way the ARPANET created and revolutionized the Internet: it will take the next steps and revolutionize search and information retrieval. We are mining and crawling the “Deep Web” and helping law enforcement and others automatically discern predators and victims in human trafficking and other illegal activities. We are also applying these lessons to better search, discover, and mine NASA’s scientific information. I did interviews with ABC/Univision and with NBC on this topic last year:
J. Rice and L. Lopez. Finding a Better Way to Search the Whole Internet. NBC Los Angeles, Channel 4. July 8, 2015.
D. Weiner-Bronner. Search Better: NASA is indexing the Deep Web to show mankind what Google won’t. June 10, 2015.