Helping others use data is "like giving them a superpower," says the senior data scientist at an ag-tech startup, Plenty.
Dana Seidel was "traipsing around rural Alberta, following herds of elk," trying to figure out their movement patterns, what they ate, what brought them back to the same spot, when she had an epiphany: Data could help answer these questions.
At the time, she was enrolled in a master's program at the University of Alberta and was interested in tracking the movement of deer, elk and other central foragers. Seidel realized that she could use her math and ecology background from Cornell University to help evaluate a model that could answer these questions. She continued her studies, earning a Ph.D. at the University of California, Berkeley, related to animal movement and the spread of diseases, which she monitored, in part, by collecting data from collars. The collars work kind of like a Fitbit, Seidel explained, "tracking wherever you go throughout the day," yielding GPS data points that could connect to land data, such as satellite images, offering a window into the movement of this wildlife.
Seidel, 31, has since transitioned from academia to the startup world, working as the lead data scientist at Plenty, an indoor vertical farming company. Or, as she would call herself, a "data scientist who is interested in spatial-temporal time series data."
Seidel was born in Tennessee, but grew up in Kansas. She's 31, which she said is "old" for the startup world. As someone who spent her twenties "investing in one career path and then switching over," she doesn't necessarily have the same industry experience as her colleagues. So while she is grateful for her experience, a degree is not a necessity, she said.
"I'm not sure that my Ph.D. helps me in my current job," she said. One area where it did help her, however, was by giving her access to internships (at Google Maps, in Quantitative Analysts and RStudio) where she gained experience in software development.
"But I don't think writing more papers about anthrax and zebras really convinced anybody that I was a data scientist," she said.
Seidel learned the programming language R, which she loved, in college, and in her master's program started building databases. She said she "generally taught myself alongside these courses to use the tools." The biggest skill of being a data scientist "may very well just be knowing how to Google things," she said. "That's all coding really is, creative problem-solving."
The field of data science is about a decade old, Seidel said; previously, it was statistics. "The idea of having somebody who has a statistics background or understands inferential modeling or machine learning has existed for a lot longer than we've called it a data scientist," she said, and a master's in data science didn't exist until the last year of her Ph.D.
Additionally, "data scientist" is a very broad title that covers many different jobs. "There are data scientists that focus very much on advanced analytics. Some data scientists only do natural language processing," she said. And the work encompasses many diverse skills, she said, including "project management skills, data skills, analysis skills, critical thinking skills."
Seidel has mentored others interested in getting into the field, starting with a weekly Women in Machine Learning and Data Science coffee hour at Berkeley. The first piece of advice? "I would tell them: 'You have skills,'" Seidel said. Many young students, especially women, don't realize how much they already know. "I don't think we communicate often to ourselves in a positive way, all of the things we know how to do, and how that might translate," she said.
For those interested in transitioning from academia to industry, she also advises getting experience in software development and best practices, which may have been missing from formal education. "If you understand things like standard industry practices, like version control and git and bash scripting a little bit so that you have some of that language, some of that knowledge, you can be a more effective collaborator." Seidel also recommends learning SQL (one of the easiest languages, in her opinion), which she calls "the lingua franca of data analytics and data science. Even though I think it's something you can absolutely learn on the job, it's going to be the main way you access data if you're working in an industry data science team. They're going to have large databases with data and you need a way to communicate that," she said. She also recommends building skills, through things like the 25-day Advent of Code, and other ways to demonstrate a clean coding style. "What takes a good amount of legwork, and until you have your industry job, it's unpaid legwork, but it can really help make you stand out," she said.
On a typical morning at her current job, working from home, Seidel is drinking coffee and answering Slack messages in her home office/quilting studio. She checks to see if there are questions about the data, something wrong with the dashboard, or a question about plant health. Software engineers working on the data may also have questions, she said. There's often a scrum meeting in the morning, and the teams operate in sprints (meeting every two weeks) with agile workflows.
"I have a pretty unique position where I can float between various data scrums we do, we have a farm performance scrum versus a perception team or a data infrastructure team," Seidel explained. "I can decide: What am I going to contribute to in this sprint?" Twice a week there's a leadership meeting, which she attends alongside the software and data leads, and where she can listen in on what else is being worked on and what's coming up ahead. She said it is one of the most important meetings for her, since she can hear directly "when a change is happening on the software side or there's a new requirement coming out of ops for software or for data that's coming."
In the afternoon, she has a good block of development time, "to dig into whatever issue I'm working on that sprint," she said.
Seidel manages the data warehouse and ensures data streams are "being surfaced to end users in core data models." Last week, she worked on the farm performance scrum, "validating measurements that are coming out of the farm, thinking ahead about the new measurements we need to be collecting, and thinking about the measurements that we have in our South San Francisco farm, measurements streaming in from a couple of thousand devices." She needs to ensure the accuracy of measurement streams, which cover everything from temperature to irrigation, in order to safeguard plant health and answer questions like: "Why did last week's arugula do better than this week's arugula?"
The primary task is to know if they're measuring the right thing, and to push back and say, "Oh, OK, what is it that you want that data to be explaining? What is the question you're asking?" She needs to stay a few steps ahead, she said, and ask: "What are all the new data sources that I need to be aware of that we need to be supporting?"
The toughest part of the job? "I really hate not having the answer. I hate having to say, 'No, we don't measure that thing yet.' Or, 'We'll have that in the next sprint.'" Balancing giving people the answers with giving them tools to access the answers themselves is a daily challenge, she said, with the ultimate goal of making data accessible.
And saying, "Oh, yes, that data is there and it's this simple query," or, "Oh, have you seen this tool I built a year ago that can solve this problem?" is really gratifying.
"Helping someone learn how to ask and answer questions from data is like giving them a superpower," Seidel said.