Category Archives: Data Science
Data Forward: CDSS in the Harvard Data Science Review | Computing, Data Science, and Society – UC Berkeley
Were excited that the Harvard Data Science Review (HDSR), an award-winning journal and a leading voice for data science, is spotlighting the UC Berkeley Division of Computing, Data Science, and Society (CDSS) and its leadership of data science at Berkeley and throughout the UC system.
This issue of HDSR examines our work from a variety of different perspectives, giving a comprehensive look into all things data science at Berkeley. It also provides valuable insight into how this fieldone thats rapidly transforming science, business, civic, and daily lifeis coming into its own.
First, theres a wide-ranging conversation between HDSR Founding Editor in Chief Xiao-li Meng, President Michael Drake of the University of California system, and UC Berkeley Associate Provost Jennifer Chayes. Originally conducted in January of this year, it explores the role of data in society, what the UC System is doing in the area of data science education, and how it fits into our larger purpose as Californias premier public higher educational institution.
In the interview, President Drake underscored the importance of values and equity as a key part of the mission of data science at the University of California. Theres a line in the play Hamilton, a song called In the Room Where It Happened...There are places where decisions are made that change us and society as we go forward. And power in the 18th century meant sitting around the table where those decisions were being made so that you could influence that futureWhen we're all in those rooms and those things are happening, I just want to make sure that social equity is in those rooms as well, that we bring that to those discussions and to the work that we're doing. So that would be something I'd ask for data science.
Next, in Data Science and computing at UC Berkeley, Jennifer Chayes lays out CDSSs pioneering vision and progress in building a university-wide entity for data science and computing to address the opportunities and challenges of our times. The article includes a discussion of Associate Provost Chayess thoughts by three deans of UC Irvines Donald Bren School of Information and Computer Sciences: founding dean Debra Richardson, former dean and present UC Irvine provost Hal Stern, and current dean Marios Papaefthymiou. There are also three other discussion pieces written by leaders of other institutions. (These rejoinder pieces will be made available later in May.)
Writing about the larger frame for CDSS, Chayes states, With a human-centered approach to data science and computing, we will be able to better frame the perspective and purpose and to anticipate the problems, in this new medium. But we will also need tremendous innovation in the core of computing and statistics to move the field dramatically forward, to address the most difficult technical challenges and threats, and to realize the greatest opportunities.
But theres more: Professor Michael Jordan of EECS co-authored Interleaving Computational and Inferential Thinking: Data Science for Undergraduates at Berkeley with Ani Adhikari of Statistics and John DeNero of EECS. It explores the thinking behind the undergraduate data science curriculum and its far-reaching effects in reshaping the undergraduate experience at Berkeley.
Launched in 2019, HDSR, which is published by the MIT Press, is a leading journal whose goal is to provide authoritative but accessible peer-reviewed content to define and shape data science as a scientifically rigorous and globally impactful multidisciplinary field. Earlier this year, it won the 2021 Professional and Scholarly Excellence (PROSE) Award for best new journal in science, technology, and medicine from the Association of American Publishers.
HDSRof which Associate Provost Chayes is a co-editoris published quarterly. It features research articles, discussion papers, special columns, interviews, conversations, short essays, news, and stories.
Explore the issue here and sign up for the HDSR newsletter to keep tuned to the fast-moving area of data science and its growing influence on society and humanity.
Continue reading here:
Reasons Why Data Science Will Continue To Be the Most Desirable Job Of the Decade – Entrepreneur
Organizations are looking to create a skilled talent pool that can provide technical expertise and move faster in the competitive environment
Stay informed and join our daily newsletter now!
May5, 20214 min read
Opinions expressed by Entrepreneur contributors are their own.
You're reading Entrepreneur India, an international franchise of Entrepreneur Media.
In recent years, data science has emerged as one of the most prominent career choices for fresh graduates as well as working professionals across hierarchical levels. The demand for skilled professionals in the field of data science has grown remarkably owing to the urgent need for strategic decision-making tailored for specific regions. Organizations are looking to create a skilled talent pool that can provide technical expertise and move faster in the competitive environment. According to Han Digital, the demand for data science experts has grown across sectors, IT services companies employ the majority (40 per cent) of the data science talent pool from India, followed by global in-house centres or captives in India with a 30 per cent market share. Here are key reasons for this growing demand.
Data has become the backbone of business decision making: Organizations generate a huge amount of data regularly. According to estimates, the total amount of data created, copied, captured and consumed in the world is likely to reach 149 zettabytes by 2024, as compared to two zettabytes generated in the year 2010. Data science has proven to be a powerful tool to extract meaningful insights from this large chunk of data. These insights help organizations in determining any prominent changes that are to be made basis the changing consumer behavior, shortcomings of previous solutions, forthcoming challenges and competition analysis. Most of the organizations today rely on strategic decisions backed by technology. This has become a major factor that has fueled the demand for skilled talent in the field of data science.
Dearth of skilled talent pool: While the demand for professionals adept in data science skills is at an all-time high, there is a major demand-supply gap due to the non-availability of skilled talent. In order to meet this acute shortage, many organizations are investing in training their employees through tie ups with edtech companies as well as launching internal training programs.
Highly lucrative career opportunity: Owing to the massive dearth of skilled professionals in the domain, organizations are willing to pay a premium and offer long-term incentives to attract and retain talent. According to Michael Pages 2021 India Talent Trends report, professionals with 3-10 years of experience receive an annual salary ranging between INR 25-65 lakh and those with more experience can command packages upwards of INR 1 crore. It further mentioned that experts with more than 15 years of experience can receive a salary up to INR 1.8 crore.
Omnipresence of jobs: Whether large or small, organizations irrespective of their sizes and industries are banking on technology for better efficiency. Industries such IT, BFSI, retail, healthcare, telecommunication and e-commerce are actively hiring data analytics professionals with salary increments to the tune of 35-45 per cent. Data science professionals today act as the backbone for organizations to harness data and meet their strategic goals.
Low entry barriers for existing professionals: Data science domain comes with very few entry barriers and allows for professionals from diverse backgrounds to join the workforce. It demands candidates to have proficiency in technical as well as business skills and with the availability of quality learning programs, anyone with the willingness to learn has the opportunity to enter this domain. The secret sauce lies in learning the required skills as well as improving the existing ones. Individuals can acquire good statistical and data mining techniques, programming languages, machine learning and advanced analytics by enrolling for a high quality upskilling program.
A large selection of roles within the field: The field of data science has a lot to offer to freshers and professionals. One may choose to opt for a job role based on their interest as well as experience level. Some of the job roles that are high in demand include data scientist, data architect, BI engineer, business analyst, data engineer, database administrator, data and analytics manager.
With the growth data science is witnessing across the globe, there are massive job opportunities available across industries resulting in the huge demand for skilled talent in this domain. Anyone who is looking to create an impact in this field should identify the skills required and should focus on acquiring the same by opting for the right learning program that best suits their profile.
Read the original here:
Reasons Why Data Science Will Continue To Be the Most Desirable Job Of the Decade - Entrepreneur
OHIO Trustee, alumna Anna Harvey named president and CEO of the Social Science Research Council – Ohio University
Ohio University Trustee and 1988 Honors Tutorial College alumna Anna Harvey has been named the 15thpresident and CEO of the Social Science Research Council (SSRC), an independent and international research nonprofit organization.
I am deeply honored to have been asked to lead the SSRC as it approaches its centenary, Professor Harvey said in a release from the SSRC. The SSRC was founded to support social science in the public interest, with the ultimate goal of more effective and more equitable public policy. There is much important and hard work to be done to reach that goal, and the SSRC is uniquely positioned to unite the research, policy, and philanthropic communities in the work of understanding how our societies can better support human well-being around the globe. I look forward to contributing to that work.
Harvey, a political scientist and affiliated professor of data science and affiliated professor of law at New York University, is the founding director of the Public Safety Lab at New York University, a community-engaged research initiative that draws on the dual lenses of social science and data science to provide insight into mass incarceration and recommendations for improvement of the criminal justice system.
She was appointed as a national trustee to the Ohio University Board of Trustees in October 2019. She also served on the Board of Visitors for the Honors Tutorial College from 2013-2016 and earned the 2011 Outstanding Alumna Award for the college and the 1996 Outstanding Alumna Award from the Ohio University Department of Political Science.
This is a tremendous opportunity, and I am confident that Trustee Harveys extensive leadership experience will serve her well, Ohio University President M. Duane Nellis said. Her alma mater is extremely proud of all that shes accomplished.
Harvey, an Athens native, received her Ph.D.in political science from Princeton University in 1995. While studying in the Honors Tutorial College at OHIO, she also received a Harry S. Truman Scholarship, in recognition of academic excellence and commitment to public service.
The SSRC builds networks, working with partners in the United States and around the world to link research to practice and policy, strengthen individual and institutional capacities for learning, and enhance public access to information. Whether confronting emerging issues, like the global pandemic, or persistent challenges, such as inequality and climate change, the SSRC challenges established knowledge pathways and builds new ones to a better world.
Originally posted here:
Personalizing the iSchool Experience with Electives and Faculty Assistantships – iSchool | Syracuse University – Syracuse University News
When Raj Desai G21 decided to leave his home country of India to attend graduate school in the United States, he looked for one thing in particular: practical experience. He graduated in 2019 with a Bachelors degree in computer science. Still, after hearing about his friends experiences studying abroad, he felt that the iSchool was the right place for him to get hands-on experience in the data science field.
Desai got involved in extracurricular activities, participated in faculty research, and started building his Masters program for Applied Data Science. He was also excited about the opportunity to diversify his academic interests by taking a wide variety of courses.
When I was an undergrad, I didnt get to choose any of my courses, said Desai. Now that Im at the iSchool, I can build a portfolio of courses that Im interested in, and even take classes outside of the iSchool in computer science or at the Whitman School of Management, he said.
One of Desais most significant projects outside his coursework is his role as a Graduate Faculty Assistant working with professor Bei Yu in text mining. The research focuses on building a model that explains peoples health beliefs about COVID-19 by annotating text to look for patterns. Desai is working on building a data set as part of the model to predict individual views about the pandemic.
His role as a faculty assistant also requires him to work with students taking courses in natural language processing and answering their text mining questions.
This job taught me how to help other students get into the data science field and use their projects we work on in class to get jobs and internships, Desai said. I love interacting with the students, even with our current virtual format during the pandemic.
This spring, Desai worked for IPwe, a fintech startup based out of Dallas, Texas, as a data science intern working on extracting genomic sequences from thousands of documents. He then mapped this information with the organisms. He worked on another project based on financial data using SQL.
Desai also worked on a research project at the iSchool during the summer in which he analyzed more than 2000 documents to model energy trends in the United States. He simultaneously completed a summer internship at Tangible AI, a nonprofit startup based out of La Jolla, California. While his summer consisted of 80 hour work weeks at times, Desai is grateful for the experiences because he believes it helped him get ahead in his field.
He also gained practical experience in data analytics by working as an analyst for iConsult, the iSchools collaborative that connects students with real-world clients who need help with digital transformation projects. Desai worked on a project in the fall that built a chatbot that helped individuals with disabilities find jobs that match their skill sets. His role was to clean a data set using Python about what types of jobs are available for people with disabilities and what those individuals feel are their most essential skills. He then created dashboards and data visualization that the chatbot used to connect users with job opportunities.
All of these experiences have helped Desai prepare for his current job search. He will graduate in May and hopes to find a role as a data scientist or a data engineer. His work as a faculty research assistant and at iConsult has developed his interest in working in a position where he can use machine learning in the initial stages of preparing data for analysis.
The faculty and staff have been so supportive of my job search in all aspects. They are so enthusiastic, and many career center staff members like Jeffery Fouts are always willing to help me prepare for interviews, Desai said.
Desai isnt sure exactly where hell end up after graduation, but he feels very optimistic about improving economic conditions amid the ongoing pandemic. He has many interviews lined up for full-time positions post-graduation. No matter where he lands, Desai is confident that his skills and experiences in data science from the iSchool will help him build the career of his dreams.
Visit link:
ZS and Seven Bridges Partner to Innovate Drug Research and Propel Precision Medicine – PRNewswire
BOSTON, May 6, 2021 /PRNewswire/ --ZS andSeven Bridges today announced a partnership that will combine ZS's expertise in biomedical research services and data science with Seven Bridges technology platform enabling multi-omics analytics, spanning the translational to clinical continuum. The combination provides biopharmaceutical and biotechnology companies with a one-stop solution for innovation and scalability for multi-omics analysis. Together, ZS and Seven Bridges help clients create breakthrough science, accelerate drug discovery and increase the probability of success for new drug candidates.
"Working side by side with our clients to help streamline R&D data and increase speed to market to improve patients' lives is at the core of what we do within ZS's research and development efforts," said Aaron Mitchell, principal and leader of ZS's research and development excellence practice. "Our partnership with Seven Bridges provides our clients, and ultimately patients, a faster path to new treatments and diagnostics."
"Drug discovery, translational medicine and preclinical drug development all require analysis at the molecular and genomic level of resolution, and our partnership with Seven Bridges gives our clients a new option for innovation and scalable capacity," said John Piccone, principal and leader of ZS's biomedical research service line. "We are building a talent pool of molecular natives with a deep understanding of science, data science and technology to partner with our clients in the identification of new drug targets, selection of drug candidates and to address key challenges in translational medicine. The combination of Seven Bridges platform and experts with ZS consulting teams provides a scalable solution for clients with growing multi-omics data and analytics needs."
"We are excited to combine the Seven Bridges platform with ZS's innovative apps for drug discovery and translational medicine. Integrating our multi-omics analysis platform and ARIATM, a centralized solution for molecular and patient-level phenotypic analysis at scale, with their research data lake is a key requirement to fit the unique needs of ZS's pharmaceutical clients," said Bruce Press, Chief Revenue Officer at Seven Bridges.
About Seven BridgesSeven Bridges enables researchers to extract meaningful insights from genomic and phenotypic data in order to advance precision medicine. The Seven Bridges Ecosystem consists of a compliant analytic platform, intelligently curated content, transformative algorithms, unprecedented access to federated data sets, and expert on-demand professional services. This holistic approach to bioinformatics is enabling researchers at the world's leading academic, biotechnology, clinical diagnostic, government, medical centers, and pharmaceutical entities to increase R&D efficiency, enhance the hypothesis resolution process, isolate critical biomarkers, and even turn a failing clinical trial around while also reducing computational workflow times and data storage costs. To learn more, visit sevenbridges.com or follow us onLinkedIn andTwitter.
Media ContactsEric SchubertSeismic for Seven Bridges+1 415 692 6799[emailprotected]
SOURCE Seven Bridges
Read the original post:
ZS and Seven Bridges Partner to Innovate Drug Research and Propel Precision Medicine - PRNewswire
nib chief information officer on pretty exciting proposition – Insurance Business AU
Speaking with Insurance Business, nib chief information officer Brendan Mills (pictured) noted: Weve focussed quite heavily over the last little while and continue to do so which is what were doing here with these roles on building stronger and deeper capabilities across the business around machine learning, data science, and the like.
Its a real focus on data and making sure that data is seen as a first-class citizen, and that we use data to be better able to predict and inform our members healthcare and well-being needs. So, thats the overarching driver. And, within the business, were also looking at where else we can use data engineering and machine learning capabilities across sales and marketing, retention, and claims processing.
In fact, nib is presently doing some work in the area of claims processing around using machine learning. Essentially, the capability being built group-wide is aimed at serving the insurer fairly well not only in being able to improve the member experience, but also in reducing nibs cost base in time.
While theres a lot of work ahead, the whole prospect surrounding the payer-to-partner transition is leaving the chief information officer stoked.
To really move closer to our members healthcare needs and to be able to apply data insights be it through traditional data engineering or through more contemporary methods around data science and machine learning is definitely the most exciting opportunity we have, I think, as a business, said Mills.
The challenge, though, is debunking peoples perceptions around the use of data.
The nib executive explained: Sometimes theres a view that with data we will increase someones premium or we will deny the cover or whatever. In our situation, because of the regulations we operate in, thats actually impossible to do. We cant charge someone a different premium or deny them cover; we have a mechanism called community rating, which prevents us from doing that.
What were really trying to do is make sure that as someones health partner, we really look at how we leverage the data to better serve them, stressed Mills, be that through using data to identify claims patterns that might ultimately allow us to offer them a health management programme or be that prediction of future episodes of care that they might need.
Theres no doubt that with the right harvesting and engineering of data, well be able to get closer and closer to our customers and better meet their needs, which I think is a pretty exciting proposition.
Whats crucial, he said, is getting that careful balance between people sharing data and people being more willing to give data in exchange for value.
As consumers, were quite often prepared to share our data in exchange for something, the chief information officer went on to illustrate. So, we share our data with Facebook in exchange for some sort of social media experience and connectivity with friends and family, and advertising or posting.
I think for us its about creating the right opportunities, the right products and services, the right platforms for people to then want to engage with us, to share their data with us, to then give them that value back.
Mills believes whats important is creating that right value exchange and being able to demonstrate, through real-world use cases, that the products and services are indeed valuable to the consumer.
He told Insurance Business: I think thats really where our opportunity and challenge lies, is that once we nail that and get that right, well have a far more open dialogue. Weve got some good programmes in place already, and weve started to work closely with a lot of our customers around health management programmes.
Weve set up a joint venture, in addition to the main part of the business, which is focussed on delivery of a lot of these clinical services. So, were well on the way, but we have a lot of work to do yet.
If you ask Mills, though, hell say that the work actually never stops.
I dont feel like you ever start and stop with these things, he asserted. The way I like to think about our investment in technology and business process reengineering, and the experiences were trying to create, is that its pretty much a continual evolution of what were offering.
I wouldnt like to put a timeline on it per se, because I just dont believe youre ever finished. As we get more and more feedback from our members and our customers, well adapt and pivot and align the experience to the feedback were getting along the way.
See more here:
nib chief information officer on pretty exciting proposition - Insurance Business AU
Everything you need to know to land a job in data science – ZDNet
TikTok, the CIA, Yeti Coolers and AstraZeneca Pharmaceuticals are all looking for data scientists. Salaries for data scientists range from $50,000 to more than $150,00, and companies want full-time data wizards as well as data engineers who will work on contract.
What does it take to get hired? Organizations are looking for job candidates with a bachelor's or master's degree in computer science, as well as experience with data modeling tools, XML, Python, Java, SQL, AWS and Hadoop.
SEE: Research: BI and data analytics usage up; but companies lack skills needed to take full advantage of tools (TechRepublic Premium)
Many data scientist job descriptions also mention the ability to work with a distributed and fast-moving team. Interpreting data for colleagues in business units is increasingly important as well.
Ryan Boyd, head of developer relations at Databricks, said that data science will soon be a commonplace skill outside engineering and IT departments as data becomes increasingly fundamental to businesses.
"To stay competitive, data scientists need to be equally as obsessed with data storytelling as they are with the minutiae of data software and programs," said Boyd. "Tomorrow's best data scientists will be expected to translate their know-how into actionable insights and compelling stories for different stakeholders across the business, from C-suite executives to product managers."
Whether you are looking for your first data science job or figuring out your next career move in the field, the following advice from hiring managers and data science professionals will help you plot a smart and successful course.
SEE: How to become a data scientist: A cheat sheet (TechRepublic)
Anand Karasi, founder of Budgets.ai, said that behind every good data scientist there is often a team of very good data engineers.
"In most projects, more than 80% of the work involved is data engineering," Karasi said. "It's critical that the data is very accurate and clean, otherwise the models will be inaccurate in their predictions."
Karasi said that data scientists at Budgets.ai scan the web and use AI to compute corporate budget spend on AI/ML projects. In the last year, the company increased its spend on speech recognition (68%), video surveillance (69%), transfer learning (99%) and recommenders (122%).
On Dice, there are many more postings for data engineer jobs compared to data scientists -- 1,997 versus 346. Companies are also looking for data architects, predictive modelers, data storytellers, business intelligence developers, and machine learning experts. Managing and analyzing data is a team effort, as these varied roles show.
Jen Hsin, head of data science at SetSail, an AI-powered sales platform, said that data science teams can have diverse areas of responsibility. She describes these roles:
Hsin said it's easy to see why preparing for a data science career can be intimidating.
"Being a generalist who can cover several specialties can make one a more efficient data scientist," she said.
She suggested identifying one or two areas that have the biggest overlap with work experience and personal interests to create 'starter' profiles.
"When job searching, read through the job description carefully to identify if the roles and responsibilities are a good match," Hsin said. "Meanwhile, continuously grow your skillset, before landing a job as well as on the job."
SEE: 7 data science certifications to boost your resume and salary (free PDF) (TechRepublic)
Alicia Frame, director of Graph Data Science at Neo4j, said that the best resumes don't necessarily come from people with doctorate degrees in computer science from Stanford.
"They come from people who can show that they've identified and solved problems and can clearly communicate their impact at a high level," she said.
Frame also looks for experience in mentoring, teaching, and supervising and recommended that job candidates add a publications section on their resumes. This should highlight peer-reviewed journal articles, non-peer-reviewed articles, blog posts, presentations and media mentions.
Boyd recommended that people looking for data science jobs should highlight the ability to make data understandable and actionable, and focus on telling a story about a data set and what it means in the bigger picture.
"Explore different mediums like Powerpoint or interactive reports to do so, hitting key points that will move your audience to action," he said. "Data storytelling makes new information easily accessible and actionable, and your organization will benefit from it."
Experience with data visualization tools like Redash makes for a great addition to a resume, Boyd said.
SEE: Top 5 programming languages for data scientist to learn (free PDF) (TechRepublic)
Frame said she receives hundreds of resumes featuring the same five class projects that everyone seems to work on for a master's degree in data science.
"I don't interview those candidates -- I look for people who've taken the initiative in whatever role they have," she said. "People who know how to shape and execute projects, communicate to stakeholders, and explain why the work they've done matters."
She suggested that job seekers ignore online credentials such as MOOCs and Kaggle competitions.
"Instead, I recommend you look for opportunities to take on data science projects and tasks in the role you currently hold," she said. "Data science is all around us, so finding those interesting projects within your current work can go a long way."
Karasi said that tools available for data scientists are exploding and that fairly good models can be developed by software engineers by using out-of-the-box tools like AutoML from Google.
"Microsoft, Google and Amazon offer great tools that leverage the cloud infrastructure, and getting trained on these offerings and getting a certification is recommended," he said.
Frame also noted that networking and building relationships with colleagues is a key element of building a successful career path.
"I know I can call on a colleague from 10 years ago, and they'll help me answer a question, or put me in touch with someone who can, and I think everyone who works with me knows that I would do the same for them," she said. "Caring about relationships creates the trust and respect that is often missing in the workplace and beyond."
Original post:
Everything you need to know to land a job in data science - ZDNet
‘There are so many hurdles.’ Indian scientists plead with government to unlock COVID-19 data – Science Magazine
People wait in line to receive the COVID-19 vaccine in Mumbai, India, on 29 April.
By Priyanka PullaMay. 4, 2021 , 12:10 PM
Reporting for this story was supported by a journalism grant from the Thakur Family Foundation, which has not influenced the contents of this report.
Indian researchers say they urgently need better access to data collected by government agencies to help them understand and fight the countrys devastating second pandemic wave. An open letter published on 29 April that has 740 signatories so far asks the government for access to databases on COVID-19 testing and genomic sequencing and urges it to remove other obstacles to research.
There are currently so many hurdles and so much paperwork around accessing these data, says Partha Pratim Majumder, a genetic epidemiologist at the National Institute of Biomedical Genomics in West Bengal and one of letters signers.
The Office of the Principal Scientific Adviser to the Government of India, K. VijayRaghavan, released a note the next day acknowledging the problems and promising to increase access. Our broader research community needs to be much more facilitated by our research agencies, the letter said. But some scientists are skeptical that the situation will improve quickly; the note was low on details and previous requests for data from government agencies have often gone unanswered, they say. Why the Indian government is so reticent to share data is unclear.
The government has collected detailed data on some aspects of the pandemic. For example, the Indian Council of Medical Research (ICMR), the countrys top medical research agency, captures demographic details such as age, location, and health status of everyone who submits a sample for a COVID-19 test. The data could help answer key questions, such as whether people with certain concurrent illnesses are more likely to have worse outcomes and whether vaccines are working, says Gagandeep Kang, a public health microbiologist at the Christian Medical College, Vellore, who also signed the letter. Lots of people want to know, for instance, what mortality is by location, and whether it differs between rural and urban areas, she says. This speaks to the kind of care people are getting.
But so far, scientists say, ICMR and other government agencies have dragged their feet on responding to requests for access. That has forced several groups building computer models of the epidemic to rely on public domain data, which are aggregated by state but lack granular details such as breakdowns by district, age, or gender, says L. S. Shashidhara, a developmental biologist at the Indian Institute of Science Education and Research, Pune. Even modeling groups advising the Indian government on its COVID-19 response policy often dont have these data, Shashidhara says.
Scientists also want access to more viral genome sequences generated by the Indian SARS-CoV-2 Consortium on Genomics (INSACOG). Established in December 2020, the consortiums stated goal is to sequence 5% of all new SARS-CoV-2 cases in the country, which is important to keep up with the spread of new virus variants. INSACOG has had a slow start, with just over 15,000 samples sequenced by late April, out of about 5.9 million new cases India has seen since January. And just 6200 sequencesfewer than halfhave been deposited in GISAID, an international database, during this period.
In February, INSACOG identified a variant, later christened B.1.617, which was growing in frequency in Maharashtra at a time when that state was experiencing a massive outbreak. B.1.617 has spread to several other countries, including the United Kingdom and the United States. But INSACOG has yet to share its analyses of whether the variant is more transmissible or more virulent. We need multiple pairs of eyes need to look at this data, instead of just one, Majumder says.
The authors also ask the government to remove obstacles that prevent INSACOG from stepping up the sequencing pace. INSACOG scientists currently have to jump through several bureaucratic hoops to import reagents, plastics, and other key materials. The measures, designed to protect Indian industry, are ill-advised during the coronavirus surge, says Rakesh Mishra, a genomicist at Hyderabads Centre for Cellular and Molecular Biology, one of 10 INSACOG labs. Its like taking a blanket away from a person in winter because the blanket is imported, he says.
The note released by VijayRaghavans office says government agencies will immediately highlight mechanisms of research access to already available datasets and put in place access to new datasets as they are formed. It also says the government will remove import bottlenecks and INSACOG will involve more teams in data analysis, bioinformatics, and decision-making.
Although some scientists welcomed the quick response, others say the letter is too little, too late. The note only says that government agencies will facilitate data access, Majumder says. The question is: when? Time is of the essence. And promises made in the past by important government functionaries have often not been fulfilled on time.
Continued here:
Examining Top Data Science Firms in the 2021 Forbes AI 50 – Solutions Review
The 2021 Forbes AI 50 recognizes the top artificial intelligence companies in the world. Our editors examine which data science firms warrant extra attention.
The 2021 Forbes AI 50 recognizes the best private artificial intelligence companies in the world. Ranging from a variety of industries including human resources, healthcare, transportation and data analytics, the annual listing makes mention of practitioners using AI in unique ways and generating real business impact. As part of the selection process for the list, Forbes data partner, Meritech Capital, received hundreds of submissions from the top AI startups.
In order to qualify for inclusion, companies needed to prove that technologies like machine learning, natural language processing and computer vision make up the core of their product. Once vetted, each AI company was ranked based on revenue gain, profitability, internal growth, customer statistics, and funding and valuations. Their scores were tabulated to produce the final ranks, which Forbes has arranged in ascending order of valuation.
The editors at Solutions Review have perused the 2021 Forbes AI 50, available here, and identified these top data science firms as warranting extra attention. Companies are listed in the order Forbes has them ranked.
Databricks offers a cloud and Apache Spark-based unified analytics platform that combines data engineering and data science functionality. The product leverages an array of open-source languages and includes proprietary features for operationalization, performance, and real-time enablement on Amazon Web Services. A Data Science Workspace enables users to explore data and build models collaboratively. It also provides one-click access to preconfigured ML environments for augmented machine learning with popular frameworks.
Dataiku offers an advanced analytics solution that allows organizations to create their own data tools. The companys flagship product features a team-based user interface for both data analysts and data scientists. Dataikus unified framework for development and deployment provides immediate access to all the features needed to design data tools from scratch. Users can then apply machine learning and data science techniques to build and deploy predictive data flows.
DataRobot offers an enterprise AI platform that automates the end-to-end process for building, deploying, and maintaining AI. The product is powered by open-source algorithms and can be leveraged on-prem, in the cloud or as a fully-managed AI service.DataRobotincludesseveralindependent but fully integrated tools (PaxataData Preparation,Automated Machine Learning, Automated Time Series,MLOps, and AI applications), and each can be deployed in multiple ways to match business needs and IT requirements.
Domino Data Lab offers an enterprise data science platform that allows data scientists to build and run predictive models. The product helps organizations with the development and delivery of these models via infrastructure automation and collaboration. Domino provides users access to a data science Workbench that provides open source and commercial tools for batch experiments, as well as Model Delivery so they can publish APIs and web apps or schedule reports.
Tim is Solutions Review's Editorial Director and leads coverage on big data, business intelligence, and data analytics. A 2017 and 2018 Most Influential Business Journalist and 2021 "Who's Who" in data management and data integration, Tim is a recognized influencer and thought leader in enterprise business software. Reach him via tking at solutionsreview dot com.
Excerpt from:
Examining Top Data Science Firms in the 2021 Forbes AI 50 - Solutions Review
Data Scientists are Crucial to Business Growth and Innovation – CXOToday.com
While data science is not a new term, and was growing in importance in the last several decades, it has gained currency more recently as enterprises have now become awash in a sea of data both small data sets as well as the big data generated by mass-scale applications. The global pandemic further proved the importance of managing data and extracting insights from the large volume of data generated daily to drive business decisions. Despite the collapse in the job market brought about by the pandemic, the demand for data scientists globally as well as in India continues to remain high.
In a recent interaction with CXOToday, Satish Pala, SVP of Digital Solutions at Indium Software, explains why data scientists are crucial to business growth and innovation. He also sheds light on the impact of the COVID pandemic on the Data Science job market, current skill shortage in Data Science and the way forward to reduce the skills gap.
Why do you think the role of data scientists is becoming more relevant than ever today?
Due to digital adoption over the last 5 years by enterprises, we have large volumes of data being generated and processed. In addition, user-generated data through social media and mobile apps has increased exponentially. With such data sets being available, we can derive actionable insights and predictions for almost any business, from content/product recommendation to automated self-driving cars to personal assistants, etc.
While data growth is exponential, the recent developments in advanced analytics have increased the potential to generate unimaginable insights by data scientists. Some of these technologies include Google powered TensorFlow, OpenCV, Fast text by Facebook and BERT based models by Hugging Face which powers document classification and translation. Along with the software capabilities, the hardware availability like GPUs and on-demand cloud services have made solving AI & Data Science problems easy, efficient, and quick. As a result, companies are in desperate need of data scientists, to be one step ahead of the competition.
How important is the Indian market for data scientists in terms of education, hiring?
India is among the top 5 data analytics markets in the world in terms of data generation, consumption, and services. This sector is expected to be around $15 billion by 2025. Specialized courses in Statistics and Mathematics are available in India, which are the foundations to become a data scientist. Universities also offer bachelors and masters degrees in data science covering the foundations and programming in R, Python, and other languages.
India provides a good number of data scientists when compared to other countries, only next to the US, however, there is a lack of data scientists in general. There were close to 90,000 unfilled data science job opportunities in India in 2020. As a result, India Inc. is trying to build data science capability from within as well as partner with learning institutes. Some of the top IT cities that create the maximum number of jobs include Bengaluru at around 25% and Delhi and Mumbai at around 20% respectively.
What was the impact of the COVID pandemic on the Data Science job market?
COVID-19 pandemic has completely shaken and disrupted businesses across the globe. Most of the businesses that have traditionally been run from a local and on-premise data center set up have been forced to move their businesses to the cloud to scale rapidly. Also, there is an increase in digitalization like e-commerce portals and mobile apps. As a result, we have a lot of data being generated, stored, and accessed. Once any business has these data sets, there is a huge potential to leverage these data sets and generate actionable insights for their business. Hence, there are many opportunities for data scientists. The Data Science job market is pretty hot and has actually spiked a lot in the last year.
What are the current challenges faced by Data scientists?
First of all, there are no clear set of expectations from a data scientist as the realm is vast. For example, Data scientists could vary from basic python developers to statisticians to advanced mathematicians. For example, a data scientist could be a Python developer who works with data (with SQL/Spark) or Statistical analysis or Machine Learning or Deep Learning or Computer Vision (OpenCV) or Data Analytics or Data Visualization.
Secondly, theres a lack of clarity on where to start. With the wide set of Data Science skills, its quite difficult for aspiring or even experienced ones aspiring to be data scientists to choose one or more skills to work and stick with.
Finally, I would say, experienced Data scientists are still limited in number when compared to traditional computer engineers. So, the Data Science communities and forums may not offer the same support and content for data scientists to refer to.
Do you see a solution to the current skill shortage in Data Science? How can industry-academia partnership help?
Yes. These days, there are a lot of online courses as well as university programs that are already teaching Data Science not only as a course but also as a special graduate program. Industry-Academia partnerships will help speed up. Some of the Industry players are partnering with universities to design courses, case studies and projects.
How can governments encourage more students to Data Science?
Governments can encourage more students to Data Science in three ways:
1)Demonstrate AI, ML and Data Science based outcomesin government initiatives like Digital India, UPI, etc.
2) Create Job opportunities in Data Scienceeither for the Government or facilitate through private players.
3)Focused Learning Many jobs are going to get automated in the future because of AI, so the government can take measures in upskilling people with mundane jobs through National Skill Development Centres and conducting short terms courses.
How can data scientists upskill themselves in times of pandemic?
Data scientists are crucial to business growth and innovation and need constant upskilling especially in the current data landscape. Data Scientists can register in online tech learning platforms which have a lot of real-time and video recorded courses related to Data Science. Data Scientists require a lot of practice to apply theoretical knowledge on algorithms and programming. One way to achieve this is by solving problems in online Data Science competitive platforms. They provide data sets as well. Another way is to look out for Data Science hackathons to solve problems.
Excerpt from:
Data Scientists are Crucial to Business Growth and Innovation - CXOToday.com