Table of Contents:
What is artificial intelligence? How to build a career in artificial intelligence?
The rapid rise of AI has led to a rapid rise in AI jobs, and many people are building exciting careers in this field. A career is a decades-long journey, and the path is not straightforward. Over many years, I’ve been privileged to see thousands of students, as well as engineers in companies large and small, navigate careers in AI. Here’s a framework for charting your own course.


Initially, you focus on learning foundational skills.
After having gained foundational technical skills, you will begin working on projects. During this period, you’ll also keep learning.
Later, you will work on finding a job. Throughout this process, you’ll continue to learn and work on meaningful projects.
1. How to learn A.I (Artificial intelligence)?
1.1 Learning Technical Skills for a Promising AI (Artificial intelligence) Career
- Foundational machine learning skills: For example, it’s important to understand models such as linear regression, logistic regression, neural networks, decision trees, clustering, and anomaly detection. Beyond specific models, it’s even more important to understand the core concepts behind how and why machine learning works, such as bias/variance, cost functions, regularization, optimization algorithms, and error analysis.
- Deep learning: This has become such a large fraction of machine learning that it’s hard to excel in the field without some understanding of it! It’s valuable to know the basics of neural networks, practical skills for making them work (such as hyperparameter tuning), convolutional networks, sequence models, and transformers.
- Software development: While you can get a job and make huge contributions with only machine learning modeling skills, your job opportunities will increase if you can also write good software to implement complex AI systems. These skills include programming fundamentals, data structures (especially those that relate to machine learning, such as data frames), algorithms (including those related to databases and data manipulation), software design, familiarity with Python, and familiarity with key libraries such as TensorFlow or PyTorch, and scikit-learn.
- Math relevant to machine learning: Key areas include linear algebra (vectors, matrices, and various manipulations of them) as well as probability and statistics (including discrete and continuous probability, standard probability distributions, basic rules such as independence and Bayes’ rule, and hypothesis testing). In addition, exploratory data analysis (EDA) — using visualizations and other methods to systematically explore a dataset — is an underrated skill. I’ve found EDA particularly useful in data-centric AI development, where analyzing errors and gaining insights can really help drive progress! Finally, a basic intuitive understanding of calculus will also help. The math needed to do machine learning well has been changing. For instance, although some tasks require calculus, improved automatic differentiation software makes it possible to invent and implement new neural network architectures without doing any calculus. This was almost impossible a decade ago.
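As a small illustration of two of the core concepts listed above (cost functions and regularization), here is a minimal sketch in plain Python. It assumes a one-feature linear model and a cost of summed squared error plus `lam * w**2`; the function names, the toy data, and the value of `lam` are hypothetical choices made for this example and do not come from any particular library or course.

```python
# Minimal sketch: one-feature linear regression with optional L2 regularization.
# Assumed cost: sum((w*x + b - y)**2) + lam * w**2, with the intercept b unpenalized.

def fit_ridge_1d(xs, ys, lam=0.0):
    """Return (w, b) minimizing the assumed cost in closed form."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    w = sxy / (sxx + lam)      # lam > 0 shrinks the slope toward zero
    b = y_mean - w * x_mean    # the intercept follows from the means
    return w, b


def mean_squared_error(xs, ys, w, b):
    """Average squared prediction error of the line y = w*x + b."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical toy data, roughly following y = 2x + 1 with a little noise.
XS = [0.0, 1.0, 2.0, 3.0, 4.0]
YS = [1.1, 2.9, 5.2, 6.8, 9.1]

if __name__ == "__main__":
    for lam in (0.0, 10.0):
        w, b = fit_ridge_1d(XS, YS, lam)
        print(f"lam={lam}: w={w:.3f}, b={b:.3f}, "
              f"train MSE={mean_squared_error(XS, YS, w, b):.3f}")
```

With `lam=0.0` this reduces to ordinary least squares. Raising `lam` shrinks the slope and raises the training error, which is the bias side of the bias/variance trade-off; whether that trade helps on new data depends on the problem.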
Reading disjointed web pages is inefficient because they tend to repeat each other, use inconsistent terminology (which slows you down), vary in quality, and leave gaps. That’s why a good course, in which a body of material has been organized into a coherent and logical form, is often the most time-efficient way to master a meaningful body of knowledge. When you’ve absorbed the knowledge available in courses, you can switch over to research papers and other resources.
Finally, no one can cram everything they need to know over a weekend or even a month. Everyone I know who’s great at machine learning is a lifelong learner. Given how quickly our field is changing, there’s little choice but to keep learning if you want to keep up.
How can you maintain a steady pace of learning for years? If you can cultivate the habit of learning a little bit every week, you can make significant progress with what feels like less effort.
1.2 Should You Learn Math to Get a Job in AI?
“What does someone need to know to accomplish their goals?” The goal might be building a machine learning model, architecting a system, or passing a job interview.
Understanding the math behind algorithms you use is often helpful, since it enables you to debug them. But the depth of knowledge that’s useful changes over time. As machine learning techniques mature and become more reliable and turnkey, they require less debugging, and a shallower understanding of the math involved may be sufficient to make them work.
For instance, in an earlier era of machine learning, linear algebra libraries for solving linear systems of equations (for linear regression) were immature. I had to understand how these libraries worked so I could choose among different libraries and avoid numerical round-off pitfalls. But this became less important as numerical linear algebra libraries matured.
Deep learning is still an emerging technology, so when you train a neural network and the optimization algorithm struggles to converge, understanding the math behind gradient descent, momentum, and the Adam optimization algorithm will help you make better decisions. Similarly, if your neural network does something funny (say, it makes bad predictions on images of a certain resolution, but not others), understanding the math behind neural network architectures puts you in a better position to figure out what to do.
Of course, I also encourage learning driven by curiosity. If something interests you, go ahead and learn it regardless of how useful it might turn out to be! Maybe this will lead to a creative spark or technical breakthrough.
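To make the optimizer discussion above a little more concrete, the following is a rough sketch of the three update rules (gradient descent, momentum, and Adam) in plain Python. It runs them on a hypothetical, badly scaled quadratic loss chosen only for illustration; the learning rates, step counts, and function names are assumptions of this example, and the momentum variant shown is the common "heavy-ball" form rather than any specific library's implementation.

```python
import math

# Hypothetical test problem: f(x, y) = 0.5 * (x**2 + 25 * y**2).
# The two directions have very different curvature, which is where
# the choice of optimizer starts to matter.

def loss(p):
    x, y = p
    return 0.5 * (x * x + 25.0 * y * y)


def grad(p):
    x, y = p
    return [x, 25.0 * y]


def gradient_descent(p, lr=0.03, steps=100):
    """Plain gradient descent: step against the gradient."""
    p = list(p)
    for _ in range(steps):
        g = grad(p)
        p = [pi - lr * gi for pi, gi in zip(p, g)]
    return p


def momentum(p, lr=0.03, beta=0.9, steps=100):
    """Heavy-ball momentum: accumulate a running sum of past gradients."""
    p = list(p)
    v = [0.0] * len(p)
    for _ in range(steps):
        g = grad(p)
        v = [beta * vi + gi for vi, gi in zip(v, g)]
        p = [pi - lr * vi for pi, vi in zip(p, v)]
    return p


def adam(p, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8, steps=100):
    """Adam: momentum plus a per-coordinate scale from squared gradients."""
    p = list(p)
    m = [0.0] * len(p)
    v = [0.0] * len(p)
    for t in range(1, steps + 1):
        g = grad(p)
        m = [beta1 * mi + (1 - beta1) * gi for mi, gi in zip(m, g)]
        v = [beta2 * vi + (1 - beta2) * gi * gi for vi, gi in zip(v, g)]
        m_hat = [mi / (1 - beta1 ** t) for mi in m]  # bias correction
        v_hat = [vi / (1 - beta2 ** t) for vi in v]
        p = [pi - lr * mh / (math.sqrt(vh) + eps)
             for pi, mh, vh in zip(p, m_hat, v_hat)]
    return p


START = [10.0, 1.0]

if __name__ == "__main__":
    for name, optimizer in [("gradient descent", gradient_descent),
                            ("momentum", momentum),
                            ("adam", adam)]:
        print(name, "final loss:", loss(optimizer(START)))
```

One detail the math makes visible: on its very first step, Adam moves each coordinate by roughly `lr` regardless of how large that coordinate's gradient is, because the bias-corrected ratio `m_hat / sqrt(v_hat)` has magnitude close to 1. Knowing this kind of behavior can help when deciding whether a slow or unstable run calls for a different learning rate or a different optimizer.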
2. Projects for A.I (Artificial intelligence)
2.1 Scoping Successful AI Projects
Step 1
Identify a business problem (not an AI problem). I like to find a domain expert and ask, “What are the top three things that you wish worked better? Why aren’t they working yet?” For example, if you want to apply AI to climate change, you might discover that power-grid operators can’t accurately predict how much power intermittent sources like wind and solar might generate in the future.
Step 2
Brainstorm AI solutions. When I was younger, I used to execute on the first idea I was excited about. Sometimes this worked out okay, but sometimes I ended up missing an even better idea that might not have taken any more effort to build. Once you understand a problem, you can brainstorm potential solutions more efficiently. For instance, to predict power generation from intermittent sources, we might consider using satellite imagery to map the locations of wind turbines more accurately, using satellite imagery to estimate the height and generation capacity of wind turbines, or using weather data to better predict cloud cover and thus solar irradiance. Sometimes there isn’t a good AI solution, and that’s okay too.
Step 3
Assess the feasibility and value of potential solutions. You can determine whether an approach is technically feasible by looking at published work, what competitors have done, or perhaps building a quick proof of concept implementation. You can determine its value by consulting with domain experts (say, power-grid operators, who can advise on the utility of the potential solutions mentioned above).
Step 4
Determine milestones. Once you’ve deemed a project sufficiently valuable, the next step is to determine the metrics to aim for. This includes both machine learning metrics (such as accuracy) and business metrics (such as revenue). Machine learning teams are often most comfortable with metrics that a learning algorithm can optimize. But we may need to stretch outside our comfort zone to come up with business metrics, such as those related to user engagement, revenue, and so on. Unfortunately, not every business problem can be reduced to optimizing test set accuracy! If you aren’t able to determine reasonable milestones, it may be a sign that you need to learn more about the problem. A quick proof of concept can help supply the missing perspective.
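The gap between a machine learning metric and a business metric can be sketched with a toy example. Everything below is hypothetical: the labels, the two sets of predictions, and the dollar values attached to each kind of outcome are made up purely to show how a model with lower accuracy can still score higher on a business metric.

```python
# Toy comparison of an ML metric (accuracy) with a hypothetical business metric.
# All labels, predictions, and dollar amounts are invented for illustration.

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def business_value(y_true, y_pred, value_tp=100.0, cost_fp=10.0, cost_fn=100.0):
    """Assumed payoff: gain for each caught positive, small cost for each
    false alarm, large cost for each missed positive."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        if p == 1 and t == 1:
            total += value_tp
        elif p == 1 and t == 0:
            total -= cost_fp
        elif p == 0 and t == 1:
            total -= cost_fn
    return total


Y_TRUE  = [1, 0, 0, 0, 0, 0, 0, 0, 1, 0]   # positives are rare
MODEL_A = [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]   # always predicts the majority class
MODEL_B = [1, 1, 1, 1, 0, 0, 0, 0, 1, 0]   # catches both positives, 3 false alarms

if __name__ == "__main__":
    for name, preds in [("A", MODEL_A), ("B", MODEL_B)]:
        print(f"model {name}: accuracy={accuracy(Y_TRUE, preds):.2f}, "
              f"business value={business_value(Y_TRUE, preds):.0f}")
```

Under these assumed payoffs, model A has the higher accuracy (0.8 versus 0.7) but a negative business value, while model B comes out ahead. In a real project, the payoffs would have to come from the domain experts consulted in the earlier steps.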
Step 5
Budget for resources. Think through everything you’ll need to get the project done, including data, personnel, time, and any integrations or support you may need from other teams. For example, if you need funds to purchase satellite imagery, make sure that’s in the budget.
Working on projects is an iterative process. If, at any step, you find that the current direction is infeasible, return to an earlier step and proceed with your new understanding.
Is there a domain that excites you where AI might make a difference? I hope these steps will guide you in exploring it through project work, even if you don’t yet have deep expertise in that field. AI won’t solve every problem, but as a community, let’s look for ways to make a positive impact wherever we can.
2.2 Finding Projects that Complement Your Career Goals
What if you don’t have any project ideas?
Here are a few ways to generate them:
- Join existing projects. If you find someone else with an idea, ask to join their project.
- Keep reading and talking to people. I come up with new ideas whenever I spend a lot of time reading, taking courses, or talking with domain experts. I’m confident that you will, too.
- Focus on an application area. Many researchers are trying to advance basic AI technology — say, by inventing the next generation of transformers or further scaling up language models — so, while this is an exciting direction, it is also very hard. But the variety of applications to which machine learning has not yet been applied is vast! I’m fortunate to have been able to apply neural networks to everything from autonomous helicopter flight to online advertising, partly because I jumped in when relatively few people were working on those applications. If your company or school cares about a particular application, explore the possibilities for machine learning. That can give you a first look at a potentially creative application — one where you can do unique work — that no one else has done yet.
- Develop a side hustle. Even if you have a full-time job, a fun project that may or may not develop into something bigger can stir the creative juices and strengthen bonds with collaborators. When I was a full-time professor, working on online education wasn’t part of my “job” (which was doing research and teaching classes). It was a fun hobby that I often worked on out of passion for education. My early experiences in recording videos at home helped me later in working on online education in a more substantive way. Silicon Valley abounds with stories of startups that started as side projects. As long as they don’t create a conflict with your employer, these projects can be a stepping stone to something significant.
Here’s a quick checklist of factors to consider:
- Will the project help you grow technically? Ideally, it should be challenging enough to stretch your skills but not so hard that you have little chance of success. This will put you on a path toward mastering ever-greater technical complexity.
- Do you have good teammates to work with? If not, are there people you can discuss things with? We learn a lot from the people around us, and good collaborators will have a huge impact on your growth.
- Can it be a stepping stone? If the project is successful, will its technical complexity and/or business impact make it a meaningful stepping stone to larger projects? If the project is bigger than those you’ve worked on before, there’s a good chance it could be such a stepping stone.
2.3 Building a Portfolio of Projects that Shows Skill Progression
1. Class projects:
The first few projects might be narrowly scoped homework assignments with predetermined right answers. These are often great learning experiences!
2. Personal projects:
You might go on to work on small-scale projects either alone or with friends. For instance, you might re-implement a known algorithm, apply machine learning to a hobby (such as predicting whether your favorite sports team will win), or build a small but useful system at work in your spare time (such as a machine learning-based script that helps a colleague automate some of their work). Participating in competitions such as those organized by Kaggle is also one way to gain experience.
3. Creating value:
Eventually, you will gain enough skill to build projects in which others see more tangible value. This opens the door to more resources. For example, rather than developing machine learning systems in your spare time, it might become part of your job, and you might gain access to more equipment, compute time, labeling budget, or head count.
4. Rising scope and complexity:
Successes build on each other, opening the door to more technical growth, more resources, and increasingly significant project opportunities.
3. How to get a job in A.I (Artificial intelligence)?
3.1 A Simple Framework for Starting Your AI Job Search
In this chapter, I’d like to focus on a framework that’s useful for many job seekers in AI, especially those who are entering AI from a different field. If you’re considering your next job, ask yourself:
- Are you switching roles? For example, if you’re a software engineer, university student, or physicist who’s looking to become a machine learning engineer, that’s a role switch.
- Are you switching industries? For example, if you work for a healthcare company, financial services company, or a government agency and want to work for a software company, that’s a switch in industries.
working in this role for a while, you’ll be better positioned to switch to a tech company (if that’s still your goal).
Alternatively, if you become an analyst in a tech company, you can continue to use your skills as an analyst but apply them to a different industry. Being part of a tech company also makes it much easier to learn from colleagues about practical challenges of AI, key skills to be successful in AI, and so on.
If you’re considering a role switch, a startup can be an easier place to do it than a big company. While there are exceptions, startups usually don’t have enough people to do all the desired work. If you’re able to help with AI tasks, even if it’s not your official job, your work is likely to be appreciated. This lays the groundwork for a possible role switch without needing to leave the company. In contrast, in a big company, a rigid reward system is more likely to reward you for doing your job well (and your manager for supporting you in doing the job for which you were hired), but it’s not as likely to reward contributions outside your job’s scope.
After working for a while in your desired role and industry (for example, a machine learning engineer in a tech company), you’ll have a good sense of the requirements for that role in that industry at a more senior level. You’ll also have a network within that industry to help you along. So future job searches, if you choose to stick with the role and industry, likely will be easier.
When changing jobs, you’re taking a step into the unknown, particularly if you’re switching either roles or industries. One of the most underused tools for becoming more familiar with a new role and/or industry is the informational interview.
3.2 Using Informational Interviews to Find the Right Job
Informational interviews are particularly relevant to AI. Because the field is evolving, many companies use job titles in inconsistent ways. In one company, data scientists might be expected mainly to analyze business data and present conclusions on a slide deck. In another, they might write and maintain production code. An informational interview can help you sort out what the AI people in a particular company actually do.
With the rapid expansion of opportunities in AI, many people will be taking on an AI job for the first time. In this case, an informational interview can be invaluable for learning what happens and what skills are needed to do the job well. For example, you can learn what algorithms, deployment processes, and software stacks a particular company uses. You may be surprised, if you’re not already familiar with the data-centric AI movement, to learn how much time most machine learning engineers spend iteratively cleaning datasets.
Prepare for informational interviews by researching the interviewee and company in advance, so you can arrive with thoughtful questions. You might ask:
- What do you do in a typical week or day?
- What are the most important tasks in this role?
- What skills are most important for success?
- How does your team work together to accomplish its goals?
- What is the hiring process?
- Considering candidates who stood out in the past, what enabled them to shine?
3.3 Finding the Right AI Job for You
- Research roles and companies online or by talking to friends.
- Optionally, arrange informal informational interviews with people in companies that appeal to you.
- Either apply directly or, if you can, get a referral from someone on the inside.
- Interview with companies that give you an invitation.
- Receive one or more offers and pick one. Or, if you don’t receive an offer, ask for feedback from the interviewers, human resources staff, online discussion boards, or anyone in your network who can help you plot your next move.
but I’m pretty sure that if most of your colleagues work hard, learn continuously, and build AI to benefit all people, you’re likely to do the same. (By the way, some large companies won’t tell you who your teammates will be until you’ve accepted an offer. In this case, be persistent and keep pushing to identify and speak with potential teammates. Strict policies may make it impossible to accommodate you, but in my mind, that increases the risk of accepting the offer, as it increases the odds you’ll end up with a manager or teammates who aren’t a good fit.)
Get help from your community. Most of us go job hunting only a small number of times in our careers, so few of us get much practice at doing it well. Collectively, though, people in your immediate community probably have a lot of experience. Don’t be shy about calling on them. Friends and associates can provide advice, share inside knowledge, and refer you to others who may help. I got a lot of help from supportive friends and mentors when I applied for my first faculty position, and many of the tips they gave me were very helpful.
I know that the job-search process can be intimidating. Instead of viewing it as a great leap, consider an incremental approach. Start by identifying possible roles and conducting a handful of informational interviews. If these conversations tell you that you have more learning to do before you’re ready to apply, that’s great! At least you have a clear path forward. The most important part of any journey is to take the first step, and that step can be a small one.
3.4 Keys to Building a Career in AI
1. Teamwork:
When we tackle large projects, we succeed better by working in teams than individually. The ability to collaborate with, influence, and be influenced by others is critical. Thus, interpersonal and communication skills really matter. (I used to be a pretty bad communicator, by the way.)
2. Networking:
I hate networking! As an introvert, having to go to a party to smile and shake as many hands as possible is an activity that borders on horrific. I’d much rather stay home and read a book. Nonetheless, I’m fortunate to have found many genuine friends in AI; people I would gladly go to bat for and who I count on as well. No person is an island, and having a strong professional network can help propel you forward in the moments when you need help or advice. In lieu of networking, I’ve found it more helpful to think about building up a community. So instead of trying to build up my personal network, I focus instead on building up the communities that I’m part of. This has the side effect of helping me meet more people and make friends as well.
3. Job search:
Of all the steps in building a career, this one tends to receive the most attention. Unfortunately, there is a lot of bad advice about this on the internet. (For example, many articles urge taking an adversarial attitude toward potential employers, which I don’t think is helpful.) Although it may seem like finding a job is the ultimate goal, it’s just one small step in the long journey of a career.
4. Personal discipline:
Few people will know whether you spend your weekends learning or binge-watching TV, but they will notice the difference over time. Many successful people develop good habits in eating, exercise, sleep, personal relationships, work, learning, and self-care. Such habits help them move forward while staying healthy.
5. Altruism:
I find that people who aim to lift others during every step of their own journey often achieve better outcomes for themselves. How can we help others even as we build an exciting career for ourselves?