AI Engineer Roadmap – How to Learn AI in 2025
By freeCodeCamp.org
Summary
Topics Covered
- AI Engineers Deploy Models
- Garbage Data Dooms AI
- Master ML Before Neural Nets
- Transformers Power LLMs
Full Transcript
This AI engineering roadmap takes you from core fundamentals to advanced AI implementations. It covers essential mathematics, machine learning, deep learning, and large language models, providing you with the exact skills needed to thrive as an AI engineer in 2025. Whether you're starting fresh or upgrading your skills, this roadmap offers a clear path to success with hands-on experience and industry-relevant insights. Tatev from LunarTech developed this course.

Imagine being at the forefront of one of the most transformative fields of our time, where technology meets innovation and changes the world. Welcome to the AI engineering roadmap of 2025. My name is Tatev Aslanyan from LunarTech, and I'm absolutely excited to be here with you today to dive into this highly requested topic. Together we will explore everything that you need to know to navigate this exciting world of artificial intelligence and AI engineering, and to set yourself up for success in this field.

In this video, we are going to break down the step-by-step roadmap for becoming a world-class AI engineer. Here is what we are going to cover. First, we will define what AI engineering is and how it fits into the broader ecosystem of AI and data science. Next, we will explore the real-world applications of AI engineering, showcasing its transformative impact across different industries. Then we will dive into the must-have versus nice-to-have skills, helping you understand exactly where to focus your efforts and your time. Finally, we will go through the step-by-step process, so the skill sets that you need to master, outlining the essential topics to help you become a job-ready AI engineer. This session is packed with unique insights and practical tips that you won't find anywhere else, so stay tuned. Without further ado, let's get started.

So let's start with the basics: what is AI engineering? AI engineering is
the practice of designing, building, and deploying AI systems that solve real-world problems. It sits at the intersection of software engineering, machine learning, and data science, and here is how it fits into the broader tech ecosystem. Data scientists often focus on analyzing data, predicting something, or developing models. AI engineers take these models, and often much more advanced ones, and make them work in real-world settings. They create systems that process data, make decisions, and deliver actionable insights. For example, in healthcare, a data scientist might develop a machine learning model to detect tumors in X-rays. An AI engineer brings this to the next level: they ensure that the model is integrated into hospital systems, runs in real time, and works reliably under different conditions. AI engineers also work with much more advanced models, like deep learning or neural-network-based models. So data science principles, system design, optimization, machine learning, and deep learning all combine into one place, which is AI engineering. It's not just about building models; it's about making sure that those models actually solve problems and deliver value for the business or the public enterprise. That's why AI engineering is such a critical role in today's tech ecosystem: it's where cutting-edge research meets practical, impactful industry implementation, bridging the gap between research and actual engineering.

AI engineering isn't
just limited to one field; it's changing industries all over the world. Let's look at some examples of how AI engineering is making an impact.

First up is healthcare. AI systems are used to analyze medical images, predict patient outcomes, and assist doctors in drug discovery and patient care. AI engineers build these systems and ensure that they are scalable, reliable, and efficient for real-world use.

Next up is finance. From fraud detection to algorithmic trading, AI processes massive amounts of financial data in real time. Engineers in this field focus on creating secure, efficient, real-time systems that can handle this sensitive information, for example for fraud detection.

In retail and e-commerce, platforms like Amazon use AI to personalize recommendations, optimize pricing, and manage inventory. AI engineers design the algorithms and systems that drive these experiences.

Next up is entertainment. Streaming platforms like Netflix rely on AI for personalized content recommendations, and generative AI tools like DALL-E and ChatGPT are changing how creators produce content.

Next up is autonomous vehicles. Self-driving cars depend on AI for navigation, object detection, and decision making. AI engineers are the ones who design the algorithms and hardware integration that make these autonomous vehicle systems safe and reliable.

These are just a few examples, and they show how diverse and impactful AI engineering is. So whether you are passionate about healthcare, finance, tech, defense, or any creative industry, there is a place for you in this field. That is why AI engineering is so popular these days, and it's going to be one of the most in-demand professions in the next decade. There are many industries and companies that are currently hiring. When it comes to salaries, AI engineers are paid highly competitively: entry-level roles start around 80k and go up to 120k, mid-level engineers earn 120k to 180k, and senior roles can range from 200k all the way up to 750k, all in US dollars.

So let's now get into the actual skill set that you must know in order to become an AI engineer.
And here I'm talking about becoming a world-class, well-rounded, real AI engineer: not just someone who does prompt engineering and uses these different models without knowing them, but someone who will create new algorithms, who will create their own unicorns, or who will become an AI engineer at large cutting-edge companies like OpenAI, Tesla, Meta, and many cutting-edge startups. So first up is, of course,
mathematics. Mathematics is fundamental, from traditional machine learning all the way to the most cutting-edge AI that you see nowadays. There are specific topics from this field that you must know: not the entire universe of mathematics or the super advanced stuff, but really the fundamentals. These are selected topics from different levels, so you cannot just say it's the first or second year of university in one specific program. It's a combination of different levels from different fields of study that you need to bring together in one place and learn before you can move on to the next stage. Today I will tell you which topics those are in more detail, so that you are left with a specific list to learn if you decide to self-study and become a self-paced AI engineer on your own.

First up is high school mathematics. Here you should be able to do basic division and solve an equation with a squared unknown, for example x squared plus something, where you calculate the discriminant to find the solutions to that equation. You should know the basic trigonometric terms, like what sine, cosine, tangent, and cotangent are, and the Pythagorean theorem: basically all the topics from high school, all the way to the last level.

Next up is linear algebra. Linear algebra usually comes from the second year of econometric studies
or applied mathematics and statistics studies. This field is really important for understanding not just traditional machine learning but also deep learning, the more advanced type of ML that powers today's most cutting-edge applications, including the GPT models, Transformers, and so on. If you want to understand the cycle of neural networks, how training works, how it is optimized, and how the entire neural network structure works, then you must understand linear algebra.

Let me tell you specifically what I mean: not the entire field of linear algebra, but these topics. You need to understand vectors and matrices and the norm of a vector. The Cartesian coordinate system comes from high school, but here it is also very relevant for understanding how you can position vectors in the Cartesian coordinate system, along with the idea of the norm versus Euclidean distance, the Pythagorean theorem again, and orthogonality. You also need to understand vectors and their operations: the foundations of vectors, special vectors, unit vectors, the idea of the dot product and its applications, and the Cauchy-Schwarz inequality.

You also need to understand matrices and solving linear systems using matrices. Here you need the foundations of linear systems and matrices: you should be able to add matrices, multiply them, and compute a product between matrices or between a matrix and a vector. You also need an understanding of Gaussian reduction, the row echelon form, the reduced row echelon form, the null space, the column space, the rank, and full rank. All of this will be the foundation for understanding how neural networks work.

If you truly want to understand the different deep learning and AI models, you also need a good basis in linear transformations and matrices: the algebraic laws for matrices and how you can solve a system of multiple linear equations using these transformations. So, for example, what is the transpose of a matrix, what is the inverse of a matrix, and how do you apply these different laws and rules from linear algebra? Also, what is the determinant, how can you calculate it, and what are its properties?

Then you also need to understand some topics from advanced linear algebra, like projections of vectors, the famous Gram-Schmidt process, and matrix factorization, which is really important not just for deep learning but also for traditional machine learning, for example the matrix factorization used in recommender systems. It is also very important to understand the QR decomposition, and eigenvalues and eigenvectors, which are really important for understanding principal component analysis and dimensionality reduction. Also understand the eigendecomposition, which is based on eigenvalues and eigenvectors, and the singular value decomposition, or SVD, which is an important part of traditional machine learning.

So this is what you need to know when it comes to linear algebra, and if you are looking for that
one place to learn linear algebra, last year we published an entire 26-plus-hour course that covers all these topics in one place. It was quite a popular and highly demanded course, and you can also get a certification once you complete it. So check out this course, Fundamentals to Linear Algebra, at LunarTech to go through all these topics, follow along, study, practice, and then get a certification.

Next up, when it comes to
mathematics, beyond linear algebra and high school mathematics, you also need to understand calculus. This one is really important as well. You will need to understand what gradients and derivatives are, how to calculate derivatives, and how to calculate integrals, not just with one variable but with two variables, so double integrals. You also need to know how derivatives and integrals are used in optimization: the concept of the slope, and optimizing models using first-order and second-order gradients, so how you can adjust the parameters for better accuracy. That is essentially traditional Calculus 1 and some Calculus 2. This is a no-brainer when it comes to AI, not just for advanced AI but also for traditional machine learning.
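To make the ideas of partial derivatives, gradients, and double integrals concrete, here is a minimal Python sketch. The function f, its hand-derived gradient, and all helper names are illustrative assumptions and do not come from the course itself; the point is only to show that a numerical slope matches the analytic one and that a double integral can be approximated by summing over a grid.

```python
def f(x, y):
    """Illustrative function of two variables: f(x, y) = x^2 + 3xy + 2y^2."""
    return x ** 2 + 3 * x * y + 2 * y ** 2


def analytic_gradient(x, y):
    """Gradient of f derived by hand: (df/dx, df/dy) = (2x + 3y, 3x + 4y)."""
    return (2 * x + 3 * y, 3 * x + 4 * y)


def numerical_gradient(func, x, y, h=1e-5):
    """Approximate the gradient with central finite differences."""
    df_dx = (func(x + h, y) - func(x - h, y)) / (2 * h)
    df_dy = (func(x, y + h) - func(x, y - h)) / (2 * h)
    return (df_dx, df_dy)


def double_integral(func, x_range, y_range, n=200):
    """Approximate a double integral over a rectangle with the midpoint rule."""
    (x0, x1), (y0, y1) = x_range, y_range
    dx = (x1 - x0) / n
    dy = (y1 - y0) / n
    total = 0.0
    for i in range(n):
        x = x0 + (i + 0.5) * dx  # midpoint of the i-th strip in x
        for j in range(n):
            y = y0 + (j + 0.5) * dy  # midpoint of the j-th strip in y
            total += func(x, y)
    return total * dx * dy


print(analytic_gradient(1.0, 2.0))
print(numerical_gradient(f, 1.0, 2.0))
print(double_integral(f, (0.0, 1.0), (0.0, 1.0)))
```

Comparing an analytic gradient against a finite-difference estimate like this is a common sanity check before trusting hand-written derivatives in an optimization routine.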
For understanding these different models, you must know calculus.

Next up is game theory: not the entire universe of game theory, but a few topics, which usually come from the third year of econometrics or applied mathematics studies, that you must know. Think about the Nash equilibrium, the minimax strategy, or the kind of game where competing results in a worse outcome than collaborating. The idea of the Nash equilibrium is really important for understanding one of the foundational generative AI models, the generative adversarial network (GAN). So to understand this GenAI model, you will also need to have these few topics from game theory in place.

All right, so that's about the mathematics. Here I'm not going into the foundational geometry topics that are usually covered in high school: once again, sine, cosine, and tangent, how to work with different angles such as the 90° angle, what the values are for different angles, and the common notation with pi, so what pi represents, radians, and so on.

Once you are comfortable with these mathematical topics, the next topic that I would
suggest you study is statistics. Statistics is very important for becoming a well-rounded AI professional, from the idea of predicting the next word all the way down to very basic machine learning, so having the basics of statistics will be very helpful to you. Here is the list of topics that I would suggest you study.

First up, of course, is the concept of probability: what probabilities are, why they are used, and the properties and rules of probabilities. Understand what random variables are and what an experiment is. Understand probability density functions (PDFs) and cumulative distribution functions (CDFs). Also understand the idea of a sample versus a population, why we use a sample, and what it means to work with a representative sample. Then there are the basic statistics, like the mean, the median, the mode, the variance, and the standard deviation, and how they are calculated, as well as covariance and correlation and the difference between correlation and causation. Understand how these different statistics can be used to describe your data and to tell a story about your data.
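As a small illustration of these descriptive statistics, the sketch below computes the mean, median, mode, variance, standard deviation, covariance, and correlation using only Python's standard library. The data (hours studied versus exam scores) is made up for this example, and the covariance and correlation helpers are hand-written here as an assumption, so that the example does not depend on a recent Python version.

```python
import statistics


def covariance(xs, ys):
    """Sample covariance of two equally long sequences (divides by n - 1)."""
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (len(xs) - 1)


def correlation(xs, ys):
    """Pearson correlation: covariance rescaled by both standard deviations."""
    return covariance(xs, ys) / (statistics.stdev(xs) * statistics.stdev(ys))


# Made-up data: hours studied and exam scores for six students.
hours = [1, 2, 3, 4, 5, 5]
scores = [52, 58, 61, 70, 74, 75]

summary = {
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
    "mode_of_hours": statistics.mode(hours),
    "variance": statistics.variance(scores),  # sample variance
    "std_dev": statistics.stdev(scores),      # sample standard deviation
    "covariance": covariance(hours, scores),
    "correlation": correlation(hours, scores),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

A correlation close to 1 here only says that the two variables move together in this sample; it does not by itself show that studying more causes higher scores.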
Then there is, again, the idea of sample versus population: why we use a sample, why we are often unable to work with the whole population, and how this becomes relevant across the entire universe of data science. Also understand Bayes' theorem and the different rules of probability, like conditional probability and the idea of independence between random variables.

Then get into some basic probability distributions, especially the normal distribution, the Bernoulli distribution and the idea of Bernoulli trials, and the binomial distribution: what the connection is between the Bernoulli distribution and the binomial distribution, and how it is used in simple settings like tossing a coin. So, basic statistics.

Also understand the idea of linear regression and ordinary least squares (OLS): the conditions and assumptions that ordinary least squares makes when calculating and optimizing the parameter estimates, the idea of an estimate versus the unknown parameter, error terms versus residuals, and the Gauss-Markov theorem and how it is used. This usually comes from econometrics. Then there are the properties of parameter estimates, like bias, consistency, and efficiency, which again tie back to the Gauss-Markov theorem.

Understanding confidence intervals will also be really important in your career in data science and AI: the idea of a 95% confidence interval, how it's
calculated, and what the lower bound and the upper bound mean.

Another very important topic from statistics is hypothesis testing: why we need hypothesis testing, the idea of the null hypothesis and the alternative hypothesis, how you set up these experiments, and why it is important. The concept of statistical significance is very important, as are the type I error and the type II error, the difference between them, and what a false positive and a false negative are. Then there are the statistical tests, like Student's t-test, the F-test, the ANOVA test, the two-sample t-test, and the two-sample normal test. There are so many tests that can be studied in statistics, but I selected a couple of them; I will also provide you with links so you can check them out, and I highly suggest you study them. The concept of the p-value is also essential: how to calculate it, how to use it, how to interpret it, and its limitations.

Then there is inferential statistics, so laws like the central limit theorem and the law of large numbers and how they are used in experiments. This ties back to the normal distribution, one of the most famous distributions, which you must know.
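The central limit theorem can be seen in a short simulation. The sketch below is an illustration under stated assumptions, not material from the course: the population is an Exponential(1) distribution (chosen because it is clearly skewed, with mean 1 and standard deviation 1), and the sample size, number of samples, and seed are arbitrary choices.

```python
import random
import statistics


def simulate_sample_means(sample_size, n_samples, seed=0):
    """Draw many samples from a skewed Exponential(1) population and return their means."""
    rng = random.Random(seed)
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(sample_size)])
        for _ in range(n_samples)
    ]


sample_size = 50
means = simulate_sample_means(sample_size, n_samples=2000)

# Theory: sample means are centered on the population mean (1.0) and their
# spread is the population standard deviation (1.0) divided by sqrt(sample_size).
expected_se = 1.0 / sample_size ** 0.5

mean_of_means = statistics.fmean(means)
spread_of_means = statistics.stdev(means)

# If the sample means are roughly normal, about 95% of them should fall
# within 1.96 standard errors of the population mean.
share_within_band = sum(abs(m - 1.0) <= 1.96 * expected_se for m in means) / len(means)

print(f"mean of sample means: {mean_of_means:.3f}")
print(f"spread of sample means: {spread_of_means:.3f} (theory: {expected_se:.3f})")
print(f"share within 1.96 standard errors: {share_within_band:.3f}")
```

The same 1.96 multiplier is what sits behind the usual 95% confidence interval for a mean, which is why the normal distribution shows up so often in hypothesis testing.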
As an AI engineer, next up you have dimensionality reduction techniques, like principal component analysis (PCA) or factor analysis, and you can also add canonical correlation analysis (CCA) here.

If you are looking for that one place that can help you refresh your memory or study all of this in an organized way, you can also check out our Fundamentals to Statistics course, because there we cover all these topics, which are a prerequisite and a must for you to know before you get to the next level in your AI engineering journey.

So once you're comfortable with the mathematics and statistics, you are ready to move on to the next step in your journey of
becoming an AI engineer. The next skill set is data science. As an AI engineer, you really need to have good data science skills. Without good data, and without the data science skills to tell whether your data is even good, none of your other skills will matter. There is a phrase that is really easy to remember: you can have a great AI model, but if you put garbage in, you will get garbage out. And what you put into your AI model is your data. If your data is bad, and sometimes you don't even know that you are dealing with bad data because you don't have the data science skills, then it doesn't matter how much effort or money you put into your AI model, how many GPUs you use, or how big your data is. To judge data quality, you need data science skills. So what do I mean by that?
AI models like to work with clean data, and they perform well when they get it. Your AI models also need meaningful, relevant data. And as an AI engineer you are responsible for the ethical side of your model, so your data should be unbiased as well. So as an AI engineer you will need to understand how to source data, how to collect it if you don't have a data engineer next to you, how to clean it, and how to preprocess it. Here I mean identifying the missing data in your database and understanding the mechanism behind it: is it missing at random, or is it missing not at random? This defines whether you can impute the data, so fill in the missing values, which techniques you can use to fill them in, or whether you should drop them altogether. You also need to understand whether you have anomalies or outliers in your data, and how you can use statistical and other techniques to find these outliers and remove or adjust them. You will need a good understanding of normalization, of how to filter your data, how to group your data, and how to tell a story about your data before you even get into the model development stage. And you need to know how to split your data, so that you can follow the cycle of data preparation and data evaluation and then use the data as an input for your model, whether it's a machine learning, deep learning, or advanced generative AI model.
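The preparation steps described here (imputing missing values, finding outliers, splitting, and normalizing) can be sketched in a few lines of plain Python. Everything in this example is an assumption made for illustration: the toy list of ages, the choice of median imputation (which is only reasonable if the values are missing at random), the 1.5 × IQR outlier rule, and the helper names.

```python
import random
import statistics


def impute_median(values):
    """Replace missing entries (None) with the median of the observed values."""
    observed = [v for v in values if v is not None]
    median = statistics.median(observed)
    return [median if v is None else v for v in values]


def iqr_outlier_flags(values, k=1.5):
    """Flag values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [v < low or v > high for v in values]


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and split it into train and test parts."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def min_max_scale(values, low, high):
    """Rescale values so that low maps to 0 and high maps to 1."""
    return [(v - low) / (high - low) for v in values]


# Toy column with two missing values and one suspicious entry (120).
ages = [23, 25, None, 31, 29, 27, 120, 26, None, 30]

imputed = impute_median(ages)
flags = iqr_outlier_flags(imputed)
cleaned = [v for v, is_outlier in zip(imputed, flags) if not is_outlier]

train, test = train_test_split(cleaned)

# Fit the scaling range on the training data only, then apply it to both parts,
# so that no information from the test set leaks into preprocessing.
low, high = min(train), max(train)
train_scaled = min_max_scale(train, low, high)
test_scaled = min_max_scale(test, low, high)

print(imputed)
print(cleaned)
print(train_scaled, test_scaled)
```

In a real project these steps are usually done with libraries such as pandas and scikit-learn; the plain-Python version is only meant to show what each step does.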
Understanding how to visualize your data is really important too. As a data scientist, you usually learn exploratory data analysis and how to use tools like Python and simple libraries like Seaborn and Matplotlib to visualize your data. As a data science skill this is a must, in order to identify outliers, to identify certain trends, and to tell a story about your data. This is basically the pre-work that you need before you get into any model development, if you want to do everything properly and as a professional.

You also need feature engineering skills, which are also a data science skill. Sometimes, for example, you have multiple variables, but that's not good enough because you need just one, and it's usually a combination of those multiple variables. Combining different variables in your database into one single variable is what we refer to as feature engineering: you engineer the features that you can then use as an input to your machine learning, deep learning, or AI model in general.

So this is about data science. Knowing data science will set you up for success in your AI engineering career. Next up is the famous traditional machine learning.
Without understanding traditional machine learning, there is no way to be a well-rounded AI engineer. You don't want to be in the position where you use neural networks for every single problem, waste your company's money on GPUs, or spend a lot of time on complex models when you could use a simple machine learning model. If you don't understand this, you can never become the kind of AI engineer who looks at problems not just from a research perspective but also from a business or enterprise perspective. That's why I always suggest mastering traditional machine learning first, and only then moving on to the next point.

By traditional machine learning, I mean understanding the concepts of classification, regression, supervised learning, and unsupervised learning, and the different algorithms that fall under these categories, like linear regression, logistic regression, decision trees, bagging, boosting, XGBoost, LightGBM, GBM, and many other models, including unsupervised models like k-means, hierarchical clustering, or DBSCAN, and knowing in which cases you can use which model. The idea is that when a PM or a business leader comes to you with a vague business problem, you as an AI engineer need to be able to quickly figure out whether you are dealing with a classification problem, a regression problem, or maybe an unsupervised learning problem. You also need to have a quick sense of which models you are most likely going to use to solve that problem. Being able to do this is really important before you move on to any advanced model
studies.

Beyond understanding the algorithms, and if I remember correctly there are about 23 or 24 algorithms in traditional machine learning, you need to understand the mathematics and statistics behind them and what their benefits and disadvantages are. In each of these categories you need to understand how each model works, so that you know, for example, that when you have a lot of missing data you can use one model because it's more stable, or that when your data follows a normal distribution you are better off with another type of model. For each classification, regression, or other type of problem you will have many options, and it's up to you as an AI engineer to shortlist them and then pick the one you will use.

Besides this, you also need to understand how to evaluate a traditional machine learning model: the common cycle of training, validation, and testing; the different sampling and resampling techniques; what bootstrapping is; and what cross-validation, k-fold cross-validation, and leave-one-out cross-validation are. You also need to understand the different evaluation metrics you can use depending on your problem. For example, what is the difference between the mean absolute error, the mean squared error, and the root mean squared error, and in which cases do you use which one? How do you evaluate a classification model: is it the F1 score, or the F-beta score, which is a more general version of the F1 score? Should you use recall, or should you pay more attention to precision?
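The evaluation metrics named here are short formulas, so a minimal sketch may help. The helper names and the toy predictions below are assumptions made for illustration; in practice you would normally use a library implementation, for example the metrics in scikit-learn.

```python
from math import sqrt


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference; every error counts linearly."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_squared_error(y_true, y_pred):
    """Average squared difference; large errors are penalized more heavily."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def root_mean_squared_error(y_true, y_pred):
    """Square root of the MSE, expressed in the same units as the target."""
    return sqrt(mean_squared_error(y_true, y_pred))


def precision_recall(y_true, y_pred):
    """Precision and recall for binary labels, where 1 is the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def f_beta(precision, recall, beta=1.0):
    """F-beta score; beta = 1 gives F1, beta > 1 weights recall more heavily."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy regression example.
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.5, 5.0, 4.0, 8.0]
print(mean_absolute_error(y_true, y_pred))
print(mean_squared_error(y_true, y_pred))
print(root_mean_squared_error(y_true, y_pred))

# Toy binary classification example.
labels = [1, 0, 1, 1, 0, 1, 0, 0]
preds = [1, 0, 0, 1, 1, 1, 1, 0]
precision, recall = precision_recall(labels, preds)
print(precision, recall, f_beta(precision, recall), f_beta(precision, recall, beta=2.0))
```

Because the squared error grows quickly with the size of a mistake, MSE and RMSE are more sensitive to outliers than MAE, which is one reason to choose between them based on how costly large errors are in your problem.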
Understanding when to use machine learning and when to use just a rule-based approach will also be important for you as an AI engineer.

So that is about machine learning. If you want to master the field of machine learning and everything that I just mentioned in one place, you can check out our Fundamentals to Machine Learning course, where we cover everything that you must know in order to become a well-rounded machine learning specialist. You can also get a certification from LunarTech once you complete the course.

So once you are comfortable with mathematics, statistics, and traditional machine learning, next up is
studying deep learning. Deep learning is at the heart of modern artificial intelligence, especially when it comes to generative AI. All these cutting-edge tools like ChatGPT, DALL-E, and Sora, and applications like self-driving cars and humanoid robots, are based on neural networks, and neural networks are the fundamental part of deep learning. Think of deep learning as a more advanced form of machine learning, where the models are able to learn better from larger amounts of data. The growth of big data over the last decade is what made the evolution of deep learning possible.

So when it comes to deep learning, what I mean exactly is that you need to understand how deep learning differs from traditional machine learning. You need to understand the architecture of neural networks and how it works: the concept of neurons and the perceptron, so that you can understand the structure of neural networks in a simple way; the activation functions, what they mean, how they differ, and in which cases to use which one; and the idea of hidden layers, the input layer, and the output layer, and how they relate to the performance of a neural network. You also need to understand the concepts of the forward pass and the backward pass, the idea of backpropagation and what the backpropagation algorithm does, and the idea of the loss function and how you can calculate it for a neural network. Understand how the training of a neural network works: it starts from the input, goes through the forward pass, then the loss calculation, then backpropagation, and so on. Understand the idea behind it, and how each of the decisions you make, like the activation function or the optimization algorithm, will affect the performance of your deep learning model. You should also understand the different optimization algorithms, like gradient descent, stochastic gradient descent (SGD), RMSProp, SGD with momentum, and of course Adam or AdamW.
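To show how a few of these optimizers differ, here is a minimal sketch of the update rules for plain gradient descent, gradient descent with momentum, and Adam, written in plain Python. The toy loss, the learning rates, the step count, and the function names are all assumptions chosen for illustration; real training would use a framework's optimizers and gradients computed on mini-batches of data.

```python
def gradient(w):
    """Gradient of the toy loss L(w) = (w0 - 3)^2 + 10 * (w1 + 1)^2."""
    return [2 * (w[0] - 3), 20 * (w[1] + 1)]


def gd_step(w, g, state, t, lr=0.05):
    """Plain gradient descent: move against the gradient."""
    return [wi - lr * gi for wi, gi in zip(w, g)]


def momentum_step(w, g, state, t, lr=0.05, beta=0.9):
    """Momentum: accumulate a velocity from past gradients, then move along it."""
    velocity = state.get("velocity", [0.0] * len(w))
    velocity = [beta * vi + gi for vi, gi in zip(velocity, g)]
    state["velocity"] = velocity
    return [wi - lr * vi for wi, vi in zip(w, velocity)]


def adam_step(w, g, state, t, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """Adam: per-parameter steps scaled by running averages of g and g^2."""
    m = state.get("m", [0.0] * len(w))
    v = state.get("v", [0.0] * len(w))
    m = [beta1 * mi + (1 - beta1) * gi for mi, gi in zip(m, g)]
    v = [beta2 * vi + (1 - beta2) * gi * gi for vi, gi in zip(v, g)]
    state["m"], state["v"] = m, v
    # Bias correction compensates for m and v starting at zero.
    m_hat = [mi / (1 - beta1 ** t) for mi in m]
    v_hat = [vi / (1 - beta2 ** t) for vi in v]
    return [wi - lr * mh / (vh ** 0.5 + eps) for wi, mh, vh in zip(w, m_hat, v_hat)]


def run(step_fn, steps=500):
    """Start from w = (0, 0) and apply the given update rule repeatedly."""
    w, state = [0.0, 0.0], {}
    for t in range(1, steps + 1):
        w = step_fn(w, gradient(w), state, t)
    return w


w_gd = run(gd_step)
w_momentum = run(momentum_step)
w_adam = run(adam_step)

# The minimum of the toy loss is at w = (3, -1).
print(w_gd, w_momentum, w_adam)
```

The stochastic and mini-batch variants use the same update rules; the only change is that the gradient is estimated from a random subset of the data at each step instead of the full dataset.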
These different algorithms will be really important for understanding how deep learning models are trained and optimized. Besides that, you also need to understand the vanishing gradient problem and the exploding gradient problem. Understand the computational graphs that are used to represent neural networks, how you can evaluate the performance of neural networks, and how you can use cross-entropy. When it comes to optimization techniques, the concept of mini-batch gradient descent is also important, along with the difference between batch gradient descent, mini-batch gradient descent, and stochastic gradient descent. Understand the concept of the Hessian and why it is used, and what it means to have a faster versus a better-performing neural network. Understand batch normalization and layer normalization and the difference between them, the concept of residual connections, gradient clipping, and Xavier initialization, so basically how you can initialize your neural network models.

Of course, when I say the fundamentals of neural networks, I also mean understanding what the bias is, what the weights are, what it means to train a neural network, and the role of improving these weights. You also need to understand how to solve these different problems, like how to solve a vanishing gradient problem and how to solve an exploding gradient problem, and the different techniques to combat overfitting. What does it mean to overfit? This comes from traditional machine learning, but it's still a problem in deep learning. Understand what dropout is and how you can use it, and the difference between random forest and dropout.
What are L1 regularization and L2 regularization, and how do they differ from dropout? What is the curse of dimensionality, and what is the difference between discriminative and generative models? What are autoencoders, what is the reconstruction error, and why are autoencoders basically the nonlinear counterpart of PCA?

Understand the architecture behind the different sorts of deep learning models, the famous ANNs, CNNs, RNNs, GNNs, GRUs, and LSTMs: what the difference is between an LSTM and an RNN, why they were created, what their problems and strengths are, how you can train them, and for what kinds of problems you can use which. When it comes to computer vision, what is convolution, what is pooling, and what is stride? Where can you use an RNN and where can you use a CNN? And there are also generative adversarial networks, so we are getting closer to generative AI, but I will tell you about that in a bit.

So this is basically the theory when it comes to deep learning. Having these fundamentals in place will be really important for the next step, which is understanding how you can use traditional machine learning and traditional deep learning in real
case problems which means learning Python and learning advanced Python the AI frameworks like PyTorch and TensorFlow but not just that also to
understand basic data structures and algorithms in Python and beyond Python for data science also to understand how you can train a machine
learning model in Python how you can train a deep learning model in Python how you can do a visualization in Python or how you can filter your data
prepare your data clean it basically for everything when it comes to data science machine learning and deep learning the practical implementation is happening in a programming language and that's where Python one of the most
popular programming languages comes in handy and my suggestion would be to learn Python next to understand how
you can create lists variables how you can load data different sorts of data whether those are images text audio
how to work with dates how to filter your data how to group your data how to visualize it training a machine learning model training a deep learning model
how to make use of PyTorch which is a deep learning framework in Python as well as TensorFlow this idea of tensors and how you can build train
and deploy machine learning and deep learning models using Python if you are looking for that one place to learn all that I just described make sure to check
the LunarTech page because we have an entire course when it comes to Python for data science which would be a great starting point for you to learn Python specifically for AI and data science and
also get a certification in that field and if you want to check out all these different topics in one place when it comes to learning make sure to check our deep learning interview preparation course
which covers all these topics that I just mentioned in a unique Q&A way all right so once you have done that you have your mathematics statistics machine
learning deep learning and python as well as advanced python in one place you are ready to move on to the next step and here of course I'm talking about the
generative AI so when it comes to generative AI this by the way also includes the large language models here is what I would suggest you to learn in
order to call yourself a world-class AI engineer generative AI is one of the most in-demand skill sets when it comes
to 2024 and 2025 in this AI race in companies like OpenAI Anthropic Google Meta Tesla xAI all these
different companies that are at the forefront of the AI revolution they all are based on these different usages of generative AI ChatGPT is based on
generative AI DALL-E is based on generative AI and Gemini Perplexity the autonomous cars these
days all of the cutting-edge AI tools in one way or the other are based on generative AI that's why if you are an AI engineer with a specialization in
generative AI this will be your year all right so let's now talk about what exactly it takes to become a generative AI based AI engineer so first up you
need to understand the AI foundations and you need to understand where you can apply generative AI before you get into the theoretical part so
understanding also the model development cycle when it comes to generative AI and training techniques will be really important because different types of generative AI foundation models can be
trained and optimized in different ways understanding different foundational generative AI models will also be very important think of the generative
adversarial networks this concept of the G and D so basically the generator and discriminator what are their roles what they are doing how game theory comes
into play here where you can apply generative adversarial networks the mathematics and statistics behind them the minimax theory the Nash equilibrium and also this concept of mode
collapse which is a problem for generative adversarial networks and how you can train and optimize these GANs in order to create new data for example
synthetic data etc so the next topic that I would highly suggest you to study is variational autoencoders this is yet another foundation generative AI
model that I would suggest you to study to understand the mathematics behind it the statistics behind it the idea of this difference between plain
autoencoders and variational autoencoders which is a matter of difference between discriminative and generative types of models the architecture behind it
the KL divergence the ELBO how you can train a variational autoencoder all the way to the idea of the reparameterization trick and how you
can apply variational autoencoders in practice of course the next topic that I would suggest you to study is of course
the Transformers so Transformers are the mother of all the current cutting-edge large language models when
you hear about Llama from Meta when you hear about GPTs from OpenAI or Claude Sonnet from Anthropic all these different large language models at the
base of them are the Transformers Transformers are at the heart of large language models and knowing their history their evolution understanding what is the difference between
Transformers versus the RNNs and LSTMs from deep learning understanding what was the reason for them being invented compared to the RNNs and LSTMs the idea of embeddings the
positional encodings how you can calculate them the attention mechanism the self-attention mechanism the queries keys and values single-head attention multi-head attention
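
To make the queries, keys and values idea a bit more concrete, here is a minimal sketch of single-head scaled dot-product attention in plain Python. It is a simplification under stated assumptions: there are no learned projection matrices, no masking and no batching, and the function names `softmax` and `attention` are hypothetical helpers written for this example, not a library API.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Single-head scaled dot-product attention over lists of vectors."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Output is the attention-weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        outputs.append(out)
    return outputs

# Self-attention: the same token vectors act as queries, keys and values.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
for row in attention(tokens, tokens, tokens):
    print(row)
```

Multi-head attention, in this simplified picture, would run several such heads on different learned projections of the input and concatenate their outputs.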
understanding this entire architecture of Transformers step by step the mathematics and statistics behind it how you can calculate these different parts how you go from the input embeddings positional encodings
all the way to the logits which are the outputs of the Transformers will be really important for you as an innovator to understand because then you will be
able to understand the more varied versions of the Transformers because GPT models from OpenAI which power ChatGPT for example are a variation of
the Transformer model and given that these modern models like Llama the GPTs or other ones are never published in their entirety so you won't know for
example what is the real architecture of GPT 403 which some also call the first AGI model for that I would highly
suggest you to go to the basics and to instead understand the Transformers once you understand the Transformers you also understand the cycle of pre-training so
what it means to pre-train a Transformer model do it by hand from scratch instead of just using the PyTorch library this will help you to completely understand
this foundational language model once you are done with your Transformers I would highly suggest you to get into
the next topic which is the large language models when it comes to large language models this is yet
an entirely different universe and for understanding this and calling yourself an expert in LLMs you have quite a journey to go through but I will
make your life a bit easier and I will give you the step-by-step process and a specific skill set that you can learn
in order to master the field of large language models so you can start with understanding what are the language models this idea of n-grams the
concept of predicting the next word and then how these large language models have evolved over time
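
The n-gram idea of predicting the next word can be sketched with a tiny bigram model in plain Python. This is only an illustration under simple assumptions: whitespace tokenization, a toy corpus invented for the example, and hypothetical helper names `train_bigram` and `predict_next`.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    if word not in counts:
        return None
    return counts[word].most_common(1)[0][0]

corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)
print(predict_next(model, "the"))  # "cat" follows "the" twice, "mat" once
```

A large language model addresses the same next-word task, but replaces the count table with a Transformer conditioned on a much longer context.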
what are their unique sides also understand what are the key large language models like I just mentioned the GPTs from OpenAI Llama from
Meta Falcon BERT from Google and also Gemma from Google Claude Sonnet from Anthropic these are different
very popular open-source or closed-source large language models that you can use as part of your AI engineering journey and understanding these differences understanding how it relates
back to the Transformer models will be foundational for you also understand this concept of open source and closed source what are these different tools that you can use when it comes to open
source of course I'm referring to Hugging Face and other platforms that you can use in order to use large language models and generative AI
in a more efficient and cost-efficient way so understanding the foundations of large language models will be really important for you like
the attention mechanisms that I just mentioned the language models the n-grams the architecture of Transformers the type of architecture you will meet in the field of LLMs
like encoder-based or decoder-based or a combination of them the concept of tokenization from NLP the embeddings how you can calculate each of these parts of
the Transformers if you want to be someone who understands these large language models you want to know how to tweak them how to edit them and also
in an intelligent way to use them in your job or create a platform so you also need to be aware how you can prepare your data for
your large language model application like with the machine learning or deep learning if you don't have the data science skills then it's just a matter of wasting your time and money because
then this concept of garbage in garbage out still holds even if you have the most cutting-edge LLM in front of you if you don't know how to
clean your data how to deal with unstructured data how to prepare it and then ingest it into this AI model then
this AI model will be performing poorly that's why understanding these prompt templates these different types of models these different structures will
be paramount for you next up I would suggest you understand the cycle of pre-training a large language model fine-tuning a large language model prompt engineering reinforcement
learning evaluating and optimizing it so understand what it means to pre-train a large language model how do those large companies use masked language modeling or autoregressive language
modeling in order to pre-train a large language model and then maybe do an example do a pre-training from scratch in order to get a taste of what it's like to pre-train a large
language model what are the different scaling laws what is the output of a pre-trained large language model and what are the different downstream tasks that you need to be familiar with when
it comes to large language models because large language models can be used for classification for predicting the next word for different tasks with an instruction-based
approach for each of them the downstream task is different which means that the model and the way the model is trained is also different and also when it comes
to the large language models you need to definitely know how to fine-tune a large language model so fine-tuning on a single task fine-tuning on a multitask
instruction model how to perform parameter-efficient fine-tuning or PEFT and here of course I'm talking about understanding in detail the mathematics
the linear algebra behind LoRA and QLoRA so the quantized version of LoRA how to prepare your data for fine-tuning what are the steps behind
fine-tuning what is reinforcement learning unsupervised fine-tuning supervised fine-tuning and
definitely also experience when it comes to fine-tuning can you for example fine-tune a large language model on your own do you know how to use these different models
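
The linear algebra behind LoRA mentioned above can be illustrated with a small sketch in plain Python. It assumes the commonly described formulation, where a frozen weight matrix W is adjusted by a low-rank product scaled by alpha / r, giving W + (alpha / r) * B A. The matrices, sizes and helper names here (`matmul`, `lora_effective_weight`) are made up for illustration and are not taken from any specific library.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where A is r x d_in and B is d_out x r."""
    r = len(a)              # rank of the update = number of rows of A
    scale = alpha / r
    delta = matmul(b, a)    # low-rank update with the same shape as W
    return [[w[i][j] + scale * delta[i][j] for j in range(len(w[0]))]
            for i in range(len(w))]

d = 4
w = [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]  # frozen 4x4 weight
a = [[0.0, 2.0, 0.0, 0.0]]          # trainable, shape 1 x 4 (rank r = 1)
b = [[1.0], [0.0], [0.0], [0.0]]    # trainable, shape 4 x 1
w_eff = lora_effective_weight(w, a, b, alpha=1.0)
print(w_eff[0])                      # only the first row of W is changed here
print("trainable values:", len(a) * len(a[0]) + len(b) * len(b[0]), "vs", d * d)
```

The point of the low-rank form is that only A and B are trained, which in this toy case means 8 values instead of 16. B is usually initialized to zero so that training starts from the unchanged W.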
once you are done with the pre-training and fine-tuning I would also suggest you to understand and learn this idea of reinforcement learning from human feedback not in too much detail you can
start with the basics just to understand this concept of RLHF or reinforcement learning from human feedback and why it is that we use this
in order to make our model smarter then once you are done with that I would highly suggest you also to understand this prompt engineering so what it's
like to work with a language model and to make it smarter once it's already pre-trained and fine-tuned here
I would suggest you to look into the best practices for prompt engineering how to do effective prompting prompt optimization and also
how you can apply prompt engineering along with the fine-tuning in order to create AI agents once you are done with that I would suggest you next to go into the
topics of RAGs so to understand what are the concepts of retrieval augmented generation systems the vector databases integrating RAGs with GenAI how to
fine-tune with retrieved data this concept of LlamaIndex agentic RAGs etc so once you are done with this I would
suggest as the final two topics for you the first one is the evaluation and optimization of large language models
the mastering of the LLMs to understand how you can use quantization knowledge distillation
pruning how you can use LLMOps to not just train an LLM but also productionize it using
tools like LangChain Flask etc and also understand how you can evaluate a large language model there are different benchmarks different data sets that you can use in order to properly evaluate
and compare a specific large language model to other world-known universal large language models so finally I would say as an AI engineer you are the
first person responsible for the ethical and safe creation and usage of your AI models that's why we all have a joint responsibility to understand the AI
ethics the principles of ethical AI the bias in AI the privacy and data security in AI to understand the EU AI Act to understand the GDPR and
then understand the regulations and the governance so I hope this didn't overwhelm you if you have good guidance and if you have everything
in one place it will take on average 3 to 6 months for you depending on where you are and whether you already have the prerequisites in order to become a world- class AI engineer so AI
engineering is all about solving real problems not just the theoretical knowledge being able to understand all the theory the foundational knowledge along with the implementation of each of
these different topics in reality will be really important for you to become a job-ready professional a world-class technical AI engineer who knows
the foundations and the actual implementation by Bridging the Gap between research and the industry application for that you will need a
well-rounded comprehensive training on one hand and also on the other hand the practical implementation with projects with a ready resume to be able to
start applying or create your own app because it's all about innovation in this field and it takes a lot of effort and motivation to combine these different
skill sets in one place but this also comes with a reward on one hand yes you need to put a lot of effort in order to become an AI engineer but on the other hand this is one of the most rewarding
and most in-demand careers for the next decade so if you are serious about becoming a world-class AI engineer then you can also apply to our AI
engineering boot camp at LunarTech to get everything in one place not just surface level knowledge but the theory the actual projects that you can also put on your resume those are cutting-edge
projects and as a result you will be able to call yourself a real AI engineer and land that dream job or to
become a founder of a unicorn