What’s next?

Looking ahead to Module 3

MI resources

Here we provide a brief list of resources and possible next steps if you want to continue in your MI journey in the future. A good place to start is the list of MI resources compiled by Nathan Frey, which includes a diverse selection of blogs, articles, Python tools, and databases in this field. If you’re interested in more Jupyter notebook examples, we recommend:

  • Matminer examples: This introduces other functionalities of the package. Click on launch binder to start an interactive Jupyter notebook environment.

  • MI best practices: This is a companion GitHub repo to the article by Wang et al. [1] on best practices in MI. Many of these concepts should be familiar to you!

  • Any of the notebooks in this textbook. 😛

More research projects!

Of course, simply reading about MI is not nearly as fun as practicing MI, which means pursuing more research projects! See Bryce Meredig’s editorial in Chemistry of Materials for a few areas where MI will likely make an impact in the immediate or near future [2]. Then, consider reaching out to UC Berkeley research labs/groups to see if they’re in need of undergraduate research assistants. It’s helpful to work alongside others and learn from them, and getting the opportunity to follow a PI will help you find impactful and novel research directions.

Of course, if you want to set out on your own, that’s cool too! You can review some of the materials we’ve shared here to identify the research topics that interest you, and once you find a suitable dataset(s), you can try to blaze your own trail. If you’re interested in just practicing with building ML models, occasionally but not always related to science problems, check out Kaggle which regularly hosts data science competitions open to anyone—they’re pretty fun!

Coursework

Maybe after going through this module you want to focus on a particular topic in more depth, so here we recommend some courses you can take depending on your interests.

Materials science

If you want to learn more about electronic materials (including dielectrics), we recommend MATSCI 111: Properties of Electronic Materials to brush up on this topic. If you were looking specifically for a course in MI, that doesn’t exist at UC Berkeley, but you might find similar themes addressed in MATSCI 215: Computational Materials Science and PHYSICS 188: Bayesian Data Analysis and ML for Physical Sciences. These courses will all deepen your knowledge of (and appreciation for!) materials science and its applications.

Programming

Perhaps the programming tasks in this module have compelled you to take a CS/programming course to improve your programming abilities. This will improve your algorithmic thinking, deepen understanding of the Python language, and establish best practices for programming that could serve you well in the long run. Debugging is also an extremely useful skill to know, and taking a course can help with that. Presumably the CS61A/B/C series will be useful in this regard.

Data science

In a similar vein, maybe instead of focusing so intensely on programming, you want to instead gain a thorough understanding of data science. Fortunately, UC Berkeley has a great data science program, and Data 8: Foundations of Data Science and Data 100: Principles and Techniques of Data Science are both courses that are well-designed and well-taught. You will get exposure to the fundamental concepts and way of thinking, and practical tools/skillsets. You might even notice some structural similarities with this module, such as the use of Jupyter Book!

Machine learning

Last but not least, maybe all this talk about ML has made you excited to learn more about these algorithms. This is a huge field with a lot of classes fitting this bill, depending on your interests and skill level. CS 189: Introduction to Machine Learning is probably the most comprehensive course on this topic, although it is admittedly not for the faint of heart and requires mathematical maturity. EECS 127: Optimization Models in Engineering and EE 221A: Linear System Theory could be nice classes to take as prerequisites.

Don’t be a stranger

Keep us posted on your future progress, even if it’s outside the field of MI! 🙂 We understand many of you are entering your final year at Berkeley, and we would love to hear where life takes you in the future. Thanks for keeping it 💯.

thats_all_folks

Fig. 6 GIPHY

References

1

Anthony Yu-Tung Wang, Ryan J. Murdock, Steven K. Kauwe, Anton O. Oliynyk, Aleksander Gurlo, Jakoah Brgoch, Kristin A. Persson, and Taylor D. Sparks. Machine learning for materials scientists: An introductory guide toward best practices. Chemistry of Materials, 32(12):4954–4965, 2020. doi:10.1021/acs.chemmater.0c01907.

2

Bryce Meredig. Five high-impact research areas in machine learning for materials science. Chemistry of Materials, 31(23):9579–9581, 2019. doi:10.1021/acs.chemmater.9b04078.