Intro to Module 2¶

This segment is our “syllabus hour,” which we have explicitly typed out below to avoid any confusion. The three main sections are as follows:

  1. Structure and expectations

  2. Resources and technical details

  3. Our learning community đŸŒ±

Structure and expectations¶

First, we’ll start with a quick poll: Do you feel comfortable with being recorded on Zoom?

The reason we ask this is because there will be a few sessions where some of us won’t be able to make it, and we’d like to record those sessions so those folks can catch up on the information. But we also recognize some might feel uncomfortable about being recorded, so please let us know. The recordings will not be shared with anyone outside of this group and will not be used for any other purposes (e.g., posted to YouTube, used for research).

Schedule and timing ⌚¶

A condensed summary of everything in this module can be found at Schedule (one page) 📅. We plan to hold interactive tutorials during the first week with live “lectures” and asynchronous “homework,” for which we encourage you to work collaboratively and ask questions. In the second and third weeks, you will be doing self-directed research and we will hold office hours (OH) and other special events.

In terms of timing, we’ll kick off each day at 9:00 AM PDT and have an afternoon debrief at a time that we’ll collectively decide right now. Note that we will not be following Berkeley Time in this module. In terms of actual working hours, particularly during the self-directed research segment, understand that we’re very flexible in accomodating your personal schedules and work preferences, so just keep us updated. There’s also no requirement or expectation to be working outside of “working hours,” which we’ll loosely say is 9-5 Mon-Fri.

Assessments¶

To help build understanding and collect feedback, there will be a scaffolded sequence of assessments, which are a mix of reading, writing, coding, and making presentations. These are (or at least, should be) individually low stakes and implemented with the following considerations in mind:

  • You will learn by doing, which is particularly important when it comes to data science and programming.

  • Much of the coding, even in self-directed research, will be done in groups. Teamwork makes the dream work.

  • The assessments build on each other so you can practice different ways of communicating science, culminating in a final presentation.

  • Consider these assessments to be checkpoints to gauge your learning and to help keep things moving during these short three weeks.

  • There are no grades (obviously). The more effort you’re able to put in, the more you will learn. Thus, we hope you are excited to challenge yourself to learn new things in this module.

Guest speakers¶

As mentioned in the kickoff meeting, one thing that we will try to incorporate into this module is inviting guest speakers to share their work and life story with us. These are great career development opportunities for you to learn about the impact of materials informatics and how graduate school (or not!) can prepare you for this field. We’ve tried to be very intentional about selecting speakers with diverse educational and professional journies for you to learn about (see their respective pages for more information and times). We kindly ask you to attend these talks and ask the speaker questions. They’re all very enthusiastic to spend their time with us and they look forward to the discussions!

Resources and technical details¶

In order to successfully pull off a remote, data science research internship, we’ll have to rely on quite a few different pieces of technology. In this section, we’ll be discussing how these pieces fit together to support your learning.

Zoom¶

We will use Zoom for all of our synchronous meetings. The link should’ve been sent to you in Slack and it can also be found in this text file. We will use the same link for all events in this module (e.g., tutorials, OH, guest speakers, presentations). Feel free to use the room at any time if you need to meet with someone (a peer or a mentor).

Since we’re all pretty tired of Zoom meetings by now, please understand that we will be very intentional of when we are using Zoom. We will try to maximize the amount of interaction and discussion to help all of us stay engaged with the material and for your questions to be answered. We invite you to actively participate during these live sessions, and we want to acknowledge that participation can take several forms:

  • You can unmute and speak up.

  • You can type in the chat.

  • You can react with emojis and other non-verbal feedback.

  • You can turn your video on.

Feel free to use the method(s) you’re most comfortable with.

Slack¶

We will use Slack as our primary communication platform for sending private and group messages, in addition to files, images, and emojis. đŸ™ŒđŸ„łđŸ˜đŸ€đŸą Let’s keep our group conversations in the #module-2 channel.

We kindly ask that you are “active” during working hours on Slack. We encourage you to turn on notfications during this time. You can tag people with @, including individual (@enze) and multiple (@channel) users. Outside of working hours though, please send notifications to others only if urgent. Enze will pretty much respond to any message at any time within 15 minutes. 😛

Jupyter Book¶

This curriculum/website is created using Jupyter Book, which is a project that facilitates the publication of professional, interactive websites that feature computational narratives. We think it will greatly support your learning and we would like to point out a few features to aid your navigation and improve your experience.

Embedded content¶

By now, you’ve probably realized that the Jupyter Book interface is very simple, yet very powerful, with the ability to embed all sorts of content (e.g., Google Slides, Slido polls, images, links, YouTube videos). This means we’re able to keep all of the curriculum materials for this module in a centralized location, which hopefully makes it easier for you to keep track of everything. There’s a handy search bar and navigation bar on the left-hand side for you to quickly locate content, and we encourage you to click around on your own time to familiarize yourself with the interface. 🙂 Please note that external links will open in the same tab, so you’ll have to right-click or Ctrl+Click to open links in a new tab.

DataHub¶

We will be conducting tutorials in the Python programming language and executing the Python code using interactive Jupyter notebooks [1]. The principal reason we’re using Jupyter Book for this curriculum is because this software allows us to directly compile Jupyter notebooks into this website while maintaining links back to the original notebooks. We can then run the notebooks in the cloud without having to mess with individual installations of Python, which greatly increases accessibility. To enable interactive code execution for these notebooks, which we will be doing throughout the tutorials, we can open the Jupyter notebook pages using the DataHub service generously provided by the Division of Computing, Data Science, and Society. We will demonstrate how to use DataHub in the next lesson for today.

GitHub issues¶

We really want this textbook to be a living document that is continually refined, and one way to do so is to make the material open source on GitHub. This allows anyone to collaborate on this work, including you all. If you spot an error (including typos!) or have suggestion for content, please don’t hesitate to open an issue using the GitHub icon above.

Our learning community đŸŒ±Â¶

In the final section, we want to open up a discussion about some community norms for creating a positive and vibrant learning community. We already have two listed, but would love to hear your thoughts during this quick brainstorm. We know it has been a very taxing year for everyone and want to make this space as welcoming and stress-free as possible.

Active learning¶

Lectures are boring to listen to and they’re boring to give, and overall not very effective for learning. So, we’ll be employing active learning strategies that will give you opportunities to discuss ideas, ask questions, test hypotheses, and take agency over your learning, which research has shown to be extremely important [2]. In addition to the more “obvious” activities like interactive coding and peer discussions, you’ll also be providing feedback frequently—for us, for your peers, and for yourselves—as opportunities for reflection on growth. Our hope is that you will feel comfortable participating in all of these activities (and don’t find them too tedious 😜) as we’ve tried to intentionally design them to maximize your learning.

Trying new things¶

The pandemic has really forced all of us to change [several] gears in how we operate and how we think. And while it has led to tremendous loss, it has also catalyzed change and opened new windows of opportunity. In terms of higher education, our optimism leads us to ask the following: Rather than focusing on recreating what was lost, what new pedagogy can we create? This summer internship and using Jupyter Book to develop this module’s curriculum are just two small examples of the new things your instructors are trying.

We also commend you for participating in this internship, as several of you noted that research is new for you. That’s awesome! And for those of you who have done research before: well, we’re honored that you chose to come back and spend your summer with us. Regardless of your previous experience, it’s likely you will learn a lot of new concepts in this module, and we want to explicitly note that learning new things is hard. Like, really, really hard. As we mentioned in the kickoff meeting, we will likely fail when doing research, and fail often.

Fortunately, once we internalize these facts, it becomes not-too-hard to chart a path forward. In fact, just as ML is able to learn from your failures [3], you too will learn from your failures. We know that by signing up for this internship, you are all ready for this challenge and you all believe that ability can be developed (also known as a growth mindset [4]). With this mentality towards learning and the support of your peers and mentors, you’ll surely fluorish in this module. 🌾

We want to hear from you! đŸ’ŹÂ¶

Throughout this module, please do not hesitate to point out what’s confusing or offer suggestions for improvement. Tell us on Slack, on Zoom, or using the GitHub issues tracker. This is the first time we’re trying out this curriculum (sorry, đŸč) and your ideas can have a huge impact on future generations of students!

References¶

1

Thomas Kluyver, Benjamin Ragan-Kelley, Fernando PĂ©rez, Brian Granger, Matthias Bussonnier, Jonathan Frederic, Kyle Kelley, Jessica Hamrick, Jason Grout, Sylvain Corlay, Paul Ivanov, DamiĂĄn Avila, Safia Abdalla, Carol Willing, and Jupyter development team. Jupyter Notebooks - a publishing format for reproducible computational workflows. In Fernando Loizides and Birgit Scmidt, editors, Positioning and Power in Academic Publishing: Players, Agents and Agendas, 87–90. Netherlands, 2016. IOS Press. doi:10.3233/978-1-61499-649-1-87.

2

Scott Freeman, Sarah Eddy, Miles McDonough, Michelle Smith, Nnadozie Okoroafor, Hannah Jordt, and Mary Wenderoth. Active learning increases student performance in science, engineering, and mathematics. Proceedings of the National Academy of Sciences, 111(23):8410–8415, 2014. doi:10.1073/pnas.1319030111.

3

Paul Raccuglia, Katherine C. Elbert, Philip D. F. Adler, Casey Falk, Malia B. Wenny, Aurelio Mollo, Matthias Zeller, Sorelle A. Friedler, Joshua Schrier, and Alexander J. Norquist. Machine-learning-assisted materials discovery using failed experiments. Nature, 533(7601):73–76, 2016. doi:10.1038/nature17439.

4

Carol S. Dweck. Mindset: The New Psychology of Success. Ballantine Books, New York, updated edition, 2016. ISBN 978-0-345-47232-8.