Databases are useful for both storing and using data effectively. Using a …
Databases are useful for both storing and using data effectively. Using a relational database serves several purposes. It keeps your data separate from your analysis. This means there’s no risk of accidentally changing data when you analyze it. If we get new data we can rerun a query to find all the data that meets certain criteria. It’s fast, even for large amounts of data. It improves quality control of data entry (type constraints and use of forms in Access, Filemaker, etc.) The concepts of relational database querying are core to understanding how to do similar things using programming languages such as R or Python. This lesson will teach you what relational databases are, how you can load data into them and how you can query databases to extract just the information that you need.
This is an alpha lesson to teach Data Management with SQL for …
This is an alpha lesson to teach Data Management with SQL for Social Scientists, We welcome and criticism, or error; and will take your feedback into account to improve both the presentation and the content. Databases are useful for both storing and using data effectively. Using a relational database serves several purposes. It keeps your data separate from your analysis. This means there’s no risk of accidentally changing data when you analyze it. If we get new data we can rerun a query to find all the data that meets certain criteria. It’s fast, even for large amounts of data. It improves quality control of data entry (type constraints and use of forms in Access, Filemaker, etc.) The concepts of relational database querying are core to understanding how to do similar things using programming languages such as R or Python. This lesson will teach you what relational databases are, how you can load data into them and how you can query databases to extract just the information that you need.
Network analysis is one of the four pillars of computational humanities, along …
Network analysis is one of the four pillars of computational humanities, along with geographic, text, and image analysis. Participants in this course will receive a broad overview of networks as they’re applied to humanities problems.
Good data organization is the foundation of any research project. Most researchers …
Good data organization is the foundation of any research project. Most researchers have data in spreadsheets, so it’s the place that many research projects start. We organize data in spreadsheets in the ways that we as humans want to work with the data, but computers require that data be organized in particular ways. In order to use tools that make computation more efficient, such as programming languages like R or Python, we need to structure our data the way that computers need the data. Since this is where most research projects start, this is where we want to start too! In this lesson, you will learn: Good data entry practices - formatting data tables in spreadsheets How to avoid common formatting mistakes Approaches for handling dates in spreadsheets Basic quality control and data manipulation in spreadsheets Exporting data from spreadsheets In this lesson, however, you will not learn about data analysis with spreadsheets. Much of your time as a researcher will be spent in the initial ‘data wrangling’ stage, where you need to organize the data to perform a proper analysis later. It’s not the most fun, but it is necessary. In this lesson you will learn how to think about data organization and some practices for more effective data wrangling. With this approach you can better format current data and plan new data collection so less data wrangling is needed.
Lesson on spreadsheets for social scientists. Good data organization is the foundation …
Lesson on spreadsheets for social scientists. Good data organization is the foundation of any research project. Most researchers have data in spreadsheets, so it’s the place that many research projects start. Typically we organize data in spreadsheets in ways that we as humans want to work with the data. However computers require data to be organized in particular ways. In order to use tools that make computation more efficient, such as programming languages like R or Python, we need to structure our data the way that computers need the data. Since this is where most research projects start, this is where we want to start too! In this lesson, you will learn: Good data entry practices - formatting data tables in spreadsheets How to avoid common formatting mistakes Approaches for handling dates in spreadsheets Basic quality control and data manipulation in spreadsheets Exporting data from spreadsheets In this lesson, however, you will not learn about data analysis with spreadsheets. Much of your time as a researcher will be spent in the initial ‘data wrangling’ stage, where you need to organize the data to perform a proper analysis later. It’s not the most fun, but it is necessary. In this lesson you will learn how to think about data organization and some practices for more effective data wrangling. With this approach you can better format current data and plan new data collection so less data wrangling is needed.
Data Carpentry lesson to learn how to use command-line tools to perform …
Data Carpentry lesson to learn how to use command-line tools to perform quality control, align reads to a reference genome, and identify and visualize between-sample variation. A lot of genomics analysis is done using command-line tools for three reasons: 1) you will often be working with a large number of files, and working through the command-line rather than through a graphical user interface (GUI) allows you to automate repetitive tasks, 2) you will often need more compute power than is available on your personal computer, and connecting to and interacting with remote computers requires a command-line interface, and 3) you will often need to customize your analyses, and command-line tools often enable more customization than the corresponding GUI tools (if in fact a GUI tool even exists). In a previous lesson, you learned how to use the bash shell to interact with your computer through a command line interface. In this lesson, you will be applying this new knowledge to carry out a common genomics workflow - identifying variants among sequencing samples taken from multiple individuals within a population. We will be starting with a set of sequenced reads (.fastq files), performing some quality control steps, aligning those reads to a reference genome, and ending by identifying and visualizing variations among these samples. As you progress through this lesson, keep in mind that, even if you aren’t going to be doing this same workflow in your research, you will be learning some very important lessons about using command-line bioinformatic tools. What you learn here will enable you to use a variety of bioinformatic tools with confidence and greatly enhance your research efficiency and productivity.
Software Carpentry lesson that teaches how to use databases and SQL In …
Software Carpentry lesson that teaches how to use databases and SQL In the late 1920s and early 1930s, William Dyer, Frank Pabodie, and Valentina Roerich led expeditions to the Pole of Inaccessibility in the South Pacific, and then onward to Antarctica. Two years ago, their expeditions were found in a storage locker at Miskatonic University. We have scanned and OCR the data they contain, and we now want to store that information in a way that will make search and analysis easy. Three common options for storage are text files, spreadsheets, and databases. Text files are easiest to create, and work well with version control, but then we would have to build search and analysis tools ourselves. Spreadsheets are good for doing simple analyses, but they don’t handle large or complex data sets well. Databases, however, include powerful tools for search and analysis, and can handle large, complex data sets. These lessons will show how to use a database to explore the expeditions’ data.
This collection uses primary sources to explore the Declaration of the Rights …
This collection uses primary sources to explore the Declaration of the Rights of Man and of the Citizen. Digital Public Library of America Primary Source Sets are designed to help students develop their critical thinking skills and draw diverse material from libraries, archives, and museums across the United States. Each set includes an overview, ten to fifteen primary sources, links to related resources, and a teaching guide. These sets were created and reviewed by the teachers on the DPLA's Education Advisory Committee.
The designing, collecting, analyzing, and reporting of psychological studies entail many choices …
The designing, collecting, analyzing, and reporting of psychological studies entail many choices that are often arbitrary. The opportunistic use of these so-called researcher degrees of freedom aimed at obtaining statistically significant results is problematic because it enhances the chances of false positive results and may inflate effect size estimates. In this review article, we present an extensive list of 34 degrees of freedom that researchers have in formulating hypotheses, and in designing, running, analyzing, and reporting of psychological research. The list can be used in research methods education, and as a checklist to assess the quality of preregistrations and to determine the potential for bias due to (arbitrary) choices in unregistered studies.
Engineering analysis distinguishes true engineering design from "tinkering." In this activity, students …
Engineering analysis distinguishes true engineering design from "tinkering." In this activity, students are guided through an example engineering analysis scenario for a scooter. Then they perform a similar analysis on the design solutions they brainstormed in the previous activity in this unit. At activity conclusion, students should be able to defend one most-promising possible solution to their design challenge. (Note: Conduct this activity in the context of a design project that students are working on; this activity is Step 4 in a series of six that guide students through the engineering design loop.)
The Design Process is a modern approach to the teaching of practical …
The Design Process is a modern approach to the teaching of practical skills in schools, colleges and universities. It is sometimes called Product Design. In this course learners will learn how to define the Design Process and explain the framework of design. This course discusses the advantages and disadvantages of the design process and it illustrates the design process diagrammatically. It explains problem identification techniques and discusses ways of analysing products to be designed. In addition, this course discusses the importance of investigating into problems before designing and making.
Welcome to 2.007! This course is a first subject in engineering design. …
Welcome to 2.007! This course is a first subject in engineering design. With your help, this course will be a great learning experience exposing you to interesting material, challenging you to think deeply, and providing skills useful in professional practice. A major element of the course is design of a robot to participate in a challenge that changes from year to year. This year, the theme is cleaning up the planet as inspired by the movie Wall-E. From its beginnings in 1970, the 2.007 final project competition has grown into an Olympics of engineering. See this MIT News story for more background, a photo gallery, and videos about this course.
Welcome to 2.007! This course is a first subject in engineering design. …
Welcome to 2.007! This course is a first subject in engineering design. With your help, this course will be a great learning experience exposing you to interesting material, challenging you to think deeply, and providing skills useful in professional practice. A major element of the course is design of a robot to participate in a challenge that changes from year to year. This year, the theme is cleaning up the planet as inspired by the movie Wall-E. From its beginnings in 1970, the 2.007 final project competition has grown into an Olympics of engineering. See this MIT News story for more background, a photo gallery, and videos about this course.
On Monday, you scanned Steve Jobs' commencement speech from Stanford and on …
On Monday, you scanned Steve Jobs' commencement speech from Stanford and on Tuesday in class we close-read paragraphs 6 through 8. In this discussion, you will post one detail from the speech and provide your thinking about the detail.
The goal of this class is practical: to interrogate, make explicit, and thus to …
The goal of this class is practical: to interrogate, make explicit, and thus to develop the powerful musical intuitions that are at work as you make sense of the music all around you. Reflecting, we will ask how this knowledge develops in ordinary and extraordinary ways.
Sharing data and code are important components of reproducible research. Data sharing …
Sharing data and code are important components of reproducible research. Data sharing in research is widely discussed in the literature; however, there are no well-established evidence-based incentives that reward data sharing, nor randomized studies that demonstrate the effectiveness of data sharing policies at increasing data sharing. A simple incentive, such as an Open Data Badge, might provide the change needed to increase data sharing in health and medical research. This study was a parallel group randomized controlled trial (protocol registration: doi:10.17605/OSF.IO/PXWZQ) with two groups, control and intervention, with 80 research articles published in BMJ Open per group, with a total of 160 research articles. The intervention group received an email offer for an Open Data Badge if they shared their data along with their final publication and the control group received an email with no offer of a badge if they shared their data with their final publication. The primary outcome was the data sharing rate. Badges did not noticeably motivate researchers who published in BMJ Open to share their data; the odds of awarding badges were nearly equal in the intervention and control groups (odds ratio = 0.9, 95% CI [0.1, 9.0]). Data sharing rates were low in both groups, with just two datasets shared in each of the intervention and control groups. The global movement towards open science has made significant gains with the development of numerous data sharing policies and tools. What remains to be established is an effective incentive that motivates researchers to take up such tools to share their data.
Das Tutorial Digitale Textedition mit TEI besteht aus einer Reihe von Kapiteln, …
Das Tutorial Digitale Textedition mit TEI besteht aus einer Reihe von Kapiteln, die aufeinander aufbauend in die Kodierung und Edition von Texten nach den Guidelines der Text Encoding Initiative (TEI) einführen. Das Tutorial ist für den Einsatz in der Lehre konzipiert, kann aber auch im Selbststudium eingesetzt werden.
Jedes Kapitel behandelt einen bestimmten Aspekt des Themas und besteht jeweils aus drei Elementen: erstens aus einem Foliensatz für ein Inputreferat, das in die wichtigsten Begriffe und Elemente von TEI einführt; zweitens aus einem oder mehreren Aufgabenblättern, die zur praktischen Einübung des gelernten dienen; und drittens aus den diversen Materialien, die für die Bearbeitung der Aufgaben notwendig sind, bspw. digitale Faksimiles, XML-Dateien, und mehr.
This OER packet contains the course materials for ENGL 1302 Composition II …
This OER packet contains the course materials for ENGL 1302 Composition II Research and Analysis that introduce you to the ways in which the act of writing has the power to help you make connections between yourself and the world. Writing can help you establish your own experiences or ideas in relation to the experiences or ideas of others. In short, it can help you figure out what you think about things and help you to situate those thoughts in relation to the world and among the multitude of opinions and ideas that exist within it. That’s a powerful tool.
A Data Carpentry curriculum for Economics is being developed by Dr. Miklos …
A Data Carpentry curriculum for Economics is being developed by Dr. Miklos Koren at Central European University. These materials are being piloted locally. Development for these lessons has been supported by a grant from the Sloan Foundation.
Students will choose a podcast to listen to and analyze it by …
Students will choose a podcast to listen to and analyze it by identifying similarities and differeneces between education in modern society and education historically.
No restrictions on your remixing, redistributing, or making derivative works. Give credit to the author, as required.
Your remixing, redistributing, or making derivatives works comes with some restrictions, including how it is shared.
Your redistributing comes with some restrictions. Do not remix or make derivative works.
Most restrictive license type. Prohibits most uses, sharing, and any changes.
Copyrighted materials, available under Fair Use and the TEACH Act for US-based educators, or other custom arrangements. Go to the resource provider to see their individual restrictions.