DAT 229 - Tabular and Linked Data Analysis Credits: 4 4 Skills Lab Hours
Prerequisites: DAT 119 or MMC 150 or CIT 111 or permission of Department Head or instructor.
Description In this course, students explore relational databases which underlie many modern data systems. Students acquire foundational knowledge of table schema design, normalization and Structured Query Language (SQL) needed to interact with data sets from many knowledge domains. In contrast to production database systems, course emphasis lies on analytics-focused database skills, which enable students to integrate relational databases in data pipelines ending in analytic and presentation platforms–rather than transactional or logistics applications. Python is used throughout the course to build database schemas, insert data from flat files and third-party repositories and then extract the data for analytic and decision-support endeavors. Learning Outcomes Upon successful completion of the course, the student will:
- Design a multi-table, normalized database schema with strategically typed fields to house data for supporting decision making in a specific domain, such as transportation, environmental management or political science.
- Implement a database schema using data definition commands.
- Employ Structured Query Language (SQL) and Python or R to read data into a relational database to extract appropriate fields for analysis using primary-foriegn key relationships, table joins and row filtering.
- Configure a relational database system using UNIX shell commands with appropriate user privileges for data management and connection to databases on a remote computer.
- Present the investigation of an inquiry question backed by data in a relational database requiring the use of analytic tools in a Python or R environment to an audience of knowledgeable non-experts.
- Compare the design goals and features of relational databases to those of NoSQL databases in context of a particular analytic application of a chosen data domain.
Listed Topics
- Relational database management systems
- Primary-foreign key relationships
- Normalized database schemas
- Structured Query Language (SQL)
- Bourne Again Shell (BASH)
- Remote database connectivity tools (e.g. SSH Tunneling)
- Python’s cursor and resultset objects
- NoSQL databases and file databases
- Primary key Indexing
- Database views
Reference Materials Current and appropriate database resources selected by instructor Students who successfully complete this course acquire general knowledge, skills and abilities that align with CCAC’s definition of an educated person. Specifically, this course fulfills these General Education Goals: - Information Literacy
- Quantitative & Scientific Reasoning
Approved By: Dr. Quintin B. Bullock Date Approved: 12/14/2020
Course and Section Search
Add to Portfolio (opens a new window)
|