Skip to main content

Catalog Search

Search Options

Contents

A Message from the President
Academic Calendar
Graduate Degree and Advanced Certificate Programs
Courses
- ANTHE - Secondary Education Anthropology Course Descriptions
- ARCH - Architecture Course Descriptions
- ART - Art Course Descriptions
- ARTE - Secondary Education Art Course Descriptions
- BIO - Biology Course Descriptions
- BIOE - Secondary Education Biology Course Descriptions
- BME - Biomedical Engineering Course Descriptions
- CE - Civil Engineering Course Descriptions
- ChE - Chemical Engineering Course Descriptions
- CHEM - Chemistry Course Descriptions
- CHEME - Secondary Education Chemistry Course Descriptions
- CSc - Computer Science Course Descriptions
- DSE - Data Science Engineering Course Descriptions
  - I1020
  - I1030
  - I2100
  - I2400
  - I2450
  - I2700
  - I9800
  - I9900
- EAS - Earth and Atmospheric Science Course Descriptions
- EASE - Secondary Education Earth and Atmospheric Science Course Descriptions
- ECO - Economics Course Descriptions
- ECOE - Secondary Education Economics Course Descriptions
- EDCE - Teaching, Learning, and Culture Course Descriptions
- EDLS - Educational Leadership Course Descriptions
- EDSE - Secondary Education Course Descriptions
- EDUC - Teaching, Learning, and Culture Course Descriptions
- SPED - Special Education Course Descriptions
- EE - Electrical Engineering Course Descriptions
- ENGL - English Course Descriptions
- ENGLE - Secondary Education English Course Descriptions
- ENGR - Engineering Graduate Courses
- HISTE - Secondary Education History Course Descriptions
- HIST - History Course Descriptions
- IAS - Study of the Americas Course Descriptions
- IR - International Relations Course Descriptions
- LAAR - Landscape Architecture Course Descriptions
- LALS - Latin American and Latino Studies
- MATHE - Mathematics Education Course Descriptions
- MATH - Mathematics Course Descriptions
- MCA - Media and Communication Arts Course Descriptions
- ME - Mechanical Engineering Course Descriptions
- MEDS - MEDS Course Descriptions
- MIS - Computer Science Course Descriptions
- MUS - Music Course Descriptions
- PHIL - Philosophy Course Descriptions
- PHYS - Physics Course Descriptions
- PHYSE - Secondary Education Physics Course Descriptions
- PSCE - Secondary Education Political Science Course Descriptions
- PSM - Public Service Management Course Descriptions
- PSY - Psychology Course Descriptions
- SCIE - Secondary Education Science Course Descriptions
- SOCE - Secondary Education Sociology Course Descriptions
- SOC - Sociology Course Descriptions
- SPANE - Secondary Education Spanish Course Descriptions
- SPAN - Spanish Languages and Literatures Course Descriptions
- SUS - Sustainability in the Urban Environment Course Descriptions
- UD - Urban Design Course Descriptions
Policies on Non-Discrimination and Sexual Harassment
Important Notice of Possible Changes
About The City College
Academic Requirements and Regulations for Graduate Students
The Office of the Registrar
Tuition and Fees
Financial Aid
Research and Study Facilities
The Division of Student Affairs
The College of Liberal Arts and Science
Bernard and Anne Spitzer School of Architecture
Sustainability and the Urban Environment, Master of Science (M.S.)
The School of Education
Translational Medicine, Master of Science (M.S.)
Grove School of Engineering
Institutional Policies

Catalog Links

Share

Print this page

DSE I2450 Big Data and Scalable Computation

The course aims to provide a broad understanding of big data and current technologies in managing and processing them with a focus on the urban environment. With storage and networking getting significant cheaper and faster, big data sets could easily reach the hands of data enthusiasts with just a few mouse clicks. These enthusiasts could be policy makers, government employees or managers, who would like to draw insights and (business) value from big data. Thus, it is crucial for big data to be made available to the non-expert users in such a way that they can process the data without the need of a supercomputing expert. One such approach is to build big data programming frameworks that can deal with big data in as close a paradigm as the way it deals with “small data.” Also, such a framework should be as simple as possible, even if not as efficient as custom-designed parallel solutions. Users should expect that if their code works within these frameworks for small data, it will also work for big data. General topics of this course include: big data ecosystems, parallel and streaming programming model, MapReduce, Hadoop, Spark, Pig, and NoSQL solutions. Hands-on labs and exercises will be offered throughout to bolster the knowledge learned in each module.

Prerequisite

DSE I1020 and DSE I1030, or equivalents.

Credits

3

Contact Hours

3 hr./wk.