Data Science

 0    57 flashcards    klaudiatkacz
download mp3 print play test yourself
 
Question English Answer English
Apache MLlib
start learning
Language that makes machine learning scalable
Apache Spark
start learning
A general-purpose cluster-computing framework allowing you to process data using compute clusters
API
start learning
Application programming interface allows communication between two pieces of software
Caffe
start learning
A deep learning algorithm repository built with C++ with Python and Matlab bindings
CDLA
start learning
Community Data License Agreement
Classification models
start learning
Are used to predict whether some information or data belongs to a category (or “class”)
CLI
start learning
Command line interface
C++
start learning
A general-purpose programming language. It is an extension of the C programming language or C with Classes
Data set
start learning
A structured collection of data
Deeplearning4
start learning
Language for deep learning
Deep learning
start learning
A specialized type of machine learning. It refers to a general set of models and techniques that loosely emulate the way the human brain solves a wide range of problems
ELT
start learning
Extract, Load, Transform
ETL
start learning
Extract, Transform, and Load
FSF
start learning
Free Software Foundation
ggplot2
start learning
A popular library for data visualization in R
GPU
start learning
Graphics processing units
Git
start learning
De facto standard for code asset management, also known as version management or version control. Around Git emerged several services, GitHub, and GitLab
Hadoop
start learning
Application of Java which manages data processing and storage for big data applications running in clustered systems
Java
start learning
Object-oriented programming language
Java-ML
start learning
Language for machine learning
JVM
start learning
Java Virtual Machine
JavaScript
start learning
A general-purpose language that extended beyond the browser with the creation of Node. js and other server-side approaches
Julia
start learning
A language for high-performance numerical analysis and computational science
Jupyter Notebook
start learning
A browser-based application that allows you to create and share documents containing code, equations, visualizations, narrative text links, and more
Jupyter Lab
start learning
A browser-based application that allows you to access multiple Jupyter Notebook files, other code, and data files
Kernel
start learning
An execution environment for the different programming languages
Lattice
start learning
It is a high-level data visualization library that can handle graphics without customizations
Library
start learning
A collection of functions and methods that allow you to perform many actions without writing the code
Leaflet
start learning
Used for creating interactive plots
ML
start learning
Machine learning uses algorithms – also known as “models” - to identify patterns in the data
Matplotlib
start learning
Package for data visualization
Model training
start learning
The process by which the model learns patterns from data
MNIST
start learning
Modified National Institute of Standards and Technology
MongoDB
start learning
A NoSQL database for big data management that was built with C++
NLP
start learning
Natural Language Processing
NLTK
start learning
Natural Language Toolkit
NumPy
start learning
Libraries are based on arrays and matrices, allowing you to apply mathematical functions to the arrays
OSI
start learning
Open-Source Initiative
PaaS
start learning
Platform as a service
Pandas
start learning
A library that offers data structures and tools for effective data cleaning, manipulation, and analysis
Plotly
start learning
Used for web-based data visualizations that can be displayed or saved as individual HTML files
PMML
start learning
Predictive Model Markup Language
Python
start learning
A high-level, general-purpose programming language. It has a large, standard library that provides tools suited to many different tasks, including Databases, Automation, Web scraping, Text processing, Image processing, Machine learning, and Data analytics
R
start learning
A statistical computing language
Regression models
start learning
Are used to predict a numeric (or “real”) value
Reinforcement Learning
start learning
Loosely based on the way human beings and other organisms learn
REST
the S stands for State, and the T stands for Transfer
start learning
RE stands for Representational
RStudio
start learning
Unifies programming, execution, debugging, remote data access, data exploration, and visualization into one tool
SaaS
start learning
Software as a service
Scala
start learning
Is a combination of scalable and language. A general-purpose programming language that provides support for functional programming and is a strong static type system
Spyder
start learning
Integrates code, documentation, and visualizations, among others, into a single canvas
SQL
start learning
Structured Query Language that is non-procedural, used for querying and managing data
Supervised Learning
start learning
A learning in which a human provides input data and correct outputs
TensorFlow
start learning
Deep Learning library for dataflow that was built with C++
Unsupervised Learning
start learning
The data is not labeled by a human. Examples are Clustering models used to divide each record of a dataset into one of a similar group
Watson Studio
start learning
A fully integrated development environment for data scientists
Weka
start learning
Language for data mining

You must sign in to write a comment