Yousef AI Follow-Up Sheet


Uploaded by

yousef shaban

Ericsson

Huawei
Nokia
Bell
AT&T
Rogers
TELUS

Principal Data Scientist - Product Development (Global AI Accelerator India)


Bangalore, India
Product Development
Job Description
Date: Mar 30, 2020
Ericsson Overview:

Ericsson is the world’s leading provider of communications technology and services. Our offerings include services, consulting,

Using innovation to empower people, business and society, Ericsson is working towards the Networked Society: a world co

We are truly a global company, operating across borders in over 180 countries, offering a diverse, performance-driven cult

Exciting Opportunity:

It will be practically impossible for human brains to understand how to run and optimize next generation of wireless netwo

Machine Intelligence, the combination of Machine Learning and other Artificial Intelligence technologies is what Ericsson u

Ericsson is now looking for Principal Data Scientists to significantly expand its global team for AI acceleration for our group

Do you have in depth understanding of Machine Learning and AI technologies?

Do you want to apply and extend those skills to solve real complex problems with high societal impact; going beyond ML/A

Then, you do want to join Ericsson’s global team of Engineers/Scientists pushing the technology frontiers to automate, sim

Role Summary:

As a Principal Data Scientist, you shall build and deploy AI models into production with focus on scaling, monitoring and pe

Your knowledge and experience in Data Science methodologies will be applied to solve challenging real-world problems as
Key Responsibilities:

Lead multiple AI/ML projects for a certain product/business


Manage communication, planning, collaboration and feedback loops with business stakeholders.
Work with huge datasets including petabytes of 4G/5G-networks, IoT and exogenous data
Identify the model monitoring strategy in prod and retraining plan.
Define data sourcing, access and pipeline design. Identify and plan for sourcing external data.
Model the business problem statement into AI/ML problem.
Define the data sourcing strategy and work with stakeholders to procure data. Contribute to IP creation for Ericsson in AI
Define/Design data storage and retrieval strategies from various kinds of data sources such as NoSQL databases. Design da
Design APIs for AI/ML models with focus on business, modularity and versioning; and build standard/canonical data mode
Lead functional and technical analysis within Ericsson businesses and for strategic customers to understand MI-driven bus
Contribute to rapid and iterative development of validated minimum viable solutions addressing these needs. This include
Lead studies and creative usage of new and/or existing data sources. Work with Data Architects to leverage existing data m
Collaborate with product development teams and partners in Ericsson Businesses to industrialize machine learning models
Work with new technologies and be the ambassador for them in MI Communities within Ericsson, nurturing the communiti
Provide MI Competence build-up in Ericsson Businesses and Customer Serving Units
Develop new and apply/extend existing, concepts, methodologies, techniques for cross functional initiatives
Engage with external ecosystem (academia, technology leaders, open source etc.) to develop the skills and technology por
Present and be prominent in MI related forums and conferences, e.g., publishing patents, presenting papers, organizing se
Key Qualifications:

Bachelors/Masters/Ph.D. in Computer Science, Data Science, Artificial Intelligence, Machine Learning, Electrical Engineerin
Applied experience: 8+ years of ML and/or AI production level experience; and an overall industry experience of about 15+
Proven skills of implementing a variety of Machine Learning techniques
Strong Programming skills (R/Python) with proficiency in at least one
Strong grounding in mathematics, probability, statistics needed for data analysis and experiments
Proven ability of leading AI/ML projects end-to-end with complete ownership
Proven skills in building AI/ML based solutions using a variety of frameworks such as Python, R, H2O, Keras, TensorFlow, Sp
Experience in implementing new algorithms and methodologies from leading open source initiatives and research papers
Extensive experience in model development and life-cycle-management in one or more industry/application domain
Experience in building models using semi-structured and unstructured data
Hands-on experience in designing and building AI models using Deep Neural Networks for applicable scenarios
Experience in using ensembles and stacking techniques to solve complex ML problems
Able to build and deploy AI models into production with focus on scaling, monitoring and performance
Knowledge of building explainable models (XAI) and prescriptive analytics
Experience with working in Big Data technologies such as Hadoop, Cassandra etc.
Able to define/design data storage and retrieval strategies from various kinds of data sources such as NoSQL DBs
Knowledge of designing data pipelines and flow strategies
Familiarity with data pipelining frameworks such as Airflow, AWS SageMaker, etc. would be a plus
Able to design APIs for AI/ML models with focus on business, modularity and versioning
Experience in writing and presenting white papers, journal articles and technical blogs on the results
Soft Skills:

Good communication skills in written and spoken English


Great Team worker and collaborator
Creativity and ability to formulate problems and solve them independently
Self-driven and ability to work through abstraction
Ability to build and nurture internal and external communities
Additional Requirements:

Certifying MI MOOCS, a plus


Applications/Domain-knowledge in Telecommunication and/or IoT, a plus.
Experience with data visualization and dashboard creation is a plus
Knowledge of Cognitive models is a plus
Ability to work independently with high energy, enthusiasm and persistence
Experience in partnering and collaborative co-creation, i.e., working with complex multiple stakeholder business units, glo

Senior Data Scientist - GAIA


Bangalore, India
Research & Development
Role Summary:

As a Senior Data Scientist, you will need to have strong programming skills and deep understanding of data science and M

Key Responsibilities:

Lead functional and technical analysis within Ericsson businesses and for strategic customers to understand MI-driven bus
Define the model validation strategy and business success criteria in data science terms
Identify the right architecture and flow for the data and DS model
Design the implementation and deployment strategy for the model into production
Contribute to rapid and iterative development of validated minimum viable solutions addressing these needs. This include
Lead studies and creative usage of new and/or existing data sources. Work with Data Architects to leverage existing data m
Collaborate with product development teams and partners in Ericsson Businesses to industrialize machine learning models
Work with unstructured data including text and images in AI/ML models
Work with new technologies and be the ambassador for them in MI Communities within Ericsson, nurturing the communiti
Provide MI Competence build-up in Ericsson Businesses and Customer Serving Units
Develop new and apply/extend existing, concepts, methodologies, techniques for cross functional initiatives
Engage with external ecosystem (academia, technology leaders, open source etc.) to develop the skills and technology por
Present and be prominent in MI related forums and conferences, e.g., publishing patents, presenting papers, organizing se

Key Qualifications:

Bachelors/Masters/Ph.D. in Computer Science, Data Science, Artificial Intelligence, Machine Learning, Electrical Engineerin
Applied experience: 5+ years of ML and/or AI production level experience; and an overall industry experience of around 10
Proven skills of implementing a variety of Machine Learning techniques
Experience in Security, Internet of Things is a plus
Strong skills in the use of current machine learning frameworks such as H2O, Keras, TensorFlow, Spark ML etc.
Demonstrated ability to implement new algorithms and methodologies from leading open source initiatives and research p
Experience with Big Data technologies such as Hadoop, Cassandra etc.
Good with effective big data storage and retrieval strategies including indexing, partitioning, etc.
Hands on working with data pipeline and flow
Hands on with API design/development for AI/ML models
Strong grounding in math and statistics.
Proven ability of leading projects end-to-end.
Proven experience writing production-grade software
Extensive experience in model development and AI model life-cycle-management in one or more industry/application dom
Strong Programming skills in various languages (C++, Scala, Java, R) with proficiency in Python and/or C++
Good communication skills in written and spoken English
Creativity and ability to formulate problems and solve them independently
Ability to build and nurture internal and external communities
Experience in writing and presenting white papers, journal articles and technical blogs on the results
Additional Requirements:

Certifying MI MOOCS, a plus


Applications/Domain-knowledge in Telecommunication and/or IoT, a plus.
Experience with data visualization and dashboard creation is a plus
Ability to work independently with high energy, enthusiasm and persistence
Experience in partnering and collaborative co-creation, i.e., working with complex multiple stakeholder business units, glo

Data Scientist
Santa Clara, California
Research & Development
In this role you will:

Create and maintain optimal data and model dataOps pipeline architecture
Assemble large, complex data sets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, r
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data so
Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issue
Keep data separated and secure across national boundaries through multiple data centers and strategic customers/partne
Create tool-chains for analytics and data scientist team members that assist them in building and optimizing our product in
Work with data and machine learning experts to strive for greater functionality in our data and model life cycle manageme
Support dataOps competence build-up in Ericsson Businesses and Customer Serving Units

To be successful, you have:

BS, MS or PhD degree in Computer Science, Informatics, Information Systems or another related field.
3-4 years’ experience using the following software/tools: Hadoop, Spark, Kafka, etc.
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
Experience with Data and Model pipeline and workflow management tools: Azkaban, Luigi, Airflow, Dataiku, etc.
Experience with stream-processing systems: Storm, Spark-Streaming, etc.
Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
You have advanced SQL knowledge and experience working with relational databases, query authoring (SQL) as well as wo
You have experience building and optimizing ‘big data’ data pipelines, architectures and data sets.
Experience performing root cause analysis on internal and external data and processes to answer specific business questio
You have strong analytic skills related to working with unstructured datasets.
You have built processes supporting data transformation, data structures, metadata, dependency and workload managem
Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
Strong project management and interpersonal skills.
Experience supporting and working with cross-functional teams in a dynamic environment

Artificial Intelligence Software Engineer


Lewisville, Texas
Product Development
You will:
• Drive strategic technical leadership, challenge existing technologies and be responsible for design, development, integra
• Perform continuous analysis with outstanding local and global teams to build industrial & cloud-based systems and so
• Prototype, design, develop and test high quality products
• Influence and drive exciting, meaningful product development
• Drive continuous improvements of products and processes

To be successful in the role, you must have:


• Bachelor’s and/or master’s degree in computer science engineering or a related field, with a strong background in Artific
• Minimum of 7 to 10 years’ experience in software or automation design and development
• Strong experience in any of the following programming languages: C#, Python, Java
• Hands-on knowledge of the .NET framework, SOA (Service Oriented Architecture) or OPC-UA (Open Protocol Communica
• Hands-on experience with Agile development methodology and framework.
• Experience with architecting and implementing application-appropriate technology and communication protocol stacks
• Innovating, collaborative approach, networking, adapting and responding to change
• Strong planning and prioritizing skills and ability to deliver results and meeting customer expectations

You might also have:


• Knowledge of industry 4.0 landscape and IIoT (Industrial Internet of Things) solutions.
• Experience with industrial robots, controllers and real-time control network
• Experience in data science and machine learning models.

Data Scientist
Lisbon, Portugal
Network Operation and Integration

Date: Apr 17, 2020


Job Summary:

Ericsson is one of the leading ICT providers, with about 40% of the world’s mobile traffic carried through our networks. We

Learn what makes YOU + Ericsson a powerful combination. Join us today.


Now we are looking for a Data Scientist.

Key Qualifications:

Student or Recent Graduate in Technical Telecom, Electronics or Computer Science Engineering major
Previous experience in the field (professional or university project)
Good knowledge of statistical analysis, theory of probabilities, design of experiments and machine learning
Understanding of data preparation, data mining and pre-processing
Good knowledge of SQL, PL/SQL, SQL Server or SPSS
Good command of a few programming languages and software environments for statistical analysis, graphics representation a
Knowledge of Business Objects, Tableau, Cognos
Knowledge of Hadoop, Spark, Data Lake would be appreciated

Key Responsibilities:

Gather and process data at scale; scripts, files, APIs, database queries, etc.
Work closely with analytics, and engineering team members
Interpret data, analyze results using statistical techniques and provide ongoing reports
Acquire data from primary or secondary data sources and maintain databases/data systems
Identify, analyze, and interpret trends or patterns in complex data sets

What we offer:

Paid internship and flexible working environment


You will benefit from a wide range of learning opportunities and knowledgeable colleagues
Direct collaboration with high performance teams
Work based on developing the latest solutions in the area of mobile technology
Work in an international environment based on cooperation

Machine Learning Engineer


Nanjing, China
Product Development

Job Description
Date: Apr 17, 2020
Responsibilities:
Create Scalable Machine Learning systems that are highly performant.
Identify patterns in data streams and generate actionable insights.
Customize Machine Learning algorithms in image recognition and computer vision.
Participate in related proofs-of-concept with customers to apply machine learning algorithms in different use cases.
Investigate new machine learning technologies & identify feasible ones for specific cases.
Collaborate with others both internally and externally in machine learning area.
Prepare and provide machine learning trainings to other colleagues.

Key Qualifications:

An interest in exploring how Machine Learning can be leveraged for the use cases in telecom domain
4+ years of software development experience post graduate school
2+ years of experience working in applying Machine Learning to solve complex problems
MS in Computer Science, Mathematics, Physics, Artificial Intelligence, or related

Additional Requirements:

Software development experience in Python/C++/Java.


Writing Hive or SQL queries for Data extraction.
Experience with open source software and libraries.
Experience in the telecom industry.
Hands-on experience with one or more Machine Learning algorithms such as Regression, Clustering, Trees/Random Forest
Familiarity with frameworks such as scikit-learn, Torch, TensorFlow, Caffe, YOLO, Faster R-CNN etc.

Data Scientist
Stockholm, Sweden
Performance and Transformation

Job Description
Date: Apr 14, 2020
Are you interested in leading our company through the future challenges of the 5G industry? Do you have a passion for de

We are currently looking for Data Scientists in Stockholm, Sweden, to join our Digital Center.

Activities of a Data Scientist include defining, processing, and analyzing data to identify actionable solutions, and applying

Responsibilities of a Data Scientist:

Identifies opportunities to create business value using advanced analytics / AI / ML


Defines datasets needed for executing on identified opportunities
Determines availability of and possibility to obtain necessary data
Processes, cleans, and verifies data
Applies algorithms to analyze data and identify patterns / trends
Comes up with actionable solutions based on executed analysis
Communicates recommendations to a project management and cross-functional team
Data Scientist basic qualifications: 

University degree / MBA in Computer Science, Machine Learning, Computer Engineering, Mathematics, Physics, or related
Significant experience from applying data science methodologies to solve challenging real-world business problems (predi
Strong foundation in mathematics and statistics
Fluency in at least one scripting language (e.g. Python, R)
Strong analytical skills and ability to acquire new knowledge and apply it in the job
Acumen for business flow understanding and expertise in data preparation, data mining, and pre-processing
Strong communication, presentation, and collaboration skills

Data Scientist preferred qualifications:

Familiarity with Agile ways of working


Understanding of end user / customer needs and perspectives
Ability to effectively collaborate and share knowledge and best practices with wider team
Experience with data visualization and dashboard creation
Experience supporting and working with cross-functional teams in a dynamic environment

Application Process:

The selection and interview process is ongoing. Therefore, please send in your application in English as soon as possible. Fo

We have a programming test as a qualifier and part of screening process for this position. Please note that you may be req

Software Developer
Gurgaon, India
Product Development

Job Summary:

Automation System Tester function is to ensure the execution of system test activities for the developed products. This fun

Responsibilities:

• Assigned test activities are realized within approved cost, time and quality
• Prepare test plans.
• Complete test report, analyze test results
• Maintenance and update of test documents
• Keep description of the test environment (both HW & SW) up-to-date. Track all software and hardware licenses and inven
• Handling the Test environment (including simulators, tools, test server)
• Test case execution and troubleshooting
• Establish integration & verification scope, design detail test cases according to SUT (system under test) implementation fe
• Execute functional / non-functional test cases according to detail test cases’ description, elementary bugs’ analysis and b

Key Qualifications:

• Education: Bachelor of Engineering/MCA/MSc
• Min years of experience: 4-7 years

Competence / Skills

Technical Competence
Shell Scripting: Important
Core Java: Important
UNIX/LINUX: Important
Database concepts: Important
Scalability & High Availability Concepts: Important
Virtualization: Important
Distributed File Systems (HDFS, MapR): Important
Automation/Tools (Non-functional – Performance tests): Important
Cassandra (Distributed DB): Important
Cloud Concepts (RHOSP): Important
Veritas: Cluster/Storage (VxFS, VSF): Important
Protocols (IPV6/V4, IPSEC, Diameter, SNMP): Important

Test Competence
Testing Principles & Concepts: Important
Good knowledge of Test tools and Test environment: Important
Must have good aptitude for Quality Assurance - Verification: Important
Strong analytical & troubleshooting skills: Important
Working Experience in Java and scripting languages (PERL, Python, TTCN): Important
Exposure of working in customer live/test environment: Important

Domain Competence
Telecom: Important
Domain - Mediation, Billing: Good to have

Others
Agile & Scrum concepts: Good to have
Aptitude to learn new technologies and function-domain: Must have
Ability to work in Agile Methodologies (SCRUM, KANBAN): Must have

Individual Capacities (General abilities)
Communication Ability: Important
Ability for self-analysis: Important
Job Knowledge as per CV: Important
Result Orientation: Important
Customer Orientation: Important
Enthusiasm/Drive: Important
Creativity & Innovation: Important
Discipline/Punctuality: Important
Adaptability: Important
Professionalism: Important

Experienced C++ Developer for Mobile Positioning System


Budapest, Hungary
Product Development

Job Description
Date: Apr 30, 2020
Have you ever heard of Mobile positioning? We are developing the Ericsson Mobile Positioning System that is being used b

From early studies and proofs-of-concept to deployment, we are responsible for the complete life cycle of the product. Ind

If you are eager to learn, have a can-do personality, would like to work in a project that has a truly global footprint, join us

What will you do:


Design the implementation of the product's functional and non-functional requirements, working in an Agile team
Own the implemented functionality: from design and implementation to production troubleshooting
Be responsible for the quality of the technical solution

Apply if you have:

Solid knowledge in Linux


Experience in C++ and scripting
Familiarity with version control systems (GIT)
Good command of English
BSc or MSc in Computer Science, Electrical Engineering or equivalent is required

Experience in positioning services, cloud service development and deployment is an advantage

What we offer:

Flexible working hours


High standard working environment (bicycle storage, several parking lots…)
Easily accessible location
Competitive compensation package, wide range of cafeteria elements, annual bonus
International career opportunities
Great team spirit, fun events
Professional communities
Extensive professional development opportunities
Subsidized language courses
Mobile and laptop
Local medical service
Relocation support

Data Scientist
Company Name: Huawei
Company Location: Kuala Lumpur, Malaysia

Huawei’s Southern Pacific Regional Office Big Data Team is looking for a self-driven Data Scientist to join our team. Ideally,

Job Responsibilities:

Exploratory research to understand user behavior, selecting features, building and optimizing classifiers and building mach
Address the most important analytical questions with a view on driving product impact, and build products metrics.
Data mining using state-of-the-art methods
Extending company’s data with third party sources of information when needed
Enhancing data collection procedures to include information that is relevant for building analytic systems
Processing, cleansing, and verifying the integrity of data used for analysis
Doing ad-hoc analysis and presenting results in a clear manner
Desired skills and background:

Education: Master’s Degree in applied statistics, data mining, machine learning, physics or a related quantitative discipline
Working experience: >5 years delivering world-class data science outcomes; you solve complex analytical problems using
Customer orientation with excellent understanding of operator’s business/technical requirements, you have a keen desire
Achievement orientation, energetic, strong influence skills, self-initiative, teamwork spirit, persistency, logical thinking abil
Great communication skills
Excellent understanding of machine learning techniques and algorithms, such as Deep Learning, Naive Bayes, SVM, Decisio
Experience with relational databases as well as Hadoop-based data mining frameworks. You are familiar with SQL, Py
Experience in the use of statistical analysis environments such as R, NumPy/Pandas, SPSS or SAS.
Good applied statistics skills, such as distributions, statistical testing, regression, etc.
Good scripting and programming skills
Data-oriented personality

How to apply:

Leave your application through LinkedIn. Please remember to attach your CV to your application to yong.sze.miin3@huaw

Staff Data Scientist


Company Name: Huawei
Company Location: Mountain View, California

Are you enthusiastic about applying big data to resolve real world problems?
Does it sound exciting to you to improve billions of people’s daily life by creating great mobile devices? If these describe yo

DUTIES AND RESPONSIBILITIES (Including Essential Functions)

Apply CNN, DNN, RNN/LSTM and other latest deep learning technologies to high-impact applications, such as anomaly det

Utilize GPUs to accelerate training process of deep learning models.

Compress deep learning models for the deployment on mobile devices.

JOB SPECIFICATIONS

Education/Knowledge:
Major/Discipline: Computer Science/ Math/ Statistics or related

Minimum Degree: Master Degree

Preferred: PhD Degree

Skills & Abilities (Technical or General):

Minimum Requirements:

Knowledge of data analysis techniques (probability, statistics, machine learning, etc.) and experience with applications.

Hands-on experience with big data analytics tools, such as Python, R, SQL, Hadoop, Spark, Java, Tableau, Vertica, Pig, etc.

Strong analytical skills and detail oriented. Excellent analytical thought process – ability to understand the question, and d

Strong communication skills and ability to work well as a team member

Preferred:

Experience of implementing deep learning models on resource-restricted mobile devices.

Proven track record of applying autoencoders to solve real-world problems.

Solid understanding of experimental design.

Versed in the process of applying effective algorithms to large scale data modeling.

Understanding and experience of deep learning modeling.

Huawei is a leading global information and communications technology (ICT) solutions provider. Driven by a commitment

The size of our cloud platform is gaining momentum and it is already planet scale. Huawei Cloud is one of the largest and f

Huawei's Munich Research Center is responsible for advanced technology research, architectural development, design and

We are seeking a highly motivated Data Scientist (m/f/d) to join the Intelligent Cloud Operations team in Huawei Muni

Responsibilities
Propose new innovative approaches to operate planet-scale cloud platforms using AI (e.g., AIOps).
Rapid prototyping of innovative features using Big Data platforms (e.g., Spark, Kafka, HDFS).
Explore new approaches to develop cloud-native distributed systems (e.g., Netflix’s Hystrix).
Implement emerging observability paradigms (e.g., Google Dapper, OpenTracing, and OpenCensus).
Integrate modern AI algorithms (e.g., Deep Learning and Facebook PyTorch) into production systems.
Requirements
Excellent PhD in Computer Science, or related field.
First experience as a data scientist, data engineer, computational biologist, or bioinformatician.
Experience with statistical software (e.g., Pandas, Scikit-learn).
Expertise with data analysis such as forecasting, multivariate analysis, stochastic models.
Experience with applying machine learning on large-scale datasets.
Demonstrated ability to solve challenging engineering problems is required.
Fluent written and spoken English.
What you can expect
Meaningful work: Our products and solutions connect people in over 170 countries, serving more than one third of the wo
Enormous investments in research, development and innovation: Huawei invests over 10% of its revenues in research and
Team spirit: At Huawei, we are proud of a strong social integration. Doors are open, and people collaborate with each othe
International work environment: Our business language is English and our team comprises unique experts from around 50
Robust growth across all business segments, thanks to balanced global presence and strategic focus.
If you are enthusiastic to shape the German Research Center in Munich together with us, being part of a multicultural team

Responsibilities
Use deep learning and machine learning to create scalable solutions for business problems.
Develop new tools using cutting edge technology focusing on efficiency and automation.
Work closely with the AIOps team to jointly develop innovative tools driven by AI.
Build ML systems in production settings.
Collaborate with colleagues from science, engineering, and business backgrounds.
Requirements
Experience with statistical software (e.g., Pandas, R) and programming languages (e.g., Python, Java).
Hands-on expertise with ML libraries (e.g., Scikit-Learn, TensorFlow, Keras).
Experience with applying machine learning on large-scale datasets.
Experience with fast prototyping.
Demonstrated ability to solve challenging engineering problems is required.
Fluent written and spoken English.
PhD in Computer Science or related field
What you can expect
Meaningful work: Our products and solutions connect people in over 170 countries, serving more than one third of the wo
Enormous investments in research, development and innovation: Huawei invests over 10% of its revenues in research and
Team spirit: At Huawei, we are proud of a strong social integration. Doors are open, and people collaborate with each othe
International work environment: Our business language is English and our team comprises unique experts from around 50
Robust growth across all business segments, thanks to balanced global presence and strategic focus.

Munich, Berlin, Hamburg

Data Reply

What sets us apart is our imagination and the ability to inspire our customers for pioneering technologies and to introduce

We are always on the lookout for enthusiasts who question the existing, try out new ideas and want to achieve exciting go
TASKS:

As part of our team, you support our customers in the successful development and implementation of complex solutions f
With the help of powerful open source and cloud technologies, you offer our customers scalable, cost-effective and flexibl
You are responsible for the implementation, improvement and evaluation of ML applications and their implementation
You communicate regularly with our customers and stakeholders in an efficient and professional manner

QUALIFICATIONS:

Successfully completed master's degree in computer science or similar


Knowledge of distributed systems, distributed data storage and processing, e.g. in Apache Spark and Apache Beam
Practical experience with data analysis technologies such as AWS SageMaker, Spark, Spark SQL, MLlib or Storm / Samza are
In-depth knowledge of the following high level programming languages: Scala, Java, Python
Experience with modern ML and DL frameworks such as TensorFlow, PyTorch, MXNet, Caffe and Keras
High analytical and problem-solving skills
Excellent written and oral communication skills in German and English

Location:
Berlin, Copenhagen, Frankfurt, Helsinki, London, Munich, Oslo, Paris, Sao Paulo, Stockholm, Warsaw

Geography:
Central & South America, Europe & The Middle East

Capabilities:
Big data & advanced analytics, innovation & product development, technology & digital

Industries:
Automotive & Mobility, Biopharmaceuticals, Consumer products, Education, Energy & environment, Engineered products

About Us

Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and captu

To succeed, organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and fun

Practice Area Profile

BCG GAMMA combines innovative skills in computer science, artificial intelligence, statistics, and machine learning with de

Role profiles

OVERVIEW OF GAMMA ENGINEERING:

The Gamma Engineering team is building the next generation of analytics tools. Clients need to easily interact with our ana
POSITION SUMMARY:

BCG Gamma is seeking a Machine Learning Engineer to join our engineering team. The ideal candidate will have industry e

As a strong software expert in building complex systems, you will be responsible for inventing how we use technology, ma

RESPONSIBILITIES:

We are looking for a Machine Learning Engineer who can bring bleeding edge machine learning models into production to

Your qualifications

REQUIREMENTS:

• Master's degree in computer science (or in a highly related area)

• Min. 3 years of experience in algorithms, data structures, and object-oriented programming

• Good understanding of machine learning fundamentals and deep learning

• Implementation experience in machine learning algorithms and applications

• Knowledge of or experience in building production quality and large scale deployment of applications related to natural

• Experience with distributed systems strongly preferred

TECHNOLOGIES:

• Strong programming skills in at least one object-oriented programming language (Java, Scala, C++, Python, etc.)

• Strong skills in parallel processing technologies and languages: Hadoop, Spark, Scala etc.

• Experience with Python applied to machine learning (Pandas, Scikit-learn, Scipy, Numpy etc.)

• Strong knowledge of machine learning techniques required (KNN, random forest, Bayesian statistics etc.)

• Strong knowledge of deep learning frameworks preferred (TensorFlow, Keras, PyTorch, Caffe, MXNet)

• Cloud: AWS / GCP / Azure

• DevOps: Kubernetes, Docker, CI / CD

• Building REST API

• Knowledge of relational database management: SQL, PostgreSQL, Microsoft SQL etc.


• Knowledge of NoSQL databases (MongoDB, Neo4j, Redis etc.)

• Unix-based command line & development tools

WORK ENVIRONMENT:

• Fast-paced, intellectually intense, service-oriented environment

• Position is located in Gamma European hubs (Paris, London, Germany, Nordics etc.)

• Fluency in local language and English is required

• Expect time spent traveling

Date Posted:
4-Feb-19

We are looking for an expert who can lead us to success in the big data and database field. The position is a senior technic

If you have extensive cutting-edge technology research and system design experience, and extensive practical application

About the Ireland Research Center

The mission of the Irish Research Center is to position Huawei as a recognized technology leader in global information and

About IT Product Line

To become an innovator and leader in cloud data centers, Huawei's IT product line focuses on IT infrastructure and promo

JOB RESPONSIBILITIES:
As the lead of the Data Lake Platform Development team, you will help to create the next generation Big Data Analytics pla
Lead in analyzing the software requirements and software elements for Big Data Platform design.
Self-motivated AI engineer who takes ownership of the design and implementation of SW components
Take charge of the design and code writing for a specific cross-sub-system or codes of key algorithms.
Work with cross-functional teams to integrate AI-based solutions into production SW Stack
Text and Language processing, classification, summarization, topic extraction, and de-identification using state-of-the-art m
Research and design new ideas and develop them in the real world
Lead the development of the Big Data Platform and its integration with existing services
Work closely with the other teams to ensure architectural integrity.
Participate in different open source and standard meetings to present solutions
Travel as work requires, including visiting our HQ in China 2-4 times per year.
Reporting to the Chief Big Data Architect for the IRC Big Data Team
JOB QUALIFICATIONS:

Professional Knowledge:
PhD or Master's in Computer Science, Computer/Electrical Engineering. Bachelor's degree with related industrial experienc
10+ years of work experience in related fields for principal level
Professional Skills:
Strong programming skills with Python, C, C++, Java
Hands-on experience in one of the AI frameworks such as TensorFlow, Caffe, etc.
Demonstrated strong background/experience in Natural Language Processing
Strong background in deep learning models such as those with RNNs, LSTMs, and encoder-decoder.
Experience in deploying AI models in real industrial platform/hardware is required
Experience developing and using virtualization, container-based and cloud platforms such as Kubernetes, OpenStack, Swa
Location

This is a full-time position at our Ireland Research Center based at Townsend Street in Dublin 2, Ireland.

Huawei Ireland
Senior Data Scientist
CT Operations Labs' mission is to be recognized leaders of world class intelligent and autonomous operations by driving tec

As a Senior Data Scientist, your knowledge and experience in Data Science methodologies will be applied to solve challeng

Key Responsibilities include:


Contribute to rapid and iterative development of validated minimum viable solution addressing these needs. This includes
Lead studies and creative usage of new and/or existing data sources. Work with architects to leverage existing data model
Collaborate with product development teams and partners in Businesses to industrialize machine learning models and solu
Develop new, and apply/extend existing, concepts, methodologies, and techniques for cross-functional initiatives.
Requirements
PhD in Electrical Engineering, Computer Science, Artificial Intelligence, Machine Learning, or related field
2+ years of Machine Learning experience in data science.
Demonstrated practical understanding of the specification, design, documentation, coding, testing and release of commer
Strong Programming skills in various languages (Python, golang, Java, R or C++)
Proven skills in Machine Learning, e.g., linear/logistic regression, discriminant analysis, bagging, random forest, Bayesian m
Strong skills in the use of current state of the art machine learning frameworks such as Scikit-Learn, H2O, Keras, TensorFlo
Demonstrated ability to develop new algorithms and methodologies from leading open source initiatives and research pap
Experience with Big Data technologies such as Hadoop, etc.
Strong knowledge in Statistics, e.g., descriptive analysis and supervised and unsupervised analysis
Problem-solving ability and lateral thinking capability.
Experience with Natural Language Processing (NLP) techniques.
Prior experience in applying Machine Learning to Knowledge Graphs.
• Publication track records in machine learning and computer science conferences and journals, including, but not limited

• Experience in IP Filings and Patents is a great plus.

Benefits

This is a permanent position at our R&D center based in Dublin, Ireland. Business trips may be requested as necessary to s

Recruitment Privacy Notice: http://career.huawei.com/reccampportal/portal/hrd/weu_rec_all.html

Job Title: The Dock Senior Data Scientist (ML & DL)
The Dock Dublin 2

The Dock is a diverse team of creative problem-solvers within Accenture where design, business and technology meet und

We believe future commercial success will come from businesses that are conscious of the intended and unintended conse

We have an opportunity for a Senior Data Scientist in our Analytics & Artificial Intelligence (A&AI) Team. Operating across

We are looking for an experienced hands-on Data Scientist with proven Machine Learning and Deep Learning solution dev

What you'll be doing

The Analytics & AI team within the Dock applies leading Analytics & Artificial Intelligence (AI) technologies and techniques

• Take a leading role in teams delivering innovative Analytics & AI projects; ranging from research, proofs-of-concept to th

• Be a deep hands-on expert practitioner, owning and shaping development and experimentation in the application of Ma

• Use AWS, Azure and/or GCP tools and services to develop prototype and scalable Analytics & AI solutions, demonstrating

• Work closely with software engineering colleagues to develop robust Analytics & AI applications encapsulated in softwar

• Drive the utilization of best practice in Analytics & AI technical delivery, methods and approaches on project teams.

• Actively contribute to bringing Analytics & AI perspectives to experimentation, design, workshops, sprints and prototypin

• Embrace working in multi-disciplinary teams and collaborate with designers, software developers, business experts and A

• Lead and mentor junior team members in projects and in their technical and professional development.

• Understand the key technology trends in the Analytics & AI domain, the business implications, and be able to match the

• Bring your passion for improving the world through technology innovation to our team and demonstrate the ability to th

Main Roles and Responsibilities

Analytics & AI Expertise

• Applying your deep technical expertise to solve problems and develop new solution offerings for our business and our cl

• Bringing deep expertise in developing data-driven solutions in Machine Learning, Deep Learning and Advanced Analytics

• Applying your expertise and hands-on coding experience with AWS, Azure and/or GCP tools and services to develop Ana

• Keeping up to date and learning the capabilities of leading AI platform solution providers and the AI ecosystem; including

• Applying data science, data engineering and solution architecture knowledge to design and develop robust end-to-end A
• Demonstrating deep understanding of the potential utility, limitations and challenges of a wide range of Advanced Analy

Ownership & Delivery of projects

• Being responsible for shaping, owning and delivering complex Advanced Analytics & AI applications and experiments and

• Being responsible for estimating, planning and managing your own work and the work of analytics teams.

Collaboration & Working in Teams

• Leading teams and nurturing new skills and capabilities in these teams. Contributing to the performance management an

• Interacting, collaborating and sharing experiences with colleagues from different disciplines and backgrounds.

Communicating & Working with Stakeholders

• Presenting to, interacting with and managing client and business stakeholders, clearly articulating and communicating An

• Producing quality technical components, documentation, write-ups, papers or articles that capture the value of the work

Developing the A&AI Community @ The Dock

• Contributing to the growth and development of the Analytics & AI capability and community in the Dock, outside immed

• Contributing to steering and shaping the development of the Innovation pipeline in Analytics & AI at the Dock, both in te
Requirements

• Master's degree or higher in Computer Science, Mathematics, Engineering, Artificial Intelligence or a closely related disci

• Deep hands-on experience in a number of Advanced Analytics, Machine Learning, Deep Learning, NLP, Knowledge Graph

• 5 years' experience in designing and implementing Advanced Analytics systems for R&D or commercial applications. Indu

• 5 years of experience with project or product development methodologies, for example agile development and CRISP-DM.

• Proven proficiency in several Analytics & AI related tools, programming languages & frameworks.

• Proven experience in designing and developing AI based systems and applications architected for scale.

• Experience in standards, methods and best practice for Advanced Analytics development, solution quality, accuracy and

• Hands-on experience and expertise in a number of statistical and application programming languages (e.g. Python, R, Jav

• Proven technical experience in solution development with AWS, Azure and/or GCP preferred.

• Experience in leading technical teams and in managing and mentoring team members.
• Experience in dealing with multiple business and technical stakeholders.

• Ability to work in a global collaborative multi-disciplinary team environment.

• Proficient verbal and written communication skills in English.

About Accenture & The Dock

Accenture is a global management consulting, technology services and outsourcing company, with more than 490,000 peo

Accenture is an equal opportunities employer and welcomes applications from all sections of society and does not discrim

Sage acquired AutoEntry in September 2019 - demonstrating our commitment to innovation and adding value to Sage Bus

Message me to find out more - james.lord@sage.com

Key Responsibilities Key accountabilities and decision ownership:

• Solving problems from ideation to production, using machine learning.


• Experimenting, training, tuning, and shipping machine learning models.
• Writing production-quality code.
• Exploratory data analyses and investigations.
• Working with product managers to translate product/business problems into tractable machine learning problems.
• Working with machine learning infrastructure engineers to ship models.
• Presenting findings, results, and performance metrics to a broad range of stakeholders, including senior management.
• Influencing the broader development of the data science discipline within the organisation
• Be the subject matter expert demonstrating mastery of the delivery and use of Data Engineering techniques and Science
• Empower internal stakeholders, using the art of the possible, to gain new meaning from data

Must have:
• Strong theoretical foundations in linear algebra, probability theory, optimization.
• Strong programming skills in Python.
• Experience in working with numpy, scipy, scikit-learn, pandas.
• Experience shipping production machine learning models.
• Experience communicating projects to both technical and non-technical audiences.
• Experience reporting machine learning accuracy in industry.
• You are familiar with (in no particular order): logistic regression, gradient descent, regularization, cross-validation, overfi

You may be a fit for this role if:


• You’re very comfortable with investigating open-ended problems and coming up with concrete approaches to solve them
• You know when to use machine learning and when not to!
• You’re a deeply curious person.
• You often think about applications of machine learning outside of your work life.

Desirable:
• PhD in Computer Science, Electrical Engineering, Statistics, Physics, or similar quantitative fields.
• Publications in top conferences.
• Experience writing complex SQL queries.
• You have deep experience with these things: logistic regression, gradient descent, regularization, cross-validation, overfi

Technical / professional qualifications:


• MS in Mathematics, Computer Science, Electrical Engineering, Statistics, Physics, or similar quantitative field.
• Real world application development experience
• Experience in IT roles across development and architecture with demonstrable experience and knowledge across all IT d
• Hands-on experience with technologies that underpin Big Data and Data Integration including Hadoop, Spark, Scikit-Learn
Function Product
Country Ireland
Office Location Dublin
Location Region

Huawei Ireland
3.5
Data Scientist - Knowledge Graph & ML Researcher (Contract)
Dublin

Huawei Ireland Research Centre is starting a new research team that will focus on research and developing AI algorithms f

The Ireland-based research team will work closely with the Huawei Consumer Business Group (CBG) in the Headquarters, C

Key Responsibilities
Responsible for building NLP-related model capabilities to address service requirements and issues.
Responsible for conducting scientific comparative analysis of operation data and find the proper AI techniques to apply to
Responsible for encapsulating the AI modules into workable research proof-of-concepts.
Responsible for handling huge amounts of data and, hence, building scalable machine learning models.
Responsible for technology transfer of the developed AI models and working closely with Huawei business units.
Skills & Qualifications
PhD in a relevant field is preferred (ML, Knowledge Graph, Computer Science…) and 2-5 years of experience applying adva
Experience with Natural Language Processing (ML, Knowledge graph) techniques is a must.
Experience with Deep Learning (NNs, Recurrent NNs, Convolutional NNs, Encoder-Decoder with Attentions, Bayesian Deep
Experience working with Deep Learning Programming frameworks such as Keras, PyTorch, TensorFlow
Proficient in Supervised Learning, Un-Supervised Learning, and Semi-Supervised Learning algorithms.
Experience with Model-based Reinforcement Learning, MDPs and optimization is a good plus.
Publication track records in machine learning and computer science conferences and journals, including, but not limited to
Experience in Cloud or Data Centres optimizations using AI is a great plus.
Recruitment Privacy Notice: http://career.huawei.com/reccampportal/portal/hrd/...
Woebot
4.1
Senior Machine Learning Engineer
Dublin

WHY YOU SHOULD JOIN OUR DATA ENGINEERING TEAM:

As a Machine Learning Engineer, you will work closely with our Data Science, Product and Engineering teams to develop &

How You'll Thrive

Ramping Up
In your first 2 weeks, you'll learn about the Woebot content architecture and how ML and NLP are used to guide conversa
In your first 3 weeks, you'll list improvements that could be made to our existing set of classifiers.
Own Our Machine Learning Models, Systems & Processes
During your first 45 days you will develop infrastructure for the full cycle of our machine learning efforts, this includes, mo
To accomplish this you'll collaborate with engineers to integrate algorithms efficiently with backend production services w
You will also build machine learning models that enable Woebot to more naturally understand users' natural language inp
Improve Woebot's Sentiment Analysis.
Work with our Product team to define solutions and integrate them into a 1-year roadmap.
Help scale our services using GPUs and modern distributed processing tools in the cloud (AWS).
Dig into data with ad hoc analysis as necessary for technical, clinical, and user needs.
Help Woebot Have More Natural Flowing Conversations
Within your first 60 days you will be responsible for gathering data and building data labeling systems so that Woebot con
You will have unique datasets to work with, such as: 1M+ user conversations, support tickets, and other natural/unstructu
Turn Millions of Data Points Into Valuable Insights
By day 90 you will create user profiles and build models that define how Woebot interacts with each user. We consider pe
You will improve our machine learning models that enable Woebot to more naturally converse with and understand users
After a few months you're conducting deeper analysis to improve models that enable Woebot to derive insights about ind
To accomplish this you'll work closely with our Product team to deliver these insights at the right time and in the right man
This Might Be Your Next Career Move IF:
You care about helping make quality mental health care realistically accessible to millions of people nationwide.
You passionately follow the latest trends in NLP and are excited by the challenge of applying the latest research to real-wo
You want to get closer to the data and realize that advances in algorithms often come second to high-quality data.
You love tackling big meaningful issues with data, even though they are often hard to measure.
Core Competencies
You've deployed natural language processing and/or deep learning models using spaCy and/or NLTK
Experience deploying systems into production, at scale, using Docker, Kubernetes, or ECS
Knowledge of one or more modern ML/NLP frameworks, such as PyTorch or TensorFlow
Bachelor's, Master's or PhD in Computer Science, Mathematics, Data Analysis or Machine Learning or related technical field
Ability to productize the latest research and establish a clear vision for how ML/NLP can be used in mental health
Agile go-to-market product mindset: we do research and we innovate, but we also ship often
Strong written and verbal communication skills
Our Core Values
Empathic: Place a high value on user-experience. Motivated to help others be successful.
Proactive & flexible: Hit the ground running. Even with ambiguity, you can get the job done.
Self-awareness and growth-mindset: Wants to learn and grow in the role.
High standards: Take pride in your work and apply high standards toward everything.
Strong work ethic: Work hard to get the job done.
Benefits
Competitive Salary
Health, Dental & Vision
Employee Volunteer Program

Bank of Ireland
3.1
Senior Data Scientist
Dublin

Division Description

Led by the recently appointed Chief Marketing Officer, we are creating a Group Marketing function which will develop a stro

The Group Marketing function will be underpinned by the following strategic imperatives designed to transform the Bank

Description of business unit

The Group Customer Analytics team sits within the Group Marketing function and supports our customers, colleagues and

In a changing market environment, where customer expectations continue to grow and there is an increasing demand for

Purpose of the Role

The purpose of this role is to play a lead technical role in a team responsible for the development of analytical solutions to

Key Accountabilities

Deliver advanced data science projects that transform complex problems into compelling marketing and customer insights
Using the vast amounts of data available, develop analytical models that support our customers and deliver on commercial
Lead the development of advanced analytical projects, such as propensity modelling, next best action modelling and text a
Act as technical expert on data mining, machine learning, statistical analysis, and modelling.
Create processes to measure the value of modelling to the organisation with clear targets, KPIs and measurement in place
Identify and integrate new internal and external data sources including internal structured data, semi-structured web data
Use data visualization tools to explain your results simply and succinctly to senior audiences.
Provide technical leadership to a team of highly skilled data scientists, while mentoring and developing junior members of the

What is the opportunity

This is an exciting opportunity to take a senior position on a talented data science team to solve complex problems focused
Essential Qualifications

Third Level qualification in Maths, Computer Science, Statistics, Economics, Machine Learning, Analytics or an equivalent q

Essential Skills & Experience

At least 5 years analytics experience


Ability to manage and analyse large data sets with analytic rigor by using the usual toolset of statistical and machine-learn
Good understanding of supervised and unsupervised learning algorithms and strategies for data processing, data/text min
Experience with the standard data science stack like Python on Jupyter Notebooks with Pandas, Scikit-learn, H2O.ai, Spark
Experience in Marketing Analytics or recommendations systems is a big plus, for example, propensity modelling, recomme
You have an understanding of relational and non-relational databases, can wrangle data in different formats like JSON, flat-fil
You have experience in deploying models in a production environment.
Proven ability to work effectively with colleagues from different disciplines across business and technical functions. Excepti

Desirable Qualifications, Skills & Experience

Master's or PhD in Maths, Statistics, Computer Science, Economics, Engineering or Machine Learning.
Web-scraping skills are an advantage
Knowledge of other tools like Cloudera Data Science Workbench is a plus
Experience with Big Data technologies or with Digital/Web analytics a distinct advantage.
Knowledge of Cloudera Data Science Workbench, and associated analytical tools (i.e. PySpark)
People management and coaching experience.

Key Competencies

Customer Focused - Self


Accountable - Self
Champion Transformation - Self
Amplify Capability - Self

Developer position in Radio Software


Stockholm, Sweden
Product Development

Date: Apr 27, 2020


Unit Information:

PDU Radio Products is a part of Development Unit Networks (DNEW) and has an overall responsibility for development of
Radio SW is one of the sectors in PDU Radio, where Kista (Stockholm) is our base but the world is our arena. Radio SW pro
We are now looking for a talented experienced developer to work with one of our teams.

Position Summary:

As a Developer at Radio SW you will be responsible for build and delivery of our Radio SW products following Ericsson com

Responsibilities & Tasks:


Develop and maintain the automated build and delivery environment including frameworks and tools.
Track build and delivery failure issues until finished.
Work with Track Management in maintaining SW track and branch strategy.
Take part in early project phases to secure smooth adaptations needed in the automated build and delivery environmen
Develop and communicate the SW CM strategy for Radio SW.
Interact with both internal and external Radio SW parties to align requirements and share information about our build and
Provide SW CM training to developers and document all processes and procedures

Position Qualifications:

Core Competences:

Very good experience of SW component build and deployment tools, such as GNU AutoTools and The Yocto Project includ
Experience in automation of build and delivery environments using Jenkins or similar scheduling utility for implementing a
Experience in Product Life Cycle Management and the Software Development Life Cycle including version control to
Experience in using Unix variants, with ability to program or script if needed in Bash, tcsh, Python, Perl, Ruby, Java, C++.
Experience of Modular Software, Software reuse and SW development in general, ability to compile and link libraries, unde
Knowledge of Agile and Lean methodologies such as Scrum and Kanban.
Good knowledge of radio systems
Radio SW/HW product development knowledge
Knowledge about the DURA functional framework including the CI-machinery

Minimum Qualifications & Experience Requirements:

You have a minimum Bachelor’s degree in Computer Science or Electrical Engineering or equivalent and:

a minimum of 1 year of working experience in automation of large build and delivery systems
a minimum of 3 years of working with SW development
a minimum of 1 year of experience working with Yocto or similar build and deployment tools.
good English skills in spoken word and writing

Additional Requirements, Physical Demands, Region/Local Specifications:

Knowledge Sharing & Collaboration Skills


Communication Skills
Delivering results & meeting customer expectations
Planning & Organizing
Analyzing
Writing & Reporting
Following Instructions & Procedures

As a person you:


are a self-driven and energetic team player
look forward to working in a Cross Functional Team with people from different disciplines and to also work outside your own
are willing to broaden your competence and get a deeper understanding of our products and our build and delivery enviro
like challenges and want to constantly improve

Applications:

For any enquiries and for application details, please contact the responsible Recruiter, Suma Haregoppa Venkatagiri at sum

Please note that you need to submit the application in English, and we do not accept any applications via email. The last d

Java Backend Developer


Istanbul, Turkey
Network Operation and Integration

Job Description
Date: Apr 22, 2020
At Ericsson, you can be a game changer! Because working here isn’t just a deal. It’s a big deal. This means that you get to le

Are you in?

Our Exciting Opportunity


We are now looking for a Java Backend Developer who will contribute to the creation of customer solutions. As a developer, yo

You are responsible for ensuring that the delivered software components provide vital functionality and perform in accord

You Will,
Develop & Maintain different projects in the Telco sector.
Improve your technical knowledge every single day and share your knowledge with your team
Work with the Quality Assurance Team to make things smooth and workable to offer high-quality products and service

To Be Successful In The Role You Are


Min. 2 years of experience in developing Java-based backend systems
Solid understanding of Java Technologies
Spring/Spring Boot experience is a big plus
Understanding of OOP and database concepts is required
Experience with frontend JavaScript Technologies like Angular & React JS is a plus
Familiarity with DevOps (Jenkins, Docker, etc.)
Ability to effectively articulate technical solutions across technical and stakeholder groups
Solution-oriented, responsible, and able to take the initiative
BS or MS in Computer Science or related field preferred
No military obligations for male candidates, or postponed for 1 year
Proficiency in speaking & writing in Turkish
Java Fullstack Developer
Istanbul, Turkey
Network Operation and Integration

Date: Apr 22, 2020


At Ericsson, you can be a game changer! Because working here isn’t just a deal. It’s a big deal. This means that you get to le

Are you in?

Our Exciting Opportunity


We are now looking for Java Full Stack Developers who will contribute to the creation of customer solutions. As a Java Full Stac

You are responsible for ensuring that the delivered software components provide vital functionality and perform in accord

You Will,
Develop & Maintain Java Backend services and frontends using ReactJS / Angular
Your developed full-stack applications should ensure outstanding quality, performance, security and documentation stand
You will be located in scrum teams each dedicated to a different product line
Improve your technical knowledge every single day and share your knowledge with your team
Work with the Quality Assurance Team to make things smooth and workable to offer high-quality products and service
To Be Successful In The Role You Are
Min. 2 years of experience in developing Java-based full-stack systems
Experience with Core Java, Spring, Spring Boot, Hibernate
Good level of understanding of Java technologies
Good software engineering academic background
Experience with React JS and/or Angular JS
Experience with HTML, CSS, JavaScript
Good knowledge of relational database technologies, especially Oracle
Familiarity with DevOps technologies such as Jenkins, Docker
Solution-oriented, responsible, and able to take the initiative
BS or MS in Computer Science or related field preferred
No military obligations for male candidates, or postponed for 1 year
Proficiency in speaking & writing in Turkish

Software Developer for 5G


Göteborg, Sweden
Product Development

Job Description
Date: Apr 20, 2020
Software Developer

We are looking for a skilled software developer to help us build the applications in Ericsson’s 5G Core network, connecting
In addition to having solid software development skills, we also hope you would be interested in taking on the Scrum Mast

Typical tasks and responsibilities:

Develop high-quality software meeting telecom standards


Analyze requirements and propose a software design
Analyze verification needs and develop test cases validating your design
Develop test case automation
Execute tests, troubleshoot, and correct faults found
Lead and coach team through Scrum ceremonies

Skills and experience we value:

Master’s or bachelor’s degree in Computer Software Engineering or similar field


Proven skills in software development, preferably using Erlang or other functional languages
Experience of large-scale software development projects
Experience of agile development and the Scrum framework
Knowledge of Git, Gerrit, Jenkins, HTTP/2, TCP/IP, Linux, Containers

Characteristics we hope to see in you:

You are passionate about software and product development


You enjoy working with a team towards a common goal
You continuously strive to improve yourself and your output
You have a structured and disciplined approach to work
You are willing to learn and tackle complex challenges
You like communicating and networking in small and large teams

Experienced Software Developer (C/C++)

Job Description
Date: Apr 20, 2020
At Ericsson, you can be a game changer! Because working here isn’t just a deal. It’s a big deal. This means that you get to le

Are you in?

Our Exciting Opportunity


The Smart Services Router (SSR) provides operators with a highly scalable, consolidated platform that offers services for bo
Sounds interesting? Apply now.

You Will,
Design, implement and unit test features that meet the specifications and requirements
Interact with product management, system and technology team, software/hardware development teams, and stakeholde
Perform troubleshooting and support customer needs
Address complex technical challenges that warrant innovative and future proof solutions
Drive continuous improvements of products and processes
To Be Successful In The Role You Are
Deep experience in C and C++
Knowledge of scripting languages (Perl, Bash, Python)
Solid understanding of Linux OS
Networking and IP protocol experience, knowledge of system software
Have good basic knowledge of Lean and Agile principles
Ability to work in agile team (Scrum, Kanban)
5+ years’ experience in similar role
BSc or MSc degree in Electronics Engineering / Computer Science

C++ developers for developing the heart of mobile internet @ 5G


Göteborg, Sweden
Product Development
Job Description
Date: Apr 14, 2020
Here’s the deal

As the tech firm that enabled the mobile internet connectivity around the world, at Ericsson we’ve made it our business to

We are on a quest, we’ve promised to never stand still; relentlessly innovating to make technology easy to adopt, easy to

Tech you’ll master!

With us you’ll be part of the next step developing our products. We enable millions of simultaneous connected mobile use

For this you’ll learn to master groundbreaking technology in IP networks, distributed real time embedded systems executin

Our office and our terms

The Ericsson office is located at Lindholmen in central Gothenburg, with beautiful scenery and modern facilities. We’re nex

You’ll be enjoying all the benefits of a good collective agreement, as well as a personal health account, gyms in the house,
The position

We are currently looking for software developers that have knowledge in cloud technologies and tools, container/microse

We stand at the forefront of Agile software development, using methods and principles like Lean and Scrum. Early custom

The team will design, implement and test the feature, product and system from requirements to production and commerc

About you

You are passionate about what you do which is obvious from your actions
You have a talent for software development and computer systems
You are comfortable learning from team members and sharing with team members
You are continuously developing your knowledge through experience, as well as reading and experimentation
You love solving problems and digging into complex ones
You have focus and ambition to understand customer needs on developed features
You take pride in understanding the whole product and its environment
Software is more than just a job for you

Technologies we use and need to master

As we are looking for developers to cover several different areas within our Packet Core Controller development, the used

C, C++, Python, Java


Development on Linux
Data communication and networking (Cloud networking, Container networking, Private/Public Clouds)
Build systems (CMake, GNU Make)
TCP/IP, HTTP
Shell scripting
Experience in working with databases
Containers & Virtualization: Kubernetes, Docker, KVM, VMware, Helm
Service Based Architecture, Microservices
Git, Gerrit, Jenkins

Qualifications

MSc/BSc level in Computer Science or the equivalent level of knowledge


Proficient in English, written and spoken.

BI Developer
Kraków, Poland
Network Operation and Integration

Job Description
Date: Apr 9, 2020
Job Summary:
Currently, for Application Development and Maintenance unit that is responsible for cooperation preparing applications an

As a new joiner, you will be a part of the team that works hard to recognize the importance of delivering the highest quali

Key Responsibilities:
Designing and developing BI solutions based on Microsoft Business Intelligence stack focused on SSAS and SSRS with a bit
Providing high level estimations based on a given requirement,
Improving and maintaining the software delivery environment (continuous integration, continuous delivery),
Hands-on development and end-to-end responsibility for features delivery (initial requirement, architecture, story refinem
Cooperating with team to maintain the architectural/platform road maps, practices and standards,
Following best software development practices to ensure high quality product and adherence to design objectives,
Sharing your knowledge and mentoring less experienced team members.
Key Qualifications:
Experience in Microsoft BI Stack (in particular: SQL Server, SSAS Multi-Dimensional model, SSRS),
Experience in working with DWH database platforms and understanding of Data Warehouse and ETL standards,
Basic understanding of Agile methodology (Scrum),
Average level of written and verbal communication skills in English,
Team oriented – collaborative style willing to support others and ask for help when needed.

What We Offer:
Stable employment on the basis of an employment contract;
Work based on developing the latest solutions in the area of mobile technology;
Clearly defined career paths, trainings;
Rich benefit package (private medical care for the employee and their family, life insurance, Ok System MultiSport);
Work in an international environment based on cooperation;
Flexible working hours, laptop and mobile phone;
Work – life balance.

Integration Engineer - RAN


Dalian, China
Network Operation and Integration
Job Description
Date: Apr 7, 2020
Job Summary:
Ericsson has a position open for an Integration Engineer covering the field of Radio Access Networks (RAN). In this role, yo

As an Engineer you will analyze, prepare, implement and verify the configuration and integration of a node, network and/o

Responsibilities

• Perform Integration and Network rollout activities of LTE/5G nodes


• Perform software upgrades and updates of LTE/5G RAN nodes
• Be involved in System/Function Verification and support activities
• Participate in LTE/5G RAN troubleshooting
• Participate in global support case handling process
• Technical interface towards sales and/or R&D
• Plan and co-ordinate activities with internal/external customers
• Support Project Managers as technical subject matter expert

Qualifications & Requirements:

• Bachelor’s degree or above from a first-tier university


• Fluent in English
• Business and management awareness
• Strong leadership and communication skills
• Comprehensive knowledge of technologies
• Well organized, with structured thinking
• Proactive, with a positive personality

Integration Engineer
Gurgaon, India
Network Operation and Integration
Job Description
Date: Apr 7, 2020
Ericsson is one of the leading providers of Information and Communication Technology (ICT) to service providers. We enab

Job Summary:
We are now looking for an Integration Engineer to analyze, prepare, implement and verify the configuration and integratio

Responsibilities:
You will support pre-sales activities, including pre-studies
Good experience in operations support with ITIL processes.
Very good hands-on experience in configuring ClickSoftware Service Optimization Suite (Click Schedule, Click Mobile, Click
Good knowledge of integrating Click Service Optimization Suite with other products/systems
Must have worked in large-scale deployment programs that involve multiple systems and integration
Good hands-on experience on Oracle database, PL/SQL scripting
Must have knowledge in Web deployment projects (.NET web services)
Good understanding of the SDLC (Software Development Lifecycle). Must have been involved in complete project lifecycle execution
Plan the implementation of the product configuration / integration work
Execute product configuration
Execute integration and migration work
Prepare system test, module test and acceptance test
You will work to identify and drive improvements
Post project activities
E2E technical understanding
Execute test
Scripting & coding
Knowledge sharing and collaboration skills
Key Qualifications:
Education: Academic degree, minimum on bachelor level, in engineering (IT, Telecom) or
Good hands-on experience in configuring ClickSoftware Service Optimization Suite (Click Schedule, Click Mobile, Click Plan
3-5 years’ experience deploying system tests and leading a testing team.
Min years of experience: (Recruiter to supply)
Domain experience: (Recruiter to supply area of expertise – e.g.: Cloud, BSS, OSS etc.)
Creating & innovating
Applying expertise & technology
Analytical learning and researching skills
Delivering results & meeting customer expectations
You will need excellent planning and organizing skills
Additional Requirements:
If you have ISEB/ISTQB software testing qualifications that would be an advantage

C++ Developer
Guangzhou (Canton), China
Product Development
Job Description
Date: Mar 24, 2020
Job Summary:
We are now looking for a Developer to maintain products (units, nodes, networks, systems and solutions). Your role will in

Responsibilities And Tasks


Responsible for solution design and product development.
Work in a Scrum team, follow the Scrum way of working, and ensure the Scrum team runs efficiently.
Key Qualifications:
Knowledge in Linux OS is mandatory.
Knowledge in TCP/IP and network protocols, such as BGP/GRE etc.
Competence in virtualization, such as KVM, Docker container.
Competence in high performance networking technologies, such as SR-IOV, DPDK.
Knowledge of container orchestration (e.g. Kubernetes, Helm, Istio) and cloud-native design experience.
Familiar with automation tools such as Ansible, and scripting in Shell and Python.
Technical leadership mentality and willingness to take on challenges.
Be familiar with 3GPP standard and architecture on 4G/5G core network
General Qualifications:
Education: University degree or above.
Hard working, great teammate.
Written and spoken English.

Developer position in Radio Software


Job Description
Date: Apr 27, 2020
Unit Information:

PDU Radio Products is a part of Development Unit Networks (DNEW) and has an overall responsibility for development of
Radio SW is one of the sectors in PDU Radio, where Kista (Stockholm) is our base but the world is our arena. Radio SW pro
We are now looking for a talented experienced developer to work with one of our teams.

Position Summary:

As a Developer at Radio SW you will be responsible for build and delivery of our Radio SW products following Ericsson com

Responsibilities & Tasks:

Develop and maintain the automated build and delivery environment including frameworks and tools.
Track build and delivery failure issues until finished.
Work with Track Management in maintaining SW track and branch strategy.
Take part in early project phases to secure smooth adaptations needed in the automated build and delivery environmen
Develop and communicate the SW CM strategy for Radio SW.
Interact with both internal and external Radio SW parties to align requirements and share information about our build and
Provide SW CM training to developers and document all processes and procedures

Position Qualifications:

Core Competences:

Very good experience of SW component build and deployment tools, such as GNU AutoTools and The Yocto Project includ
Experience in automation of build and delivery environments using Jenkins or similar scheduling utility for implementing a
Experience in Product Life Cycle Management and about the Software Development Life cycle including version control to
Experience in using Unix systems, with ability to program or script if needed in Bash, tcsh, Python, Perl, Ruby, Java, C++.
Experience of Modular Software, Software reuse and SW development in general, ability to compile and link libraries, unde
Knowledge of Agile and Lean methodologies such as Scrum and Kanban.
Good knowledge of radio systems
Radio SW/HW product development knowledge
Knowledge about the DURA functional framework including the CI-machinery

Minimum Qualifications & Experience Requirements:

You have a minimum Bachelor’s degree in Computer Science or Electrical Engineering or equivalent and:

a minimum of 1 year of working experience in automation of large build and delivery systems
a minimum of 3 years of working with SW development
a minimum of 1 year of working experience with Yocto or similar build and deployment tools.
good English skills in spoken word and writing

Additional Requirements, Physical Demands, Region/Local Specifications:

Knowledge Sharing & Collaboration Skills


Communication Skills
Delivering results & meeting customer expectations
Planning & Organizing
Analyzing
Writing & Reporting
Following Instructions & Procedures

As a person you:

are a self-driven and energetic team player


look forward to working in a cross-functional team with people from different disciplines and to also work outside your own
are willing to broaden your competence and get a deeper understanding of our products and our build and delivery enviro
like challenges and want to constantly improve

Applications:

For any enquiries and for application details, please contact the responsible Recruiter, Suma Haregoppa Venkatagiri at sum

Please note that you need to submit the application in English, and we do not accept any applications via email. The last d

Systems Architect in VoLTE


Budapest, Hungary
Product Development
Job Description
Date: Feb 21, 2020
Creating the future world

Add your career to the resources of Ericsson and amazing things can happen. We are a world leader in the rapidly changin

We are now looking for a Systems Architect in VoLTE area at MTAS to join our development organization in Budapest.

At our Development Center MTAS within the Product Development Unit (PDU) Converged Core we develop and support th

MTAS comprises several application servers providing a rich feature set enabling communication services for multiple type

You will love working in an organization with agile end-to-end (E2E) development teams and close collaboratio

The MTAS development organization is distributed across Sweden, Hungary and India. This position is located in Budapest

What you will do:

Define the scope and shape the requirements in collaboration with product management and technical leaders
Break features down into user stories and epics
Solution and system design in the areas of VoLTE/MTAS features.
Networking and collaboration with Market Areas, Customers and Strategic Product Management to understand customer
Developing in line with MTAS practices of lean, agile, continuous integration and continuous delivery
Contributing to MTAS architecture evolution and governance as part of the MTAS architecture community
Contributing to the continuous evolution of MTAS best practices in ways of working
Be a key person in knowledge sharing and learning
Have the opportunity to continuously develop your technical competence

Apply if you have

Education: Master of Science or corresponding university studies


At least 5 years of experience as a System Architect, Technical leader, Lead designer or in a similar role
Architectural approach
Domain experience:
Knowledge of communication services technologies VoLTE and IMS, preferably also Circuit Switched telephony technology
Knowledge of network function management, automation and life cycle management
System design for high availability, robustness, scalability and capacity
Solid knowledge of cloud-related technologies, like VMware/OpenStack Cloud
Business and customer focus
Strong driver mindset
Excellent English skills

Advantage if you have

Experience in agile, lean or DevOps development


Excellent communication skills on business and technical overview level as well as technical detailed level
Strong leadership skills in line with Ericsson values
Excellent collaboration and networking skills
Coaching & mentoring skills
A results-oriented mindset and knowledge of how to meet customer expectations
The ability to adapt, respond to change and innovate
Product development processes knowledge
Product life cycle management knowledge

What we offer

Flexible working hours


High standard working environment (recreation room, bicycle storage, parking lots…)
Easily accessible location
Competitive compensation package, wide range of cafeteria elements, annual bonus
International career opportunities
Great team spirit, fun events
Professional communities
Extensive professional development opportunities
Subsidized language courses
Mobile with 4G, laptop
Local medical service
Relocation support

What you need to know about recruitment at Ericsson

As a first step within our selection process, you will be asked to fill out our technical tests based on your experience and pr

The candidates who best match the criteria for the position will move on to the next step in the process – the interview.

Software Developer - 4G5G Broadcasting


Shanghai, China
Product Development
Job Description
Date: Feb 19, 2020

Position Description

• Work with leading-edge technology for Ericsson’s media storage and delivery solution

• Perform software design and verification: from requirement analysis, system design, implementation, verification till deli

• Co-operate with colleagues in Europe, U.S.A and Israel to ensure a large-scale E2E solution with good quality

• Perform troubleshooting on released software globally and travel across different countries if necessary

Qualifications

1. Required Qualifications and Skills

• Bachelor’s degree or above

• Knowledge of Object Oriented design (OOD)

• Solid Java (or C, C++/Lua) programming experience, at least 3 years

• Solid experience with Linux, including but not limited to configuration for different purposes, performance tuning and debug

• Familiar with network architecture and protocols, e.g. TCP/IP, UDP, HTTP, as well as traffic analysis

• Experience in network programming and multi-threaded programming is preferred

• Familiar with Nginx and its configuration

• Hands-on experience with OpenStack, Docker etc.

2. Preferred Qualification and Skills

• Good basic understanding of the LTE system architecture.

• Knowledge of J2EE, OSGi, Web Services is an advantage


• Experience with PostgreSQL and Hadoop

3. Others

• Knowledge of software engineering methods such as Agile, Scrum

• Good English communication skills (both written and verbal)

• Accountable, able to work independently, easy to work with, and with a strong teamwork spirit

• Team leading experience

• Ability to learn new technology quickly and apply it to work tasks

• Innovative, solution-oriented thinking

• Open-minded and willing to take on challenges

Software Developer - 5G Core Load Testing Tool


Shanghai, China
Product Development

Job Description
Date: Feb 12, 2020
Job Summary:

We are developing a world-class Load Testing Tool (named Dallas) supporting the continued success of Ericsson’s Packet Core

Dallas is the essential part of the R&D process that maintains the SUT (System Under Test) in premium quality standard

You will be immersed in ICT buzzwords, e.g. 3GPP, 5G, Virtualization, Cloud, Distributed Systems, Kubernetes, etc.

Responsibilities & Tasks:

Work as a developer to design and test high-quality software products.


Support customers during the acceptance test phase for the features developed
Drive or contribute to the continuous improvement of defined concepts, such as architecture and refactoring, unit/component te
Keep learning and trying new things in software craftsmanship that suit and benefit the team.
Position Qualifications:

Bachelor’s degree or above in Computer Science, Telecommunication, Information Technology, or Electronics.
Good knowledge of C/C++ programming
Good knowledge of software development methods and tools (Object Oriented design, design patterns, UML, Rose)
Good knowledge of data structures, e.g. array, list, set, queue, tree, etc.
Good knowledge of Linux
Good knowledge of Python and Erlang is preferred
Good knowledge of Unix (Solaris) and VxWorks is preferred
Good English communication skills (both written and verbal), able to express own thinking clearly
Ability to learn new technology quickly and apply to work tasks
Innovative and solution thinking abilities
Open minded and willing to accept challenges

Software (C++) Developer for 5GC


Shanghai, China
Product Development
Job Description
Date: Feb 12, 2020
Job Summary:
We at Ericsson Packet Core in Shanghai are now looking for talented developers to work together with us in Evolved Packet Gatew

Responsibilities:
. Work as a developer to design and test high-quality software products.
. Work directly with the product owner, understand customer requirements and come up with solutions.
. Work in cross-functional team, which is self-organizing, international and highly independent with Lean/Agile best practi
. Drive or contribute to the continuous improvement of defined concepts, such as architecture and refactoring, unit/componen
. Keep learning and trying new things in software craftsmanship that suit and benefit the team.

Key Qualifications:
. Bachelor’s degree or above, majoring in Computer Science, Telecommunication, Software Engineering or equivale
. Strong knowledge of C/C++, Python, Perl programming.
. Familiar with network architecture, protocols and traffic analysis with tcpdump, Wireshark and other packet analysis
. Linux development, working with high-performance Linux applications.
. Good English communication skills (both written and verbal), able to express own thinking clearly
. Good troubleshooting and debugging skills, including analyzing core dumps, memory leaks and performance issues.
. Excellent in team working and communication


What’s in it for you?

With over 90,000 employees across 180+ countries, we have a culture that respects and supports your ambitions, in alignm
Next Steps:

What happens next once you apply? Read about the next steps here

For your prep and reference, here is our overall Brand video and some insights about our innovations in 5G

Ericsson provides equal employment opportunities (EEO) to all employees and applicants for employment without regard

Ericsson complies with applicable country, state and all local laws governing nondiscrimination in employment in every loc

This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, terminati

Ericsson expressly prohibits any form of workplace harassment based on race, color, religion, sex, sexual orientation, marit

Primary country and city: China (CN) || || Shanghai || R&D

5G Data Exposure & Analytics Developer


Shanghai, China
Product Development
Job Description
Date: Feb 11, 2020
Job Summary:

CCES (Cloud Core Exposure Server) is a commercial realization of the 3GPP Network Function NEF.

It builds on the programmability of 5GC (SBA/MSA/CNA) and exposes network capabilities and structured data from 5G Core doma

CCES is fully developed based on Cloud Native principles, with a SW architecture based on micro service technology, with c

We sincerely invite you to join us, witness the coming success at the same time to develop your personal competence and

We can offer you:

The opportunity to participate in Ericsson's global R&D projects


Follow the trend of software development and new technology (e.g. 5G, big data, machine learning, etc.)
Enjoy the atmosphere of encouraging innovation and cooperation

Responsibilities & Tasks:

Work with product owner to understand customer requirements and come up with solutions
Work in Agile software development process to develop new features in an efficient way
Handle customer inquiry or perform product maintenance work with strong trouble-shooting skills
Design documentation like component description and modeling during system design and implementation
Co-operate with colleagues in Europe to ensure the software delivery on time with good quality

General Qualification

Education: Bachelor’s degree and above


Min years of experience: 3 years
Technically oriented and passionate about broadening skills in software development and continuous integration.
Teamwork and good communication skills across multiple teams.
Be able to work independently and respond to multiple and conflicting requests under tight deadlines and in a dynamic wo
Excellent communication and presentation skills in both oral and written English.
Clear mind, ability to break down complexity and communicate things in a clear manner.
Ability to learn new technologies quickly, identify and resolve issues, and think in terms of distributed systems.
Innovative mindset to break routines and make continuous improvement.

Technical Qualification

Good programming skills in Java, familiar with Java Core (Java 8 or later)
Good knowledge of multi-threaded programming and garbage collection
Good knowledge of OO concepts, design principles, patterns, and so on.
Good knowledge of databases such as PostgreSQL (relational) and Cassandra (NoSQL); performance tuning experience is preferred.
Familiar with Microservice, Container, and Kubernetes technology
Familiar with Helm - Package Management Tool
Familiarity with MSA (Microservice Architecture) is a plus
Web application development experience is a plus
Familiar with Python, Shell, and Golang, Netty, Spring-Cloud
Familiar with Git
Familiar with Maven, Gradle build tools
Familiarity with automation test frameworks and skill in writing test cases with JUnit in Java is preferred
A technical blog or open source / GitHub project is a plus
Familiar with TCP, HTTP protocols is preferred

Developer Radio System


Chengdu, China
Product Development
Job Description
Date: Jan 22, 2020
Job Objective:

Design the radio HW solution from a system view. Lead the technical team to implement the design and be responsible for the de
You will work in radio system design, covering the areas below:

Radio HW solution design


Radio Function solution design
Radio link budget on system level
Radio product design process handling
Participation in global radio system activities
Responsibility for the product release

Responsibilities:

Radio unit HW solution design: define the HW solution from system level.
Radio functions solution design: define the radio function solution, especially related with HW part.
Radio link budget: radio performance allocation, define the requirement on sub-level
Trouble-shooting
Write related design document
Process control of product development
Pre-study tasks for new techniques or new products
Technical coordination within team

Qualification & Requirement:

Deep understanding of telecom industry, especially on LTE system and 5G


Deep understanding of system role & responsibility
>3 years working experience in base station design
Usage of related instruments
Master’s degree or above, majoring in EE, CS, or similar
Well organized, with structured thinking
Proactive, with a positive personality
Fluent in English speaking/writing
Skills needed from Jobs

ude services, consulting, software and infrastructure within Information and Communications Technology.

orked Society: a world connected in real time that will open up opportunities to create freedom, transform society and drive solutions

performance-driven culture and an innovative and engaging environment. As an Ericsson employee, you will have freedom to think bi

eration of wireless networks, i.e., 5G network with distributed edge compute, that will drive economic and social transformation for al

ologies is what Ericsson uses to drive thought leadership to automate and transform Ericsson offerings and operations. MI is also a key

cceleration for our group in Bangalore and Chennai.

pact; going beyond ML/AI for consumption and advertising?

ontiers to automate, simplify and add new value through large and complex data.

caling, monitoring and performance. You shall build effective AI models using stacking/ensemble techniques; and provide prediction ex

g real-world problems as part of a highly dynamic and global team. You will work in a highly collaborative environment where you com
reation for Ericsson in AI/ML
QL Databases. Design data pipelines and flow strategies.
ard/canonical data models by combining multiple data sources.
nderstand MI-driven business needs and opportunities
hese needs. This includes working with petabytes of 4G/5G-networks, IoT and exogenous data, and proposing/selecting/testing predic
o leverage existing data models and build new ones as needed.
machine learning models and solutions as part of Ericsson offerings including providing source code, workflows and documents
nurturing the communities and mentoring junior data scientists.

skills and technology portfolio for MI’s needs


ting papers, organizing sessions etc.

ning, Electrical Engineering or related disciplines from any of the reputed institutes. First Class, preferably with Distinction.
experience of about 15+ years.

2O, Keras, TensorFlow, Spark ML etc.


ves and research papers
application domain

ble scenarios

as NOSQL DBs
older business units, global customers, technology and other ecosystem partners in a multi-culture, global matrix organization with se

ng of data science and Machine Learning tools. Your knowledge and experience in Data Science methodologies will be applied to solve

nderstand MI-driven business needs and opportunities

hese needs. This includes working with petabytes of 4G/5G-networks, IoT and exogenous data, and proposing/selecting/testing predic
o leverage existing data models and build new ones as needed.
machine learning models and solutions as part of Ericsson offerings including providing source code, workflows and documents

nurturing the communities and mentoring junior data scientists.

skills and technology portfolio for MI’s needs


ting papers, organizing sessions etc.

ning, Electrical Engineering or related disciplines from any of the reputed institutes. First Class, preferably with Distinction.
experience of around 10+ years.

park ML etc.
initiatives and research papers addressing their functionalities, scalability and overall industrialization viability
industry/application domain

older business units, global customers, technology and other ecosystem partners in a multi-culture, global matrix organization with se

ptimizing data delivery, re-designing infrastructure for greater scalability, etc.


a wide variety of data sources using SQL and cloud-based ‘big data’ technologies from AWS, Azure and others.
ta-related technical issues and support their data infrastructure needs.
ategic customers/partners.
optimizing our product into an innovative industry leader.
odel life cycle management systems.

w, Dataiku, etc.

oring (SQL) as well as working familiarity with a variety of other databases/data sources.


specific business questions and seek opportunities for improvement.

and workload management.

gn, development, integration and documentation using groundbreaking engineering and delivery practices
ud-based systems and solution in consultation with external and internal partners

ong background in Artificial Intelligence, Cognitive Computing or Machine Learning technologies

pen Protocol Communication – Unified Architecture)

tion protocol stacks

hrough our networks. We enable the full value of connectivity by creating game-changing technology and services that are easy to use
graphics representation and reporting i.e. R, Python

ent use cases.


ng, Trees/Random Forest, Bayesian statistics, SVM, Neural networks, deep learning/reinforcement learning, etc.

ou have a passion for developing and deploying digital solutions in an iterative way together with cross-functional teams? Join our dig

e solutions, and applying and industrializing advanced analytics / AI / ML applications. The ideal candidate is able to show a strong trac
matics, Physics, or related field (e.g. applied mathematics/statistics)
business problems (predictive modeling, customer segmentation / clustering, network analysis, etc.)

processing

sh as soon as possible. For any questions and clarifications reach out to the recruiter, Hema Powar, hema.powar@ericsson.com. As a p

note that you may be requested to complete the same, when you apply for this position.

eloped products. This function ensures that a stable and correct test environment is available, and to perform system related tests for
rdware licenses and inventory.

r test) implementation features.

tary bugs’ analysis and bugs reporting / verification.


stem that is being used by more than 120 mobile operators across the globe. We do not only have more than 40% market share, but o

cycle of the product. Indoor and outdoor positioning 5G, IoT, Cloud development, modern SW delivery pipelines are all technologies

y global footprint, join us.


in an Agile team

to join our team. Ideally, you are passionate about technology, a problem solver, and enjoy working in new areas and taking on new c

sifiers and building machine learning models to improve accuracy of system


products metrics.
ed quantitative discipline.
nalytical problems using quantitative approaches with your unique blend of analytical, mathematical and technical skills.
s, you have a keen desire to solve business problems, and live to find patterns and insights within structured and unstructured data.
ency, logical thinking ability, fast learning ability.

Naive Bayes, SVM, Decision Forests, etc.


are familiar with SQL, Python, Java.

o yong.sze.miin3@huawei.com, applications without a CV will not be considered. Please make sure to have your attachments in Englis

ices? If these describe you, we would love to hear from you!

ons, such as anomaly detection, user behavior prediction, and many more in large scale.
ce with applications.

ableau, Vertica, Pig, etc.

tand the question, and devise an analytical approach to reach actionable answers.

Driven by a commitment to sound operations, ongoing innovation, and open collaboration, we have established a competitive ICT portf

s one of the largest and fastest-growing platforms in the world. It has strong presence with over 40 availability zones located across 4 c

development, design and strategic engineering of our products.

ns team in Huawei Munich Research Center (MRC), to participate in the rapid prototyping of new features, innovation, and proof of co
than one third of the world’s population.
evenues in research and development, which enhances our competitiveness while driving industry and technology.
ollaborate with each other in a constructive and solution oriented manner.
e experts from around 50 different countries.

art of a multicultural team and growing environment, feel free to contact us. Driving future technologies with focus on customer satisf

than one third of the world’s population.


evenues in research and development, which enhances our competitiveness while driving industry and technology.
ollaborate with each other in a constructive and solution oriented manner.
e experts from around 50 different countries.

nologies and to introduce innovative solutions quickly.

ant to achieve exciting goals together. Are you ready?


on of complex solutions for data analysis and machine learning applications
cost-effective and flexible solutions
their implementation

nd Apache Beam
Lib or Storm / Samza are desirable

nt, Engineered products & infrastructure, Financial institutions, Health care payers & providers, Insurance, Media & entertainment, Me

ant challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today

ing deep industry and functional expertise and a range of perspectives to spark change. BCG delivers solutions through leading-edge m

machine learning with deep industry expertise. The BCG GAMMA team is comprised of world-class data scientists and business consult

asily interact with our analytics applications to measure the success of their new analytics enabled organization or quickly make decisio
date will have industry experience working on a range of different machine learning disciplines, eg anomaly detection, payment fraud

w we use technology, machine learning, and data to enable the productivity of our clients. You will help envision, build, deploy and dev

odels into production together with a highly multi-disciplinary team of scientists, engineers, partners, product managers and subject do

ations related to natural language processing and machine learning

++, Python, etc.)


osition is a senior technical role in Intelligent Data department at Huawei Cloud & AI Business Unit and it will definitely provide a uniqu

ive practical application knowledge in the field of big data and database, furthermore, you are keen to explore and innovate, and build

n global information and communication technology (ICT) solution providers. To achieve this goal, we are building an industry-recogni

nfrastructure and promotes simplified enterprise IT systems for service flexibility. Huawei is constantly pursuing innovation, providing

tion Big Data Analytics platform while collaborating with experts and working with cutting edge technologies in the Big Data space:

n using state-of-the-art machine learning techniques

ated industrial experience are welcome to apply.


ernetes, Open stack, Swarm, Docker, etc. is an asset

operations by driving technical innovations, influencing market and product development in order to deliver significant business value

applied to solve challenging real-world problems. Your contribution will also help to create new offerings in the areas of ML driven pla

hese needs. This includes proposing/selecting/testing predictive models, recommendation engines, anomaly detection systems, statisti
rage existing data models and build new ones as needed.
learning models and solutions including providing source code, workflows and documents

g and release of commercial software;

andom forest, Bayesian model, SVM, neural networks, etc.


n, H2O, Keras, TensorFlow and Spark, etc.
tiatives and research papers addressing their functionalities, scalability and overall industrialization viability

ncluding, but not limited to, ICML, NIPS, AISTATS, UAI, AAAI.

uested as necessary to support strategy engagements.


nd technology meet under one roof. We're a meeting place where Accenture, clients and partners connect to pioneer new ways to ful

ed and unintended consequences of their work. That's why we're passionate that true innovation must deliver value for Accenture, our

Team. Operating across all stages of the innovation spectrum, with a remit to build the future in real-time. The working environment i

ep Learning solution development skills. This role will focus on technical solution delivery across the Docks project portfolio; experime

nologies and techniques to address significant real-world business and societal challenges. Collaborating with our clients, designers, so

, proofs-of-concept to the delivery of real-world applications for our clients.

in the application of Machine Learning, Deep Learning and Advanced Analytics.

solutions, demonstrating your experience in the capabilities and limitations of these tools and platforms.

encapsulated in software applications.

es on project teams.

ps, sprints and prototyping to help identify breakthrough ideas, concepts and solutions to business problems.

rs, business experts and Accenture teams external to the Dock.

nd be able to match the future opportunities of those to the current and emerging challenges of our clients

monstrate the ability to thrive in a fast-paced, highly collaborative work environment.

r our business and our clients.

and Advanced Analytics.

services to develop Analytics & AI experiments and applications.

e AI ecosystem; including AWS, Azure and/or GCP.

elop robust end-to-end Analytics & AI solutions.


range of Advanced Analytics & AI techniques and keeping abreast of new developments in academic research and industry.

ons and experiments and being responsible for their business value and measures of success.

ormance management and professional development of people on your team.

backgrounds.

ng and communicating Analytics & AI concepts, solutions and value to non-technical audiences.

ure the value of the work undertaken on projects.

the Dock, outside immediate projects.

AI at the Dock, both in terms of new concepts, technologies, approaches and methods and in terms of business applications.

or a closely related discipline.

g, NLP, Knowledge Graph or other Artificial Intelligence domains and proven experience in the application of these technologies to add

mercial applications. Industry, product or commercial application experience is preferred.

ment and CRISP-DM.

on quality, accuracy and resilience.

uages (e.g. Python, R, Java, Spark) and 3rd party platforms.


h more than 490,000 people serving clients in more than 120 countries. The Dock at Accenture is a multidisciplinary research and incub

ety and does not discriminate on grounds of race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domes

adding value to Sage Business Cloud Accounting and Financial Management solutions. AutoEntry is one of the fastest growing automa

learning problems.

g senior management.

g techniques and Science and its supporting technologies

n, cross-validation, overfitting, bias, variance, convex optimization, eigenvectors, relational databases, SQL, latency, computational com

approaches to solve them.


n, cross-validation, overfitting, bias, variance, convex optimization, eigenvectors, relational databases, SQL, latency, computational com

titative field.

knowledge across all IT domains


doop, Spark, Scikit-Learn or other ML library, Statistics, CI/CD, coupled to standards including OData, Map/Reduce, Scala/R and/or Pyth

eveloping AI algorithms for solving such complex research challenges leveraging the recent advancements in AI. This position is expecti

G) in the Headquarters, CBG covers smartphones, PC and tablets, wearables, mobile broadband device, family device and device cloud

AI techniques to apply to it.

business units.

experience applying advanced AI techniques to real-world problems.

ttentions, Bayesian Deep Learning) with text or multivariate time-series data.

luding, but not limited to, ICML, NIPS, AISTATS, UAI, AAAI.
ering teams to develop & productionize machine learning algorithms that are the core of Woebot's intelligence. You will work with NLP

e used to guide conversations with our users.

efforts; this includes model training, feature extraction, deploying produced models, data processing, and rigorous A/B testing.
nd production services while helping define our data team's processes and tooling.
ers' natural language input, and generate appropriate responses. Initially you will prioritize the following:

ems so that Woebot continually learns from ongoing conversations to better recognize the sentiment and intent behind users' natural
other natural/unstructured data sources.

ach user. We consider personalization to be key for developing a relationship over time, and delivering precision intervention - that is,
th and understand users. Combining intent classifiers, chit chat, and task-oriented models that help users achieve their goals of feeling
derive insights about individual users, thus allowing it to give personalized feedback to users, such as "Did you realize that you're happ
time and in the right manner to help users gain new insights about themselves.

le nationwide.
atest research to real-world problems.
high-quality data.

or related technical field with 1+ years of applied experience


n mental health
which will develop a strong brand for Bank of Ireland, shaping and delivering a differentiated position for the Bank in the markets in w

d to transform the Bank of Ireland brand: Customer planning across business under one group brand strategy and agreed brand pillar p

ustomers, colleagues and partners across Ireland, UK and Internationally.

n increasing demand for more personalised customer experiences, Customer Analytics is leading the charge to enable and deliver this

of analytical solutions to deliver enhanced customer experience and actionable data insights based on data analysis. The types of proj

ng and customer insights.


nd deliver on commercial objectives.
tion modelling and text analytics.

nd measurement in place.
emi-structured web data, unstructured text data to improve model accuracy.

ng junior members of the team.

mplex problems focused on delivering an enhanced customer experience for our customers.


nalytics or an equivalent quantitative field.

stical and machine-learning methods


rocessing, data/text mining and artificial intelligence and you are able to implement them in business context.
Scikit-learn, H2O.ai, Spark ML.
sity modelling, recommender engines, Multichannel Attribution or Market Mix Modelling.
formats like JSON, flat-files or relational tables and understand how the data is accessed by APIs and web applications.

chnical functions. Exceptional communication skills.

bility for development of Radio products for all Radio Segments, like AAS, Macro, Indoor and High Frequency. The PDU has operations i
our arena. Radio SW provides possibility to work with the latest technology within telecommunication development and offers a rich n

ts following Ericsson common processes for configuration management, continuous integration and continuous deliveries. You will als
and delivery environment according to project requirements.

ation about our build and delivery environment.

The Yocto Project including Poky and BitBake for embedded software on Linux distributions.
tility for implementing and integrating continuous delivery pipelines.
luding version control tools like GIT, Gerrit and ClearCase.
Ruby, Java, C++.
le and link libraries, understand advantages of Shared Libraries.
o work outside your own discipline
build and delivery environment.

goppa Venkatagiri at suma.haregoppa.venkatagiri@ericsson.com .

tions via email. The last day to apply is before 18th May 2020.

s means that you get to leverage our 140+ years of experience and the expertise of more than 95,000 diverse colleagues worldwide. A

utions. As a developer, you will be involved in the development and maintenance of business-critical applications.

ty and perform in accordance with the overall requirements as well as to the customer’s expectations.

fied products and services to customers.


s means that you get to leverage our 140+ years of experience and the expertise of more than 95,000 diverse colleagues worldwide. A

utions. As a Java Full Stack Developer, you will be involved in the development and maintenance of both backend services & web front

ty and perform in accordance with the overall requirements as well as to the customer’s expectations.

nd documentation standards

fied products and services to customers

ore network, connecting mobile devices around the world to the internet. You’ll be joining a small cross-functional development team
aking on the Scrum Master role in the development team, or taking the lead in system analysis, breaking down requirements and desi

s means that you get to leverage our 140+ years of experience and the expertise of more than 95,000 diverse colleagues worldwide. A

that offers services for both fixed and mobile network infrastructures. It offers services such as IP/MPLS edge routing and Evolved Pack
nt teams, and stakeholders to understand customer and product requirements

e made it our business to make a mark. Ericsson Packet Core has never had a greater opportunity to lead change; setting the bar for te

y easy to adopt, easy to use and easy to scale. This demands from all our people the creativity to discover, the accountability to delive

us connected mobile users while handling traffic from IoT to 4K video in Gbit speeds. All of this in a virtual deployment using container

bedded systems executing in a cloud native container environment!

dern facilities. We’re next to Chalmers and the university, with other cutting-edge tech companies as our closest neighbors in a buzzin

ount, gyms in the house, and more.


tools, container/microservices and data communication for 5G. We are in the middle of a containerization/cloud transformation to me

and Scrum. Early customer and partner feedback is a key element in our development process which is based on Continuous Integratio

production and commercial deployment. We use CI/CD (Continuous Integration/Development) flow and philosophy, with pipelines des

erimentation

r development, the used tech might vary. Here is a non-exclusive list of tech we use:
preparing applications and providing services to external customers we are looking for Software Developers to work in our office in Kr

deliver the highest quality product. Our team is full of passionate people building every day a great ecosystem of different kinds of app

SSAS and SSRS with a bit of integration from various applications and systems,

chitecture, story refinement, implementation, successful delivery to production and maintenance),

design objectives,

ETL standards,

stem MultiSport);
rks (RAN). In this role, you will be part of our customer-facing team(s) assembled to maintain and implement our Solutions at our Custo

of a node, network and/or system. Your scope of work could include the scenarios of introduction, upgrade expansion, functionality an

rvice providers. We enable the full value of connectivity by creating game-changing technology and services that are easy to use, adop

nfiguration and integration of a node, network and/or system. Your scope of work could include the scenarios of introduction, upgrade
hedule, Click Mobile, Click Plan, Click Forecast and Click Analyze)

roject lifecycle execution

e, Click Mobile, Click Plan, Click Forecast and Click Analyze)

olutions). Your role will include all development activities such as: requirement analysis, system design, architecture design, hardware d
er in the rapidly changing environment of communications technology – by providing hardware, software, and services to enable the f

nization in Budapest.

e develop and support the Multimedia Telephony Application Server (MTAS) which is a key component of current and future commun

services for multiple types of customers and deployments. Our customer base is global and stretches from tier 1 mobile operators to fi

ms and close collaboration with product management and other technical leaders.

on is located in Budapest/Hungary.

hnical leaders

to understand customer expectations

ed telephony technology, Cloud related technologies


n your experience and preference to work in.

process – the interview.


ation, verification till delivery

good quality

rmance tuning and debug, shell/Python programming


of Ericsson’s Packet Core Business.

emium quality standard wrt. Stability/Robustness/Capacity &Characteristics, in addition, also a key differentiator during Sales activitie

ubernetes, ……

oring, unit/component test, automated testing and continuous delivery.

d Electronics.

ttern, UML, Rose)


in Evolved Packet Gateway (EPG) product development. Our EPG product is a main node in the Evolved Packet Core and supports multi

with Lean/Agile best practices. The team will design, implement and test the feature, product and system from requirements to produc
actoring, unit/component test, automated testing and continuous delivery.

e Engineering or equivalent level of knowledge.

and other packet analysis tools

your ambitions, in alignment with our values of Respect, Professionalism and Perseverance. Ericsson is extremely focused on learning
loyment without regard to race, color, religion, sex, sexual orientation, marital status, pregnancy, parental status, national origin, ethn

employment in every location across the world in which the company has facilities. In addition, Ericsson supports the UN Guiding Princ

ent, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, training and development.

sexual orientation, marital status, pregnancy, parental status, national origin, ethnic background, age, disability, political opinion, soci

data from 5G Core domain, to enable automated control loops for different use cases, thereby creating new revenue streams for the o

ervice technology, with common development framework across portfolio of products, deployable on containers on bare metal or VM

ersonal competence and career.


nuous integration.

ines and in a dynamic working environment.

n and responsible for the delivery of the product.


Engineer IV - Svcs RF
Toronto, Canada
Network Operation and Integration

Job Description
Date: Apr 7, 2020
Location: Mississauga Ontario Canada

Are you a highly detailed person who’s not afraid of digging into the data and p

Our Exciting Opportunity:

We are the Products and Services Readiness team within Market Unit North Am

society and drive solutions to some of our planet’s grea
We are currently looking for a Network Engineer who will perform the role of a F

l have freedom to think big and the support to turn id
As a Feature Test Engineer, you will work in our radio labs, as well as in the cust

You may also be asked to work on automation projects that aim to improve and

ocial transformation for all aspects of society. Machine Le

operations. MI is also a key competence for to enable
You will:

Study the functionality, analyze requirements and specifications in order to des


Execute testing and record test case results
Troubleshoot, document and track defects
Analyze test results and produce feature test reports
Perform lab and field configuration related to feature testing
Prepare and present progress reports
Participate in knowledge sharing activities
Drive and participate in knowledge sharing activities
Lead test activities and mentor new team members
Collect and document customer requirements
Assist in the development of a service proposal
Identify add-on sales opportunities
; and provide prediction explainability and prescriptiv
Travel is expected 70% of the time.

vironment where you communicate and plan tasks and ideas. You will be working on high impact initiatives with other DS in Machine
To be successful in the role, you must have:
Bachelor or Master’s Degree, in Electrical Engineering, Computer Science, Com
4+ years of relevant experience in Telecom Industry, ideally in Radio Domain
Knowledge of Radio access technologies (LTE, NR)
Background in communications theory and digital signal processing
Linux knowledge
Data analysis knowledge
Good command of MS Office, especially MS Excel and PowerPoint
Hands on experience with common tools (TEMS, MapInfo, ACTi, Nemo, ITK, AM
Coding and scripting skills. Languages include: Java, C, C++
Good to have: Python and/or R scripting languages
Good to have: Database access scripting (e.g. SQL)
ng/selecting/testing predictive models, recommendatio
Good to have: Machine Learning

ws and documents
You might also have:

Creative, problem solving, think-out-of-the-box attitude


Organized and efficient way of working
Able to work in a team environment
Able to communicate effectively in English, both orally and in writing
Excellent customer interface skills
h Distinction.
Backend SW Developer
Ottawa, Canada
Product Development
Job Description
Date: Oct 10, 2019
Location: Ottawa, ON

Our Opportunity:

There is nothing average about what we do on the Ericsson team and we aren’t

As a Software Developer, reporting to the Manager of Software Development,

We are now looking for several developers to maintain products (units, nodes,
In this role you would:

Contribute to overall direction within our world-leading development team


Help your team size, plan, and execute development tasks
Design, implement, and test new fault features and capabilities
matrix organization with sensitivity and persistence
Measure and analyze solution performance and scalability to assist with system
Research new software development tools and technologies and guide develop

To be successful in this role you should have:

es will be applied to solve challenging real-world problems


A university degree in computer science, software engineering or a related field
3+ years of software development experience
A good understanding of software development lifecycle
Proven ability to deliver complex software solutions, on-time and on-quality
Capability to understand and propose solutions to new and complex problems
Proven ability to design and implement complex software requirements and sp
Experience designing server architectures and working with technologies and c
Experience in one or more of the following areas would be considered an asset
ng/selecting/testing predictive models, recommendatio
Prior experience in the Telecom network management space is highly desired

ws and documents
Test Environment Scrum Master
Ottawa, Canada
Product Development
Job Description
Date: May 1, 2020
Location: Ottawa

Our Exciting Opportunity:


We are looking for a motivated and hardworking team member to work on stat

h Distinction.

You Will:
ensure the team lives agile values and principles and follows the established pr
plan, execute and deliver code (test cases and common modules), as per agr
work in close collaboration with system architect, product owner and the syste
implement ways of working and communication flows to and from the develop
develop software code in Java for automated execution of radio performance t
plan resources for test system support to verification teams
provide formal documentation of test setup and code modules
help to maintain and improve the development lab environment
contribute to the overall success of the verification team goals
be responsible for competence development and overall team success

You Have:
B.Sc. in Computer or Electrical Engineering (or equivalent)
minimum 5 years SW development experience in object-oriented programming
experience using Java, Eclipse IDE, JCAT, Git or ClearCase
good knowledge of Linux/Unix environment
good understanding of 2G, 3G, 4G and 5G wireless communications technologi
hands on experience in the lab using spectrum analyzers and signal generators

You Might Also Have:


strong social skills are a valuable asset
LabVIEW is nice to have
matrix organization with sensitivity and persistence
good time management, prioritization and problem-solving skills
excellent English communication skills, both written and spoken

Software Architect, Machine Learning


Ottawa, Canada
Product Development
Job Description
Date: Apr 17, 2020
Location: Ottawa Ontario Canada

Ericsson CENX fundamentally changes the way service providers view their netw

The architecture team works with product management, developers, and the b

Our Exciting Opportunity:

The Machine Learning Architect will be primarily responsible for guiding CENX’s
Skills in software design, applied statistics and technical leadership combine to
The CENX Machine Learning Architect will contribute to the full stack of machin

You will:
Design and architect new application components, tools, or core software com
Define, document and communicate architectural decisions
Participate in the entire software development cycle by analyzing, specifying re
Design solutions to complex problems with an emphasis on efficiency, quality, a
Manage risk identification and risk mitigation strategies associated with archite
Constantly research and innovate in bringing the new tools, technologies, soluti
Drive strategy in evolving existing data model to meet demands of new applica
Work with developers daily to provide mentorship, resolve technical issues and

To be successful in the role, you must have:

An undergraduate degree in computer science, software engineering or a relate


Strong software development skills with 10+ years of related development exp
Big data tools like Spark, Kafka, Cassandra, Hadoop, etc.
Hands-on experience with ML algorithms such as Regression, Clustering, Boosti
Hands on experience working with data - data collection, wrangling, data mana
Detailed knowledge in one or more of the following areas: distributed back end
Competence in one or more programming language such as Java, JavaScript, Sc
Object-oriented, functional, or concurrency-oriented programming experienc
Proven ability to analyze and document business requirements and correspond
Experience in SQL and NoSQL databases including data modeling

You might also have:

High level of creativity in developing innovative solutions


Strong technical leadership in guiding the direction and architecture in product
Willingness to learn and ability to demonstrate credibility in work
The ability to maintain a high level of composure with people in all levels of the

DevOps DBA
Montreal, Canada
Product Development
Get Job Alerts
Job Description
Date: Mar 4, 2020
Job title: DevOps / Database administrator

Location: Montreal, QC


(English to follow)
Our stimulating proposition:

We are a large international company and we are looking for an experienced ca

You will be posted in Canada, in Montreal, and you will be part of an internatio

Your tasks:

Architecture, service development / automation (packaging, solution optimizati


Ensure the management of applications and database, monitoring, troubleshoo
Provide support for applications (1st to 3rd lines, use and technique)
Be active within the community, possibly contribute to development / to open

For this position, you must have:

An engineering or computer degree


5 years or more of relevant experience in the following fields:
Linux, including related computing environments
Administration and optimization of relational databases (SQL)
Automation and scripting
Web services (REST/SOAP)
Different aspects and test tools (unit, characteristics, load)
Agile development methodology (Scrum, Scaled agile, Kanban)
Excellent team spirit and problem solving skills with a high level of motivation a

Would be a plus:

Linux application administration, troubleshooting and support


Automated deployment platforms (e.g. Ansible)
Network configuration and load balancing
Monitoring platforms (e.g. Prometheus, Grafana)
Java and Groovy development
Reading, understanding and troubleshooting of Java / JVM applications
DevOps tools: Git, Gerrit, binary management systems (Artifactory, Nexus), Jen
Source version control (Git)
Container/Cloud : Docker, Kubernetes, AWS
Micro-services development and architecture

Network professional - Professional Svcs RF


Montreal, Canada
Network Operation and Integration
Get Job Alerts
Share Job
Job Description
Date: Apr 7, 2020
***** English Version included below *****

Job summary:

We are looking for a network engineer who will be responsible for optimizing th

The role focuses on designing, auditing and optimizing services, for which you w

Main responsibilities:

Identify additional sales opportunities


You will participate in the establishment of a service proposal
Participate in knowledge sharing activities
You will collect and document customer requirements
Find innovation opportunities with the Solutions team
Moderate to significant customer interaction, to ensure correspondence to sco
Highly consultative, with good communication skills
Main contacts: CPM, GSC managers, RF / C & E engineers / managers clients
Key qualifications:

Education: University degree, minimum a Masters degree, in engineering (IT, te


Minimum seven (between 4 and 10) years of experience in RF engineering.
At least three to four years of recent experience in design, establishment and t
Leadership experience in at least five 3G, LTE / VoLTE projects.
On one project, must have been the leader of all large-scale technical activities (large metropolita
Understanding of signaling between different interface designs
Detailed knowledge of the link budget
Have used planning tools such as Aircom Asset3G, Planet EV, Atoll
Detailed knowledge of required inputs and how to create a new project in a plan
Able to perform model tuning
Having used the ACP (Automatic Cell Planning) tool
Able to make recommendations to improve the design of a network (cell plan)
Experience in scrambling code and neighbor planning and tuning
Capable of making recommendations regarding parameter changes, activation
Detailed knowledge of optimization tools such as TEMS Investigation, Actix, Agi
Experience in adjusting a busy network and knowledge of the difference betwe
Detailed knowledge of Layer 3 messaging
Capable of collecting and analyzing test data and performance statistics, and so
Knowledge of how IRAT procedures can affect overall quality of service, for bot
Suggest improvements to tools
Knowing the impact of adding a 2nd / 3rd carrier on optimizing a net
Has the skills required to establish and test RAN functionalities relating to traffi
Knows the relationship between Soft / Softer handovers and network capacity, a
Detailed knowledge of the Ericsson OSS and how to use optimization tools
Detailed knowledge of KPIs and ability to make recommendations to improve t
Detailed knowledge and experience in HSDPA, LTE and VoLTE optimization
Additional skills
Mastery of MapInfo, MS Office
Extensive travel flexibility in the United States
Excellent verbal and written communication skills
Ability to work in a very independent manner, while managing customer reque
Ability to coordinate activities with various Ericsson departments, divisions and
Ability to identify and interpret client needs and develop plans to meet them
Advanced technical knowledge in the field of networks (Radio, Transport, Core
Knowledge of local OSH regulations and group OSH requirements
English skills

Engineer IV - Svcs RF Job stage 5 - Requisition 340242

Job Summary:

We are now looking for a Network Engineer that will be responsible for perform

The role focuses on executing design, audit and optimization of services, where

Key Responsibilities:

Identify add-on sales opportunities


You will assist in the development of a service proposal
Participate in knowledge sharing activities
You will collect and document customer requirements
Drives innovation opportunities with Solutions team
Moderate to significant customer interaction to ensure alignment on scope, tim
Highly consultative with good communication skills
Primary Contacts: CPM, GSC Leads, Customer RF / C & E engineers / managers
Key Qualifications:

Education: Academic degree, minimum on bachelor level, in engineering (IT, Te


Minimum seven (4-10) years of RF engineering experience.
Minimum three to four years of recent experience in radio network design, imp
Past Leadership experience in at least five 3G, LTE / VoLTE projects.
One project must be the leader for all technical activities in a large scale (major
Understanding of signaling in between different interfaces Design
Detailed understanding of the link budget
Have used the planning tools such as Aircom Asset3G, Planet EV, Atoll
Detailed understanding of inputs required, and how to create a new project in
Able to perform model tuning
Have used the ACP (Automatic Cell Planning) tool
Able to make recommendations to improve a network design (cell plan) • Deta
Experience in Scrambling Code and Neighbor planning Tuning
Able to make recommendations in parameter changes, feature activations on d
Detailed understanding of optimization tools such as TEMS investigation, Actix,
Experience in tuning a loaded network, and understand the difference between
Detailed understanding of Layer 3 messaging
Able to collect and analyze drive data and performance statistics, and solve com
Knowledge of how IRAT procedures can affect the overall Service Quality both
Suggests improvements to tools
Understand the impact of adding 2nd carrier / 3rd carrier on the network Optim
Has the competence to implement and experiment on RAN features relative to
Understands the relationship between Soft / Softer Handovers and Network Ca
Detailed knowledge of the Ericsson OSS, and how to operate the optimization t
Detailed knowledge of KPIs and able to make recommendations to improve the
Detailed understanding and experience in HSDPA, LTE & VoLTE optimization Ad
Proficient with MapInfo, MS Office
Flexibility of extensive travel within the US
Excellent verbal and written communication skills
Ability to operate with high degree of independence, while managing project a
Ability to coordinate activities with various Ericsson departments, divisions and
Ability to identify and interpret customer requirements, and develop plans to a
Advanced technical knowledge in the network area of working (Radio, Transpo
Knowledge of local OHS statutory regulations and group OHS requirements
English Skills

Ericsson provides equal employment opportunities (EEO) to all employees and

Ericsson complies with applicable country, state and all local laws governing no

This policy applies to all terms and conditions of employment, including recruiti

Ericsson expressly prohibits any form of workplace harassment based on race, c

Primary country and city: Canada (CA) || || Montreal || ProdMgt

Req ID: 340242

Nokia Data Scientist Internship – Optimization of RRM procedures in 5G Networks: M

Context
Within the Mobile Networks Business Group, in the 5G product architecture de

This internship fits into exploratory projects conducted in our department, and
The considered applications domain is the allocation and the optimization of ra

Role

In this context, the main objective of the internship is to contribute to the intro

This requires fundamental skills in machine learning from model design to impl

It is also necessary to have a good understanding of the implementation constr

The validation of the proposed solutions would consist in assessing the benefit

Your mission will consist of:

1. Getting familiar with the procedures for radio resources allocation in 5G netw

2. Selecting one (or more) specific use cases and identifying the necessary optim

3. Identifying from the state-of-the-art methods, the most suitable candidate m

4. Validating the proposed solution on Machine Learning software platform or o

At the end of this internship, this experience will allow you to reinforce and put

Your theoretical expertise in Machine learning Algorithms,


Your knowledge in mobile radio communications, especially 5G networks,
A methodology for analysis / exploitation of large volumes of data as well as fo
Qualification:

Study: Master, Engineer or PhD with specialization in Machine Learning and/or

Expertise in Machine Learning is required.

Knowledge in wireless radio communications, telecommunication is highly reco


A good level of programming is mandatory: Python (Keras, PyTorch), M

A good level of English is mandatory (oral and written).

Internship - Hard Real Time Software Development (UL-PHY LOKI)

Nokia is a world leader in designing technologies at the heart of our connected

- Integrated into an AGILE team, you will participate in real-time software deve

- We need your ideas to build the Nokia of the future!

Profile BAC + 5 end-of-study internship / Pro Contract / Apprenticeship

• Software development: Linux / C / C++ / Python / JavaScript / Matlab,

• Languages: Operational English,

• Telecom & Network: general knowledge,

• Signal processing: general knowledge


Sense of autonomy and responsibilities, spirit of analysis and synthesis, manage

R & D Engineer
🔍Bangalore, Karnataka, India
📁Applied R&D 💼NSW Nokia Software 20000000RM
Key responsibilities/Job description:
--> Development activities with flavours of UT, NT & functional tests of Nokia R
--> Feature delivery along with handling of legacy topics (issues, customer topi
--> Responsible for the product quality & deliverables in new features, custom
--> Support during customer escalations for Nokia Registers & Legacy HLR/HSS
--> Quick solutions to team for supporting team

Skills required:
--> Hands-on development experience with C++ is mandatory
--> Domain & Protocol knowledge in the areas of UDM, AUSF, HSS, HLR, DIAME
--> Knowledge in Analysis, Design, Development/Testing.
--> Strong analytical and debugging skills.
--> Team Player, Self-motivated and able to work with little supervision
--> Usage of ROBOT, Rammbock or IPSL is mandatory. (for testing)
--> Excellent Interpersonal and Communication skills
--> Good knowledge in Python (mandatory)
--> Knowledge of Go, Java technologies, Python is an added advantag
--> Platforms: Linux

Senior/Principal Scientist, machine learning for visual media


You will be given the following responsibilities (but don't let the list limit you!)
Research and development of visual media technologies by utilizing modern ma
Publication of high quality research results and necessary intellectual property
Technical contribution to the international standardization organizations such a
Proof of concept development and validation of the research results
Collaboration with other teams when they are impressed with your results and
Leading/mentoring other researchers related to your expertise area (spread the
Skills are what you're going to build and help building in our lab! As a starting p

Hands on experience and knowledge on computer vision and/or neural networ


Expert knowledge on training neural networks, unsupervised and supervised le
Applying machine learning on 2D/3D visual data (images or video)
Experience in developing computer vision algorithms (especially on object re-id
Software development skills using Python and C++
Experience in using frameworks such as TensorFlow, Keras, PyTorch and OpenC
Experience in utilizing Neural Processing Units (NPUs) on mobile devices is a plu
Let's now check the qualifications:
A PhD degree (or about to complete) in disciplines such as machine learning, co
5+ (senior) / 10+ (principal) years of relevant work experience (academic work
Experience in mentoring other colleagues is a plus
Excellent written and verbal English language skills
We believe in people and we already know that you are a great person. You pro

Innovative, full of ideas and eager to invent and apply new things
Self-guided and eager to take responsibility
Motivated to learn and explore new ideas and technologies
Be an out of the box thinker and problem solver
Team player and believer in the power of "we" rather than "me"
Looking forward to meeting you!

Media Technologies Research Lab

(Note: This open position is for Tampere or Espoo)

L1 SW C++ Developer
🔍Saint Petersburg, Russian Federation, Russian Federation
📁Applied R&D 💼MN Mobile Networks 1900000KTH
Nokia is a global leader in the technologies that connect people and things. Wit
Serving customers in over 100 countries, our research scientists and engineers
Position for 5G L1 SW development.

Role & Responsibility


L1 SW development for 5G Base Station (on ARM and DSP in C/C++) as part of i
Contribution to specifications and creating documentation for the relevant part
Code re-factoring and optimization, code maintenance, and bug corrections
Requirements
Master's Degree in Computer Science
Relevant 4+ years’ experience
Good C/C++ programming skills
Experienced with DSP programming and Matlab
Knowledge of embedded system aspects, real-time operating systems and the
Good understanding of software system design
Good technical English
Familiarity with Agile methodologies and Scrum
Ideally you have a knowledge of LTE/5G technologies
You don’t need to master everything, but you should already be a specialist on

Technical Communication Tools Developer


Job Description:
This is a full-time software development position that will work in Nokia Software
Responsibilities:
Jointly develop software tools to generate technical documentation from XML s
Work on an agile team conforming to the development team's workflow:
User Story and Task Definition
Unit Test Definition and Execution
Version Control conventions
Code Reviews
Integration Tests
Deployment to Production
Update the status of user stories in our backlog tool (Jira) in a timely fashion
Build tools on top of Nokia Software's DevOps platform (Jenkins, Git, Maven, An
Create and maintain tool documentation in our wiki
Work with end-users to support their migration onto our new Harmonized Pipe
Explore opportunities for automation and streamlining the user experience of o
Location: The candidate should be located in Austin, Texas

Requirements:
Bachelor’s, Master’s, or Doctoral degree in Computer Science, Information T
Project experience with at least 3 of the following programming languages: Jav
Project experience with at least one version control tool, preferably Git
Ability to learn new programming languages and tools quickly
Excellent English communication skills in both writing and speaking
Ability to solve complex problems logically
Strong time management and self-discipline skills
Ability to work well in a team-oriented environment with international colleagu
Ability to cope with tight deadlines
Demonstrates enthusiasm, curiosity, and motivation to learn and try new things
Dedicated team player with innovative spirit

Nokia is an equal opportunity employer that is committed to diversity and inclu

Internship - Mobile Networks A & S intern

As intern, you will belong to the team in charge of specifying and developing a

In an agile team, you will have ownership for:

For this internship the following skills are required:

- Web development (PHP, Javascript, HTML/CSS)



Engineer , 5G L1 integration

General purpose:

Do you want to contribute to 5G as an Integration expert? Are you interested abo


If yes, you might be the person we are looking for.
Our team at Oulu (part of 5G L1 Productization tribe) is responsible for 5G L1 fe

Now we are looking for several highly motivated, enthusiastic and talented futur

You will have state of the art tools and methods available for your job with high
Depending on role you will do integration and testing or prepare your thesis wo

Main Responsibility Areas:

Integration and testing or thesis work.

Responsibilities:

You will contribute to the integration of L1-level 5G features, 5G base station and CI (


used globally for 5G system verification.
elines are all technologies we are using in our daily development practices.
Mandatory Requirements:
Self-drive and drive to learn
Team player with good communication (verbal and written) skills
Decent English skills

Good to have:
Python or C or C++ or Java or bash competence
GIT competence
Jenkins competence
Data Engineer 2

Job Description:

We are looking for interested candidates for a Data Engineer role to not only help us

Key Responsibilities / Functions:

Develops and maintains scalable data pipelines and builds out new API integrati
Working experience with Tableau, QlikView, Mode, Matplotlib, Jupyter, or simi
Extensive experience analyzing data using SQL
Required Minimum Qualifications: (Education, Technical Skills/Knowledge)

2+ years of Python or Java development experience


2+ years of SQL experience (NoSQL experience is a plus)
3+ years of experience with schema design and dimensional data modeling
Ability in managing and communicating data warehouse plans to internal client
3+ years of relevant experience such as implementing statistical analysis, devel
Good communication and writing skills to facilitate productive collaboration wi
Strong knowledge of project management principles and concepts;
Experience solving problems with an emphasis on product development
Experience with predictive modeling and dissemination of research results;

SW Dev 1, MME/SGSN R&D


🔍Naperville, Illinois, United States
📁Applied R&D 💼ION IP Optical Networks 20000003RJ
Nokia is a global leader in the technologies that connect people and things. Wit
Serving customers in over 100 countries, our research scientists and engineers

Business Group Description:

You will be part of the Cloud Mobility Manager (CMM) within the ION Division o

The CMM delivers a converged packet core solution which addresses both evol
Key Job Responsibilities:

Development & prototyping in Virtualization/Cloud Computing including techno

Openstack and other well-known NFV-I platforms, including AWS and Containe
Data analytics/Artificial Intelligence/Machine Learning including a strong knowl
Coding in C/C++
You will need to be:

a self-starter, self-motivated and able to work independently


creative and an analytical thinker with problem solving skills
innovative and eager to learn state-of-the-art technologies
a fast learner; able to gain a lot of complex knowledge in a short time frame
able to keep abreast of evolution in the industry and cloud infrastructures
able to demonstrate excellent technical skills, work efficiently and be responsib
able to communicate/explain technical information/risks and provide status up
able to work well across organizational boundaries, multi-site and multi-cultura
able to keep in mind the customer’s point of view

Qualifications:

Required: Candidates shall have completed their bachelor's/master's degree in C

Skills:

Essential:

Python, C++/C, proven practice/application of Deep Learning Algorithms.

Desired:

TensorFlow, Pandas DataFrames, R, Networking IP Knowledge (IPv6/IPv4), fami


At Nokia, employment decisions are made regardless of race, color, national or

We are an Equal Opportunity Employer and do not discriminate against any em

Junior Engineer, Embedded Software Development in L1 Physical Layer SW R&D

Nokia is an equal opportunity employer that is committed to diversity and inclu

At Nokia, employment decisions are made regardless of race, color, national or

The L1 Downlink Physical Layer team in Ulm is looking for an ambitious embedd

The work assignment will be the Nokia base station and as a member of a L1 sc

planning, designing, implementing, reporting and testing of base station functio


maintenance and monitoring of delivered functionalities to assure best quality
participating and organizing code reviews for the technical area of responsibilit
executing technical activities and projects within own responsibility area
analysing requirements and creating R&D documentation for feature teams
supporting project and program planning
introducing best development practices to daily job
developing specialist knowledge and solving complex engineering problems in d
participating in relevant technical studies

Expectations:

proven track record of experience in C/C++ programming preferably in embedded env


object-oriented design and programming
Linux operating system
test methodology, test automation
experience in scripting languages e.g. Perl, Python
enthusiasm in telecommunication or programming
experience in source code control tools like SVN and GIT
knowledge of SW development methodologies (Agile/Scrum)
good communication in English (both written and spoken) and basic communic
Bachelor's degree in Engineering, Telecommunication, Computer Science or soft
experience required: less than 1 year
Beneficial Experiences:
Software optimization with profiling tools
Experience in embedded SW development, distributed real time programming
knowledge of SW development methodologies (Agile/Scrum)
Telecommunication domain knowledge (Layer 1/ Uplane/ Cplane) in any techno
Additional Requirements:
good interpersonal and teamwork skills and capability to work in international
result oriented
eager to learn
responsible and committed
initiative, creative and analytical
ability to communicate and cooperate with other people in various situations
self-organized

Software Developer, R&D


🔍Dallas, Texas, United States

*********US Citizens or Green Card Holders Preferred for Multiple Positions*********

General Purpose:

We are looking for highly skilled, advanced technical experts to be part of a tea

Main Responsibility Area:

Following an architecture and requirements to design, develop and test softwa

Skills & Qualifications:


0-3+ years relevant professional experience for different level positions.
BS or MS in Computer Engineering or equivalent
Strong 'C', 'C++' skills in embedded systems
Solid understanding of Object-Oriented Analysis and Design methodologies
Experience with scripting languages
Experience developing highly distributed, highly scalable, carrier-grade softwar
Ability to consider the long-term maintainability of the software created and th
Strong knowledge of real time operating systems
Knowledge of Continuous Integration and Continuous delivery methodologies a
Solid experience in test automation methodologies and tools
Experience with working in scrum team using agile methods
Excellent problem solving and debugging skills which may require making labor

Self-motivated and proactive thinking


Ability to work well independently and within a team
Good communication skills, ability to build working relationships with engineer
Motivated by technical challenges, with a strong desire to deliver quality produ
Good written and oral skills with excellent interpersonal and presentation skills
Ability to travel internationally if needed (travel overall expected to be less than

Test Engineer Trainee - RF Sharing:


Role description

You will join Applied R&D SRAN (Single Radio Access Network) team within Nok

Your mission

Be part of the testing team for delivering the newest technology and features e
Your main responsibilities will be
To analyse system concepts and complex product and system features;
Test environment building, integration and maintenance;
According to project assignment, to create automated test suites and scripts fo
To execute manual or automated test cases against real equipment and to repo
To write defect reports and verify corrections;
To collaborate with the stakeholders (system specification team & SW develope
To act with independence and discretion in routine matters.

What does Nokia offer you?

Competitive salary based on your experience


Christmas and Easter bonuses
Interaction with international teams of software developers, architects, project
Visibility on latest configuration control technologies, 5G innovating mobile net
E-learning platforms such as NokiaEDU, Harvard ManageMentor, LinkedIn Lear
Access to Nokia Garage innovation space where you can express your technical
Various certifications
Events and conferences on technical matters but also on well-being at work an
Flexible schedule to encourage you to pursue your university studies in parallel
24+ paid days-off per year
Sports, Wellness and Culture reimbursement
Bookster
Public Transport/Private Parking allowance or bike/electrical scooter reimburse
Meal tickets + extra food allowance
Private health and life insurance
Kindergarten/after-school reimbursement
Relocation package for Timisoara non-residents
One of the highest employee referral bonuses on the market so you can bring y

Training

A customized training plan will be proposed at your arrival. During the first mon
Your qualifications

Do not hesitate to apply, if you are a student who:


Is currently enrolled in a Bachelor/Master’s university degree in Electronics and
Has basic knowledge of telecom, Linux and automation;
Is fluent in English (both in written and oral communication);
Seeks to learn continuously and fast, in an ambitious and resourceful way.

If you want to take part in the 5G adventure that will shape the future, join us.

Software Development Engineer


🔍Timisoara, Romania, Romania

Role description

You will join Research and Development 5G Software team, responsible for dev

Software Development Mission

You will contribute to the development of software functionalities in our base s

In general, in 5G positions, our days are composed of:

20% specification and innovation

40% development

30% interface with internal colleagues

10% up to us to build it together

100% FUN!

Team Description

You will be part of a Development Unit (around 1300+) among 11 R&D Tribes in

Latest achievements
Since the beginning of the race towards first position in 5G market, our team ha

What does Nokia offer you?

Competitive salary based on your experience


Performance, Christmas and Easter bonuses
Interaction with international teams of software developers, architects, project
Visibility on latest configuration control technologies, 5G innovating mobile net
E-learning platforms such as NokiaEDU, Harvard ManageMentor, LinkedIn Lear
Access to Nokia Garage innovation space where you can express your technical
Various certifications
Events and conferences on technical matters but also on well-being at work an
Flexible time and teleworking
24+ paid days-off
Sports, Wellness and Culture reimbursement
Bookster
Public Transport/Private Parking allowance or bike/electrical scooter reimburse
Meal tickets + extra food allowance
Private health and life insurance
Kindergarten/after-school reimbursement
Relocation package for Timisoara non-residents
One of the highest employee referral bonuses on the market so you can bring y

Training

A customized training plan will be proposed at your arrival. During the first mon

Qualifications
Key must-haves:

You can prove your knowledge on object-oriented programming (OOP) principl


Equally important, you are dedicated to learning, welcome feedback, show flex
Nice to have:

Knowledge on Agile methodology, wireless technologies, cloud-computing platf


We are mainly looking for experience in software development using C++ and t

If you want to take part in this adventure that will shape the future, join us. Ap

If you have any questions, reach out via e-mail to recrutare@nokia.com or to A


Engineer, RF simulations
🔍Oulu, Finland, Finland

We have an open position in the area of advanced RF simulation techniques and met

The work area combines RF and system design tasks as well as RF measuremen

This gives good opportunity to develop wide competences on RF engineering b

The tasks require capability to communicate efficiently with suppliers and proje

Main Responsibility Area

Your tasks include

Evaluation of new design tools and supporting taking them in wide use within R
RF design tasks and HW development
Documentation
Participation in RF measurements to close loop from verification to simulations

Qualifications

BSc/MSc degree in electrical engineering or similar


Knowledge of RF, radio/ telecommunication systems and signal processing
Knowledge of RF simulation or HW design tools
Experience of RF/electronics measurements
Some SW experience
Ability to work independently and in teams as well as in a multicultural environ
Excellent communication skills in English (written and spoken)
Fast learning and interest to work on various topics
Active and positive person with hands-on attitude


Data Science, NLP Specialist
Date: Apr 25, 2020

Location: Montreal, QC, CA

Req Id: 250773

At Bell, we do more than build world-class networks, develop innovative servic

If you’re ready to bring game-changing ideas to life and join a community that v

Bell’s forward-thinking Customer Operations team is creating the ultimate serv

Bell’s Business Intelligence team is responsible for the management and optimi

Responsibilities

Lead the development of machine learning products and models from inceptio
Explore new data sources to uncover new business opportunities at all levels of
Identify areas for ML/AI opportunities and demonstrate to internal clients how
Build and implement strategies for ML-driven projects
Work with partners within Customer Operations and across Bell to make data-d
Work with and present to all management levels
Maintain and expand your knowledge of ML/AI and current technology through

Core Skills

Algorithms
Advanced knowledge of ML models: deep learning, reinforcement learning, NLP
Hands-on experience and expertise with different AI/ML frameworks such as Ke
Stay abreast of new technology and techniques in the ML/AI space
Coding
Advanced Python development skills
Experience in other programming languages such as Scala, C, C++, Java, Shell
Excellent code design (OOP, Algorithms, and Data Structures)
h and industry.
Experience with CI/CD pipelines
Data
Understanding RDBMS, Distributed, and NoSQL databases
Proficiency in SQL
Understanding of Spark and MapReduce
Quick learner with ability to think out of the box

Bilingualism is an asset (English and French); adequate knowledge of French is r

Additional Information:

Position Type: Management


Job Status: Regular - Full Time
Job Location: Canada : Ontario : Ottawa || Canada : Ontario : Toronto || Canad
Application Deadline: 05/31/2020

Please apply directly online to be considered for this role. Applications through

ess applications.
At Bell, we don’t just accept difference - we celebrate it. We’re committed to fo

these technologies to address real-world business pr
Accommodations are available on request for candidates taking part in all aspe

Created: Canada, QC, Montreal

Java Developer

The role
Java Developer required to perform strong coding throughout development of

Responsibilities:

Key experience required


-Java 9-11
iplinary research and incubation hub, as an accelerator fo
citizenship, marital, domestic or civil partnership status
e fastest growing automation software businesses in the market. Its intelligent technology eliminates the pain point of data entry for a
Seniority Level
Mid-Senior level

Industry
Information Technology & Services Computer & Network Security Information

tency, computational complexity, sparse matrices, feature engineering, clustering.


atency, computational complexity, sparse matrices.

educe, Scala/R and/or Python

AI. This position is expecting to have a strong AI background with deep AI algorithmic knowledge such as anomaly detection, reinforce

ily device and device cloud service, and is the second largest smartphone manufacturer in the world. Huawei Consumer BG is dedicate
ce. You will work with NLP to create a best-in-class conversational engine, manage data labeling teams to ensure high quality training

gorously A/B testing.

tent behind users' natural language inputs.

sion intervention - that is, methods that are tailored to each user.
hieve their goals of feeling happier while also feeling natural and conversational.
ou realize that you're happiest on Sundays, and least happy on Tuesdays?"
e Bank in the markets in which we operate. This will enable us to shape and deliver on our purpose to enable customers, colleagues an

y and agreed brand pillar priorities, Deep brand & customer insight, Consistent & compelling customer communication across all touch

to enable and deliver this personalised experience through data analytics and the application of machine learning and artificial intellig

analysis. The types of projects involved in achieving this vision include leading the development of recommendation engines, building
. The PDU has operations in Kista (KI), Gothenburg (LN), Beijing (BJ), Nanjing (NJ), Lund (LD), Ottawa (OT) and Chengdu (CH). PDU Radio
opment and offers a rich number of opportunities in an everyday learning, creative and challenging atmosphere. With an agile way of

ous deliveries. You will also be responsible for developing and maintaining our Radio SW build and delivery environment including nee
e colleagues worldwide. As part of our team, you will help solve some of society´s most complicated challenges, enabling you to be ‘th

kend services & web frontends applications.

ctional development team working with feature development, where the tasks include requirement analysis, system design, developm
wn requirements and designing feature behavior. Experience from either of these roles is a plus, but not necessary.

e colleagues worldwide. As part of our team, you will help solve some of society´s most complicated challenges, enabling you to be ‘th

e routing and Evolved Packet Gateway functionalities. SSR enables complete network convergence so subscribers can access services f
ange; setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable and connected world.

he accountability to deliver and the courage to remove complexity wherever it presents itself. Your commitment to these qualities will

eployment using containers.

osest neighbors in a buzzing area. There are plenty of opportunities for learning and networking at Lindholmen Science Park, and there
loud transformation to meet the 5G journey.

d on Continuous Integration SW practices. We rely on team collective responsibility to finish the tasks according to a prioritized backlo

osophy, with pipelines designed for as-a-Service and to enable DevOps, with fast feedback loops. We believe in test driven developme
to work in our office in Kraków. Our employees in ADM department work for global and local projects building relationships with busi

m of different kind of applications using considerable number of technologies.


our Solutions at our Customers. The position will be based in Dalian, China.

expansion, functionality and capacity. You will work in a diverse multi-national team, and working experience from a telecom operator

that are easy to use, adopt, and scale, making our customers successful in a fully connected world. Headquartered in Stockholm, Swed

s of introduction, upgrade expansion, functionality and capacity. Your work will in part form our customer legacy. Good hands-on expe
tecture design, hardware design, software design, integration, verification, simulations, tools design, Product Lifecycle Management su
. The PDU has operations in Kista (KI), Gothenburg (LN), Beijing (BJ), Nanjing (NJ), Lund (LD), Ottawa (OT) and Chengdu (CH). PDU Radio
opment and offers a rich number of opportunities in an everyday learning, creative and challenging atmosphere. With an agile way of

ous deliveries. You will also be responsible for developing and maintaining our Radio SW build and delivery environment including nee
nd services to enable the full value of connectivity.

urrent and future communication services solutions across the globe. Our portfolio has strong market traction and we are getting into n

er 1 mobile operators to fixed replacements and cable operators to also include the enterprise market. Our goal is to provide a system
tiator during Sales activities (e.g. Evaluation & Benchmark testing)
et Core and supports multi-access, GSM, WCDMA, LTE, 5G and interworks with Wi-Fi and CDMA with seamless transitions between th

m requirements to production and commercial deployment.

emely focused on learning and development, supports mobility and flexible working hours. We are also committed to diversity and inc
tatus, national origin, ethnic background, age, disability, political opinion, social status, veteran status, union membership or genetics.

ports the UN Guiding Principles for Business and Human Rights and the United Nations Global Compact.

ility, political opinion, social status, veteran status, union membership or genetic information.

w revenue streams for the operator.

iners on bare metal or VMs and fully stateless with separation of business logic and data storage.
digging into the data and problem solving? Do you love the border between development and product? Ericsson is this place, and we

thin Market Unit North America. We lead testing and introduction of new Radio technology, such as IoT and 5G, for North American te

will perform the role of a Feature Test Engineer.

labs, as well as in the customer live networks. You will gain familiarity with the Ericsson radio products (SW and HW) through product

ts that aim to improve and accelerate the testing process.

ecifications in order to design and document test cases

with other DS in Machine Intelligence to drive growth and economic profitability for Ericsson and its customers by accelerating curren
g, Computer Science, Computer Engineering or related field
ideally in Radio Domain

nal processing

d PowerPoint
pInfo, ACTi, Nemo, ITK, AMOS, QXDM, etc.)

tten in English

icsson team and we aren’t looking for average people, if you are exceptional, then you will fit right in. To succeed you must appreciate

f Software Development, you will be responsible for the design and implementation of scalable, high-performance, fault tolerant capa

in products (units, nodes, networks, systems and solutions). Your role will include all development activities such as: requirement anal
ng development team

ability to assist with systems engineering and application deployments


nologies and guide development by identifying and implementing approved new software tools and/or technologies in compliance with

gineering or a related field

on-time and on-quality


w and complex problems
ware requirements and specifications
ng with technologies and concepts such as Clojure, Java, Scala, Python, Apache Spark, Apache Cassandra, unit testing, test automation frame
uld be considered an asset: streaming and processing of big data, performance and scalability, OSS Software design and implementatio
nt space is highly desired

m member to work on state-of-the-art multi-standard radio equipment for global deployment by leading mobile operators. Your role w

follows the established processes and practices


mmon modules), as per agreed achievements
oduct owner and the system and planning
s to and from the development team towards other development XFT and partners
on of radio performance test cases
nvironment

er all team success

ect-oriented programming

ommunications technologies
zers and signal generators is an asset

olving skills

e providers view their networks. As leading provider of network and service operations software solutions, our product ingests all of a

ent, developers, and the broader Ericsson R&D community to shape and guide our overall development strategy and focus. We bridge

ponsible for guiding CENX’s machine learning strategy just as our customers are beginning to adopt Big Data and Machine Learning tech
cal leadership combine to form a superpower that can deliver revolutionary solutions to a key global industry. Ericsson works with mo
to the full stack of machine learning software, from platform design and specification to data exploration and algorithm tuning. The s
ools, or core software components and provide technical direction to the team.

by analyzing, specifying requirements, designing, and developing new tools, product features or platform core features.
asis on efficiency, quality, and simplicity. Consistently deliver performant and high-quality software in an agile fashion
ies associated with architecture
w tools, technologies, solutions, ideas and frameworks to the forefront.
t demands of new application domains
esolve technical issues and hurdles, and implement features.

are engineering or a related field. Master’s or PhD preferred


related development experience

gression, Clustering, Boosting, SVM, deep-learning/reinforcement learning, etc.


tion, wrangling, data management data pipeline design and implementation
areas: distributed back end systems, big data, cloud native systems, software design & implementation
such as Java, JavaScript, Scala, Python, Scheme, Clojure, C++
ed programming experience
uirements and corresponding technical solution
ta modeling

nd architecture in product solutions


bility in work
h people in all levels of the organization in the face of significant challenge or ambiguity.
king for an experienced candidate in DevOps and database administration, with solid knowledge and excellent skills, capable of identi

ill be part of an international team with members in different countries.

kaging, solution optimization, automated development, test automation, monitoring and analysis, HA, cluster) and integration into a l
e, monitoring, troubleshooting (in depth, up to the level of Java codes)
and technique)
to development / to open source communities.

a high level of motivation and initiative.

/ JVM applications
ms (Artifactory, Nexus), Jenkins
esponsible for optimizing the design and auditing of a network to meet customer requirements. The position relates to the design and

g services, for which you will be responsible for part of the solution and the service process. Therefore, you will need to comply with E

ure correspondence to scope, deadlines and deliverables

eers / managers clients

gree, in engineering (IT, telecommunications) or


nce in RF engineering.
esign, establishment and tuning / optimization of radio networks on 3G and LTE systems.

ctivities (large metropolitan area or state-wide geographic area), and responsibilities must include technical organization and work sup

anet EV, Atoll


ate a new project in a planning tool

gn of a network (cell plan) • Detailed knowledge of Monte Carlo analysis.

meter changes, activation of functionalities on data from tests, in order to improve system performance.
MS Investigation, Actix, Agilent,
ge of the difference between artificial and real traffic loading
formance statistics, and solving complex problems
l quality of service, for both PS and CS traffic

erator on optimizing a network


tionalities relating to traffic analysis
rs and network capacity, and how to optimize both.
optimization tools
mmendations to improve them • Know how to improve the uplink setting
nd VoLTE optimization

managing customer requests and site needs


departments, divisions and companies
elop plans to meet them
ks (Radio, Transport, Core or Wireline)
equirements

be responsible for performing design optimization and audit of a network to meet customer requirements. The position is applicable f

mization of services, where you will be accountable for part of the solution and of the service process. Hereby, you should be able to ke

re alignment on scope, timelines, and deliverables

& E engineers / managers

evel, in engineering (IT, Telecom) or

radio network design, implementation and tuning / optimization on 3G & LTE systems.
oLTE projects.
ties in a large scale (major metropolitan area or state wide geographic area) and responsibilities must include technical organization a
faces Design
G, Planet EV, Atoll
to create a new project in a planning tool

k design (cell plan) • Detailed understanding of Monte Carlo analysis.

es, feature activations on drive test data to improve the system performance.
TEMS investigation, Actix, Agilent,
nd the difference between artificial and real traffic loading

ce statistics, and solve complex issues


overall Service Quality both for PS and CS traffic

rrier on the network Optimization


n RAN features relative to traffic analysis
Handovers and Network Capacity, and how to optimize both.
operate the optimization tools
mendations to improve them • Understand how enhanced uplink tuning is performed
E & VoLTE optimization
Additional Skills

while managing project and customer expectations


departments, divisions and companies
nts, and develop plans to address them
f working (Radio, Transport, Core or Wireline)
oup OHS requirements

EEO) to all employees and applicants for employment without regard to race, color, religion, sex, sexual orientation, marital status, pre

all local laws governing nondiscrimination in employment in every location across the world in which the company has facilities. In add

loyment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, trai

arassment based on race, color, religion, sex, sexual orientation, marital status, pregnancy, parental status, national origin, ethnic back

l || ProdMgt

edures in 5G Networks: Machine learning based approaches


G product architecture department, we are implementing solutions based on Artificial Intelligence approaches, in order to optimize th

ed in our department, and aiming at the introduction and the implementation of machine learning techniques in 5G networks.
and the optimization of radio resources for Multi Users massive MIMO systems in 5G. Beamforming, scheduling, traffic load balancing

s to contribute in the introduction of machine learning algorithms in radio resource management procedures for 5G networks.

rom model design to implementation and a good know-how of alternative ML approaches (such as reinforcement, supervised, unsupe

he implementation constraints in the products, such as inference latency, availability of data, in order to select the learning method ta

st in assessing the benefit of the machine learning based approaches compared to the conventional methods. This validation will be pe

urces allocation in 5G networks

tifying the necessary optimizations taking into account the RAN constraints

most suitable candidate machine learning methods, to solve the problem.

ing software platform or on Nokia HW platform.

w you to reinforce and put into practice:

pecially 5G networks,
umes of data as well as for validation of machine learning algorithms.

Machine Learning and/or telecommunications, wireless networks or information theory.

mmunication is highly recommended.


ython (Keras, PyTorch), Matlab, C++.

UL-PHY LOKI)

he heart of our connected world. With the research and innovation capabilities of Nokia Bell Labs, we provide communications service

n real-time software developments for 4G / 5G base stations (Level 1 software layer).

t / Apprenticeship

avaScript / Matlab,
ysis and synthesis, management of priorities, teamwork, curiosity, enthusiasm, open to others, communication.

functional tests of Nokia Registers (3G, 4G, 5G) network element(s)


pics (issues, customer topics) of Nokia Registers & work on continuous delivery/continuous integration
s in new features, customer issues
egisters & Legacy HLR/HSS product deployments

ereience with C++


M, AUSF, HSS, HLR, DIAMETER, IMS, VoIP is mandatory

th little supervision
y. (for testing)

thon is an added advantage. (for development)

on't let the list limit you!) :


gies by utilizing modern machine learning concepts. Main application areas are computer vision and neural network compression.
sary intellectual property generation. (Submissions to world-class conferences and participation is highly encouraged!)
zation organizations such as MPEG, 3GPP and ETSI.
research results
ssed with your results and want to take your tech further into their product (well done!)
expertise area (spread the knowledge!)
g in our lab! As a starting point, here are some requirements that we expect from you:

sion and/or neural network compression


pervised and supervised learning, reinforcement learning, generative and discriminative models, deep learning techniques, data pre-pr
ges or video)
(especially on object re-identification, tracking, activity recognition, depth sensing, semantic segmentation, anomaly detection)

Keras, PyTorch and OpenCV


) on mobile devices is a plus

ch as machine learning, computer vision and data science is desired. If you have an MSc and a lot of experience in the topics we are in
perience (academic work and R&D project experience within your PhD duration is usually considered)

are a great person. You probably have a blend of these personal attributes, which we value:

y new things

r than "me"

ect people and things. With state-of-the-art software, hardware and services for any type of network, Nokia is uniquely positioned to h
h scientists and engineers continue to invent and accelerate new technologies that will increasingly transform the way people and thin

d DSP in C/C++) as part of international team


tation for the relevant part of the software
ce, and bug corrections

perating systems and the Linux/Unix environment

already be a specialist on some of the above-mentioned topics. And finally, we are looking for an ambition to further develop as 5G L1 So

will work in Nokia Software Technical Communication. Nokia Software Technical Communication is a global team of technical commun
documentation from XML sources
nt team's workflow:

Jira) in a timely fashion


m (Jenkins, Git, Maven, Ant, Docker)

our new Harmonized Pipeline


ng the user experience of our tools

uter Science, Information Technology or equivalent disciplines, with at least 1 year student status from the time of hire.
ogramming languages: Java, XSLT, JavaScript, Python, SQL
ool, preferably Git

g and speaking

with international colleagues

o learn and try new things

itted to diversity and inclusion. At Nokia, employment decisions are made regardless of race, color, national or ethnic origin, religion, g

ecifying and developing a web application supporting the 5G Node B information model. Interacting with stakeholders, you will propos
rt? Are you interested in gNB integration, test script development, test automation or SW development?

) is responsible for 5G L1 feature integration and verification including Radio units.

usiastic and talented future engineers to join our integration team.

able for your job with highly motivated and experienced team.
g or prepare your thesis work.

s, 5G base station and CI (continuous integration automation environment). As a result our customers will get fully functional systems

-driven cars.

written) skills
gineer to not only help us build data pipelines to efficiently and reliably move data across systems but also to build the next generation

builds out new API integrations to support continuing increases in data volume and complexity.
Matplotlib, Jupyter, or similar data visualization tools

ical Skills/Knowledge)

nsional data modeling


use plans to internal clients
g statistical analysis, developing cloud-based data lakes / data warehouses, managing data science projects, developing APIs, developin
roductive collaboration with other team members and business units;
and concepts;
oduct development
on of research results;

ect people and things. With state-of-the-art software, hardware and services for any type of network, Nokia is uniquely positioned to h
h scientists and engineers continue to invent and accelerate new technologies that will increasingly transform the way people and thin

M) within the ION Division of Nokia.

which addresses both evolution scenarios: 5G (AMF), 4G LTE overlay today with 2G/3G consolidation, or 2G/3G renewal today with fut
omputing including technologies like:

cluding AWS and Container technologies


g including a strong knowledge in Python & Deep Learning Algorithms.

e in a short time frame


cloud infrastructures
fficiently and be responsible for completing own code quality and/or testing activities given provided requirements
isks and provide status updates on assigned tasks
multi-site and multi-cultural teams

helors/masters degree in Computer Science, Computer Engineering, Electrical Engineering or Information Technology (with focus Artifi

earning Algorithms.

nowledge (IPv6/IPv4), familiar with Linux, Ansible, KSH, HTTP2.0, XML, PHP, YAML, JSON, Web Design (JavaScript, Ajax, JQuery), SQL (m
of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disabilit

scriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orienta

n L1 Physical Layer SW R&D

itted to diversity and inclusion.

of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disabilit

g for an ambitious embedded software development expert who will work in engineering role. You will be part of a scrum team respon

nd as a member of a L1 scrum team, you will be responsible for:

ting of base station functionalities as a member of a scrum team


ties to assure best quality
hnical area of responsibility
n responsibility area
ation for feature teams

engineering problems in different technology areas

eferably in embedded environments

oken) and basic communication in German (level A2)


n, Computer Science or software Technology or equivalent education
ed real time programming or DSP SW programming would be beneficial

ane/ Cplane) in any technology (2G/3G/4G/5G)

y to work in international multi-site environment

ople in various situations

ed for Multiple

experts to be part of a team that is responsible for embedded software development for Cellular Radio Modules. You will design and i

n, develop and test software for the new radio base stations. Participation in analysis and solving complex engineering problems. Parti

ent level positions.

Design methodologies

able, carrier-grade software products


he software created and the customer impact of any proposed changes to software already released.

s delivery methodologies and tools

may require making laboratory measurements and/or interacting closely with the hardware design team in order to observe the actua

elationships with engineers within and outside of the department.


re to deliver quality products on-time.
nal and presentation skills.
all expected to be less than 10% of the time)

Network) team within Nokia’s biggest business group Mobile Networks, responsible for testing software & hardware functionalities for

technology and features expected by customers all over the world.

d system features;

d test suites and scripts for relevant test tools;


eal equipment and to report test results into tools;

ation team & SW developers) and support other testers and developers;

elopers, architects, project managers, testers, integrators etc.


, 5G innovating mobile networks, Cloud, shared testing resources
ageMentor, LinkedIn Learning etc. at your disposal for technical training and personal development
can express your technical creativity and implement new projects with support from local community

o on well-being at work and other general-purpose subjects


niversity studies in parallel and a compensation & benefits package to boost your student budget

ectrical scooter reimbursement

market so you can bring your friends too :)

rrival. During the first months at Nokia you will have support from your manager, team and dedicated buddy and the opportunity to g
nd others. The position will involve taking these skills and applying them to some of the most exciting and massive data and analytics p
y degree in Electronics and Telecommunications or Computer Science;

ous and resourceful way.

shape the future, join us. Apply now!

team, responsible for developing software functionalities for the Nokia 5G base stations, ready to be delivered to customers all over t

unctionalities in our base station product and be involved in the complete development cycle (specifications, coding, testing).

+) among 11 R&D Tribes in different countries & continents. Within your Tribe in Timisoara, you will belong to one of the Squads, each
in 5G market, our team has full responsibility for 5G Cloud Control Unit in Timisoara (Requirements, Architecture and Design) and suc

elopers, architects, project managers, testers, integrators etc.


, 5G innovating mobile networks, Cloud, shared testing resources
ageMentor, LinkedIn Learning etc. at your disposal for technical training and personal development
can express your technical creativity and implement new projects with support from local community

o on well-being at work and other general-purpose subjects

ectrical scooter reimbursement

market so you can bring your friends too :)

rrival. During the first months at Nokia you will have support from your manager, team and dedicated buddy and the opportunity to g

ogramming (OOP) principles and communicate in English fluently.


come feedback, show flexibility and take initiative.

gies, cloud-computing platforms (OpenStack, Google Kubernetes, Docker, VM).


velopment using C++ and the latest C++ standards, but familiarity with any OOP programming language (Java, C#, etc.) is a plus.

ape the future, join us. Apply now!

rutare@nokia.com or to Andra Rugea on LinkedIn.


ation techniques and methods development. This position is in the RF Platforms Development organization and is located in Oulu.

as well as RF measurements with evaluation of new tools and methodologies targeting design and verification flow optimization. Th

ences on RF engineering but also requires willingness and capability to learn new.

erts with focus on medium-term to long-term issues. The Lab will work closely with an open innovative ecosystem with Huawei Europe

ly with suppliers and project teams which you may support in taking new tools and methods into use after evaluation.

them in wide use within RF project teams.

verification to simulations

and signal processing

s in a multicultural environment
develop innovative services and create original multiplatform media content – we’re revolutionizing how Canadians communicate.

nd join a community that values bold ideas, professional growth and employee wellness, we want you on the Bell team.

creating the ultimate service experience for our residential, wireless and small business consumers. We lead strategic development an

e management and optimization of BI systems used to analyze customer behavior, automate business insight processes target marketi

and models from inception to production


pportunities at all levels of the business (strategic to operational)
ate to internal clients how ML/AI can improve their business

across Bell to make data-driven business decisions

urrent technology through training opportunities, conferences, etc.

einforcement learning, NLP, and others


ML frameworks such as Keras, PyTorch, TensorFlow, SparkML, Scikit-Learn
e ML/AI space

C++, Java, Shell


e knowledge of French is required for positions in Quebec.

Ontario : Toronto || Canada : Quebec : Montreal

role. Applications through email will not be accepted.

e it. We’re committed to fostering an inclusive, equitable, and accessible workplace where every team member feels valued, respected

ates taking part in all aspects of the selection process. For a confidential inquiry, simply email your recruiter directly or recruitment@b

roughout development of our products and collaborate with multiple teams across the organization. This person will greatly influence
in point of data entry for accountants, bookkeepers and businesses, so they can spend time on the things that really matter to their bu

work Security Information Services


omaly detection, reinforcement learning, supervised and un-supervised learning algorithms. For researching and developing scalable A

i Consumer BG is dedicated to delivering the latest technologies to consumers and sharing the happiness of technological advances w
nsure high quality training data and pipelines, enable Woebot to deliver meaningful insights to users at scale, personalize Woebot's con
e customers, colleagues and communities to thrive and to build the National Champion Bank in Ireland.

munication across all touchpoints, Data driven marketing, Customer experience insight & innovation design, Integrated Technology tra

arning and artificial intelligence. To deliver on this vision the Customer Analytics team leverage internal and external data and apply da

endation engines, building and deploying predictive models and the commercial activation of business insights. This role will include lea
Chengdu (CH). PDU Radio Products consists of approximately 3000 R&D professionals and development partners, developing, suppor
here. With an agile way of working we develop 5G, LTE, WCDMA, GSM Network solutions to operators all over the world.

environment including needed SW CM tools. The position also requires that you work pro-actively with aligning our build and delivery e
ges, enabling you to be ‘the person that did that.’ We’ve never had a greater opportunity to inspire change; setting the bar for technol

, system design, development, and verification. As a team, you take responsibility for your feature from analysis to delivery and beyon
ges, enabling you to be ‘the person that did that.’ We’ve never had a greater opportunity to inspire change; setting the bar for technol

ibers can access services from any devices or locations. Ericsson Hungary is a main contributor in the development of the SSR product
nd connected world.

ment to these qualities will always be encouraged, and never go unnoticed. As a team, we are helping to tackle some of society´s most

en Science Park, and there is a wide range of restaurants for your convenience.
ding to a prioritized backlog. Our teams are multi-functional, self-organizing and highly independent. You’ll be working directly with yo

e in test driven development (TDD) and therefore competence in test case development and execution is valuable.
ng relationships with business and co-workers within international environment.
e from a telecom operator, telecom vendor, BSS/OSS vendor, consultancy or other professional services team is a plus.

artered in Stockholm, Sweden, Ericsson is proud of its global presence across 100+ countries and market areas. With a strong focus on

egacy.Good hands-on experience in Configuring ClickSoftware Service Optimization Suite ( Click Schedule, Click Mobile, Click Plan, Click
t Lifecycle Management support and product documentation. Our focus is on Lean and Agile ways of working. We prioritize in multi-fu
n and we are getting into new markets, expanding existing deployments, enhancing the feature set and transforming our portfolio from

goal is to provide a system that is in the first line of the communication service evolution at the same time as it can be integrated and i
ess transitions between the access types. We believe it offers the best performance for our customers on the market today. We are loo

mitted to diversity and inclusion and to be a responsible and relevant driver of positive change. We also offer some awesome benefits,
membership or genetics.
csson is this place, and we have an exciting opportunity for you in a fast-paced, highly collaborative technical environment.

5G, for North American telecom operators.

and HW) through product testing, as well as with the Ericsson product acceptance process for our large customers in North America. Y

ers by accelerating current Ericsson offerings. Your contribution will also help to create new offerings in the areas of MI driven 4G and
cceed you must appreciate that everything matters; every feature, every team member, and every user. We rely on innovative thinking

mance, fault tolerant capabilities. You will apply analytics principles to massive data sets, to unearth crucial insights into the impacts o

such as: requirement analysis, system design, architecture design, hardware design, software design, integration, verification, simulati
nologies in compliance with industry best practices

ng, test automation frameworks


design and implementation, databases, operating systems

obile operators. Your role would primarily involve leading software development team and code development to support the radio per
our product ingests all of an operator’s network data, across multiple domains and physical and virtual infrastructure. Harnessing the p

tegy and focus. We bridge the gap between business requirements and technology in both directions, constructing solutions to meet

and Machine Learning techniques into the operation of their networks. To this day much network operation is a manual affair, execut
y. Ericsson works with mobile vendors across the world, large and small. Our data scientists have access to rich data sets with years o
nd algorithm tuning. The successful candidate will connect to business stakeholders, local development teams and the broader Ericsso
ent skills, capable of identifying and solving complex problems requiring in-depth analysis by creation, adaptation or use of appropriate

er) and integration into a large-scale IT environment.


relates to the design and optimization of RAN, BBA, Transmission, Core, OM and Services networks. You will be involved in the whole

will need to comply with Ericsson's requirements for time, performance and quality, as set out in customer contracts.

organization and work supervision, without forgetting to demonstrate an excellent level of communication with the team, the site ma
The position is applicable for design & optimization of RAN, BBA, Transmission, Core, OM and Services Networks. You will be engaged i

y, you should be able to keep time, performance and quality according to Ericsson requirements and customer contracts.

e technical organization and oversight of work and demonstrated excellence in communications with team, project manager, and cust
ntation, marital status, pregnancy, parental status, national origin, ethnic background, age, disability, political opinion, social status, ve

mpany has facilities. In addition, Ericsson supports the UN Guiding Principles for Business and Human Rights and the United Nations Gl

sence, compensation, training and development.

national origin, ethnic background, age, disability, political opinion, social status, veteran status, union membership or genetic informati
hes, in order to optimize the performance of mobile access networks.

es in 5G networks.
ling, traffic load balancing, QoS / QoE management and contextualized data mining are some practical examples of optimization.

s for 5G networks.

ement, supervised, unsupervised or hybrid). Knowledge of optimization techniques such as Transfer Learning / Federated Learning an

ect the learning method tailored to the problem.

s. This validation will be performed on a software platform embedding the usual libraries in machine learning (e.g. Keras, PyTorch) and
e communications service providers, governments, large businesses and end users with the most comprehensive portfolio of products
etwork compression.

ng techniques, data pre-processing

anomaly detection)

nce in the topics we are interested in, that also counts!


is uniquely positioned to help communication service providers, governments, and large enterprises deliver on the promise of 5G, the
m the way people and things communicate and connect.

urther develop as 5G L1 SoC and L1 algorithm expert!

eam of technical communicators who create customer documentation solutions for Nokia Software products.
or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disability, protected veteran st

keholders, you will propose solutions to lead the implementations of new features.
et fully functional systems in time. 5G Test libraries created during the process by team are
o build the next generation of data tools to enable us to take full advantage of this data. In this role, you will learn and work with the c

developing APIs, developing machine learning models, creating advanced data visualizations.

is uniquely positioned to help communication service providers, governments, and large enterprises deliver on the promise of 5G, the
m the way people and things communicate and connect.

3G renewal today with future evolution to 4G LTE and 5G technology on the same platform. The Cloud Mobility Manager (CMM) produ
chnology (with focus Artificial Intelligence/Machine Learning),

cript, Ajax, JQuery), SQL (mySQL) - or other database exposure. Experience with projects in virtualization/VM/public cloud and containe
ge, marital status, disability, protected veteran status or other characteristics protected by law.

in, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected

ge, marital status, disability, protected veteran status or other characteristics protected by law.

art of a scrum team responsible for implementation of 4G and 5G functionalities in next generation base station.
ules. You will design and implement features and functionality for the products focusing on all technologies including 5G NR / mMIMO

ngineering problems. Participation in functional specifications reviews to identify issues. Close cooperation with other project teams a

order to observe the actual versus expected behavior of the software running on the radio.
ardware functionalities for the Nokia 4G/5G products, ready to be delivered to customers all over the world.

y and the opportunity to get in touch with 5G technology and our development environment.
assive data and analytics problems across multiple industries. We are looking for machine learning engineers to join the ML effort for o

red to customers all over the world.

, coding, testing).

o one of the Squads, each composed of 5-9 people. Your team has objectives to deliver in a sprint (1 month). In an environment where
cture and Design) and successfully implemented a development environment fully deployed in Cloud.

y and the opportunity to get in touch with 5G technology and our development environment.

a, C#, etc.) is a plus.


it is located in Oulu.

ation flow optimization. This involves product and RF component level simulation tools and methodologies optimization. RF system sim

ystem with Huawei European customers to address real-world issues. The Lab will also engage with key European universities to build
nadians communicate.

e Bell team.

d strategic development and execution of day-to-day operations, develop tools and processes to drive service enhancements, manage

t processes target marketing & contact strategy opportunities and provide insight to drive optimal business decisions.
ber feels valued, respected, and supported, and has the opportunity to reach their full potential. We welcome and encourage applicati

directly or recruitment@bell.ca to make arrangements. If you have questions regarding accessible employment at Bell please email ou

rson will greatly influence the quality levels of our products and have an important role within the operation.
at really matter to their business.
g and developing scalable AI algorithms, the team will have an access to large real datasets collected from Huawei devices across Europ

technological advances with more people around the world. Walk the walk and make dreams come true.
, personalize Woebot's content and relationship to each user, and execute experiments that allow Woebot to deliver the right method
Integrated Technology transformation, Robust Marketing spend effectiveness & governance.

external data and apply data analytics, machine learning and visualisation techniques to maximise the value of this data and deliver po

ts. This role will include leading data science projects from a technical point of view, as well as providing technical leadership to data sc
rtners, developing, supporting, and, maintaining Ericsson products.
er the world.

ng our build and delivery environment and tools with the DURA functional framework requirements for our common CI-machinery. Si
setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable, and connected world.

ysis to delivery and beyond.


setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable, and connected world.

pment of the SSR product in direct partnership with other Ericsson sites. We are looking for Senior Developer our Evolved Packet Gate
kle some of society´s most complicated challenges, enabling you to be ‘the person that did that’.
e working directly with your Product Owner and partners in a collaborative manner. This requires maturity and a ‘team first’ approach
m is a plus.

as. With a strong focus on innovation, we possess 49 thousand registered patents and a global strength of over 95 thousand competen

ck Mobile, Click Plan, Click Forecast and Click Analyze) .


g. We prioritize in multi-functional development teams in which continuous improvement, innovation and sharing knowledge is part o
sforming our portfolio from virtualized products to cloud native products.

s it can be integrated and interoperable with legacy technologies and the different regulatory requirements of the global market.
e market today. We are looking for intelligent, creative and self-motivated software developer who are passionate about design code o

r some awesome benefits, amazing career development and training programs to provide an empowered career in a connected world
l environment.

omers in North America. You will be exposed to the latest radio products that will be rolled out in our biggest customers networks. Th

areas of MI driven 4G and 5G network, distributed cloud, IoT and other emerging businesses.
rely on innovative thinking to come from every member of the team – we know brilliant ideas can come from anyone. If you can envis

nsights into the impacts of alarms to services and reduce the cost to fix alarms for our customers.

ation, verification, simulations, tools design, Product Lifecycle Management support and product documentation. Our focus is on Lean
nt to support the radio performance verification of new radio products. Testing is executed in a high-tech lab environment using both a
tructure. Harnessing the power of big data analytics, CENX visualizes network and service topology, inventory, fault, and performance

tructing solutions to meet concrete needs, and proposing opportunities made possible by new technologies and techniques.

n is a manual affair, executed by network engineers using heuristics and accepted practices. However, the deployment of 5G networks
rich data sets with years of history that document a deep variety of facts about network operation across the full stack, from radio phy
ms and the broader Ericsson Machine Learning community to innovate and execute concrete solutions, cross pollinate knowledge, and
ation or use of appropriate procedures, techniques and methods. The candidate must be motivated, have an excellent capacity for lea
be involved in the whole process, from the pre-sale of services and networks to the delivery and acceptance of services.

with the team, the site manager and the client.


orks. You will be engaged in the whole process from pre-sales of services and networks to service delivery and acceptance.

mer contracts.

project manager, and customer.


l opinion, social status, veteran status, union membership or genetics.

and the United Nations Global Compact.

ership or genetic information.


ples of optimization.

g / Federated Learning and of ways to implement them considering the level of complexity they introduce, is required as well.

g (e.g. Keras, PyTorch) and / or Nokia platform.


nsive portfolio of products, services and licenses on the market. From the deployment of infrastructures for 5G, from the Internet of T
on the promise of 5G, the Cloud and the Internet of Things.
bility, protected veteran status or other characteristics protected by law.
learn and work with the company's products, data consumers and analysts.

on the promise of 5G, the Cloud and the Internet of Things.

lity Manager (CMM) product combines the MME (4G LTE) and SGSN (2G/3G) functions, paving the way for AMF (5G) functionality.
/public cloud and containers.
al, state or local protected class.
including 5G NR / mMIMO / 4G LTE.

with other project teams and stakeholders from other foreign locations around the world. Due to the close interaction between the e
s to join the ML effort for our teams, building ML-based systems, tools, and services that serve as infrastructure for our internal and ex

. In an environment where trust and autonomy are encouraged, each team member selects the tasks to work on and exchanges daily
ptimization. RF system simulations and component/sub-module models development are part of job role. All these help RF projects to

opean universities to build a basic research capability to support Huawei technical projects.
e enhancements, manage customer loyalty and retention, and leverage big data and artificial intelligence to create intellectual propert
e and encourage applications from people with disabilities.

ent at Bell please email our Diversity & Inclusion Team at inclusion@bell.ca.
uawei devices across Europe and will have access to Big Data processing infrastructure as needed (e.g., GPUs, Spark Clusters).
o deliver the right method to the right person at the right time.
of this data and deliver positive outcomes for our customers. The complex challenges the team faces means there is considerable oppo

hnical leadership to data scientists in the development of such projects.


common CI-machinery. Since Radio SW is multi-standard, you will need to interact with both internal and external Radio SW parties tr
nnected world.

r our Evolved Packet Gateway and Wifi Gateway product portfolio.


nd a ‘team first’ approach, which we believe is the best way to create great software.
ver 95 thousand competent professionals. We also take pride in supporting networks that cater to a capacity of 40 percent of the worl
haring knowledge is part of the daily work.
common CI-machinery. Since Radio SW is multi-standard, you will need to interact with both internal and external Radio SW parties tr
of the global market.
onate about design code or creating advanced automated test cases both. We provide a fun and challenging environment where you

reer in a connected world.


st customers networks. The goal is to validate the functionality of the latest Radio Products and to find and correct as many faults as p
m anyone. If you can envision a path outside of the ordinary and can bring unparalleled skills to Ericsson, then we want to talk to you.

ation. Our focus is on Lean and Agile ways of working. We organize in cross functional development teams in which continuous improv
environment using both automated and manual methods.
y, fault, and performance in a single pane, in real time. We enable the world's largest and most innovative service providers to scale th

and techniques.

eployment of 5G networks will explode the scale and complexity of wireless networks well past the ability of this paradigm to cope.
e full stack, from radio physics to video streaming quality. In the CENX group we particularly look for opportunities that cross those bo
pollinate knowledge, and educate colleagues about our ML efforts.
n excellent capacity for learning, be independent and have the desire to continuously improve.
e of services.
nd acceptance.
s required as well.
5G, from the Internet of Things, to emerging applications in the fields of virtual reality and digital health, we are shaping the future of
MF (5G) functionality.
interaction between the embedded software and the radio hardware, the successful candidate must be able to understand specificati
ure for our internal and external clients.

rk on and exchanges daily with his/her team mates on the progress and difficulties.
l these help RF projects to develop higher quality products in faster manner and evaluating RF performance even before actual HW is
create intellectual property.
s, Spark Clusters).
there is considerable opportunity for team members to grow and develop both their technical and non-technical skills to achieve thei
xternal Radio SW parties trying to align many different requirements on our build and delivery environment.
y of 40 percent of the world’s mobile traffic, thereby connecting more than 2.5 billion subscribers and counting. We are a world leader
g environment where you can grow together with other members in an inspired cross-functional team.
orrect as many faults as possible, prior to rolling out the product network wide. As part of your work, you will be interacting with vario
n we want to talk to you.

which continuous improvement, innovation and knowledge sharing is part of the daily work.
ervice providers to scale their operations as the network scales.

this paradigm to cope.


unities that cross those boundaries.
are shaping the future of technologies in order to transform the human experience.
e to understand specifications and datasheets for hardware components that are managed by the software to be written.
even before actual HW is available. You will evaluate new tools, write reports and develop documentation and trainings to take these
hnical skills to achieve their full potential at Bank of Ireland.
ng. We are a world leader in the rapidly changing environment of communications technology – by providing hardware, software, and
ill be interacting with various Ericsson organizations such as development unit, customer units, but also directly with the customers.
nd trainings to take these into use by design teams.
g hardware, software, and services to enable the full value of connectivity.
ctly with the customers.
Power Control Parameters Optimization

Data needed (Data Collection) directly impacts P.C


MRR + KPIs + FAS
Data Preparations
Merge Data Files >> Data Per Site (or CHGR-1), Skipping Data of CHGR-0
Clean
Missing
#N/A
NaN
Empty
Outlier
Duplicates
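
The preparation steps above (merge per site and channel group, skip CHGR-0, then handle missing values, outliers and duplicates) could be sketched with pandas roughly as follows. This is only a minimal sketch: the column names `site`, `chgr`, `rxlev` and `drop_rate`, the helper `prepare`, and the 1.5 x IQR outlier rule are all assumptions chosen for illustration and are not taken from the sheet.

```python
import pandas as pd


def prepare(mrr: pd.DataFrame, kpi: pd.DataFrame) -> pd.DataFrame:
    """Merge two exports per (site, chgr) and clean the merged rows."""
    # Merge the data files so that each row describes one site / channel group.
    df = mrr.merge(kpi, on=["site", "chgr"], how="inner")
    # Skip the data of channel group 0.
    df = df[df["chgr"] != 0].copy()

    value_cols = [c for c in df.columns if c not in ("site", "chgr")]
    # Markers such as "#N/A" or empty strings become NaN when coerced to numbers.
    df[value_cols] = df[value_cols].apply(pd.to_numeric, errors="coerce")
    # Drop rows with missing values, then exact duplicate rows.
    df = df.dropna(subset=value_cols).drop_duplicates()

    # Assumed outlier rule: keep rows inside 1.5 * IQR for every value column.
    q1 = df[value_cols].quantile(0.25)
    q3 = df[value_cols].quantile(0.75)
    iqr = q3 - q1
    inside = (
        (df[value_cols] >= q1 - 1.5 * iqr) & (df[value_cols] <= q3 + 1.5 * iqr)
    ).all(axis=1)
    return df[inside].reset_index(drop=True)


# Hypothetical toy data: one CHGR-0 row, one duplicate, one "#N/A", one outlier.
mrr = pd.DataFrame({
    "site": ["S1", "S1", "S2", "S2", "S3", "S4", "S5", "S6", "S7"],
    "chgr": [0, 1, 1, 1, 1, 1, 1, 1, 1],
    "rxlev": [9, 10, 11, 11, 12, 13, 14, 100, "#N/A"],
})
kpi = pd.DataFrame({
    "site": ["S1", "S1", "S2", "S3", "S4", "S5", "S6", "S7"],
    "chgr": [0, 1, 1, 1, 1, 1, 1, 1],
    "drop_rate": [0.5, 1, 2, 3, 4, 5, 3, 2],
})
clean = prepare(mrr, kpi)
print(clean)
```

Whether outliers should be dropped or capped depends on the KPI; dropping is used here only because it keeps the sketch short.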

Pandas for Data Analysis


Scaling Data
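
Scaling matters for the clustering step because K-Means relies on Euclidean distance, so a feature with a large numeric range would otherwise dominate. A minimal sketch of z-score scaling with pandas is shown below; the helper name `standardize` and the column names are hypothetical, and min-max scaling would be an equally reasonable choice.

```python
import pandas as pd


def standardize(df: pd.DataFrame, cols: list) -> pd.DataFrame:
    """Return a copy of df with the given columns scaled to mean 0 and std 1."""
    out = df.copy()
    mean = out[cols].mean()
    # Population standard deviation; constant columns would give 0,
    # so 0 is replaced by 1 to avoid dividing by zero.
    std = out[cols].std(ddof=0).replace(0, 1.0)
    out[cols] = (out[cols] - mean) / std
    return out


# Hypothetical features on very different scales.
features = pd.DataFrame({
    "rxlev": [10.0, 12.0, 14.0],
    "drop_rate": [0.01, 0.02, 0.03],
    "const": [5.0, 5.0, 5.0],
})
scaled = standardize(features, ["rxlev", "drop_rate", "const"])
print(scaled)
```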

Matplotlib and Seaborn for Visualization

K-Means Clustering
Elbow
Cluster
Cluster Analysis + Mapping
Cluster Visualization
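
The elbow step can be illustrated with scikit-learn's `KMeans`: fit the model for several values of k, record the inertia (within-cluster sum of squares), and pick the k after which the curve flattens. The sketch below uses synthetic two-dimensional data with three well-separated groups as a stand-in for scaled per-site features; the helper `elbow_inertias` and the data are assumptions for illustration only.

```python
import numpy as np
from sklearn.cluster import KMeans


def elbow_inertias(X, k_values, seed=0):
    """Fit K-Means for each k and return {k: inertia}."""
    return {
        k: KMeans(n_clusters=k, n_init=10, random_state=seed).fit(X).inertia_
        for k in k_values
    }


# Synthetic stand-in for scaled features: three groups of 30 points each.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])
X = np.vstack([c + rng.normal(scale=0.5, size=(30, 2)) for c in centers])

# Elbow: the inertia drops sharply up to k=3 and only slowly afterwards.
inertias = elbow_inertias(X, range(1, 7))
for k, value in inertias.items():
    print(k, round(value, 1))

# Cluster: fit the chosen k and map every row to its cluster label.
model = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
labels = model.labels_
```

For the analysis and mapping step, the labels can be attached to the original per-site table as a new column so that each cluster's average KPIs can be compared.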
KPIs: Supervised Learning (NQI Targets)
ML (Supervised + DQN): SDCCH Establishment >> NQI Parameter tuning >> DQN
In-depth study of the relationship between them
sources
Coursera Udacity Udemy Medium Towards Data Science Analytics Vidhya
Machine Learning Mastery Chicago University very important Books University Lecture
Element Toronto >> UoT
British Columbia
Waterloo
Cornell
CMU
Packt O'Reilly
Groups:

Group1: Basics
Math + Prob + Stat + Convex + Discrete + Linear Algebra
PGM ??

Group2: ML + DL + RL
ML
Deep Learning
NLP
Reinforcement Learning
Experience building end-to-end pipeline and deploying machine learning models
Time Series
TensorFlow
Keras
PyTorch
Experience with ML collaborative platforms/pipelines (MLflow (an open source platform for the machine learning lifecycle), Neptune, Kubeflow (the Machine Learning Toolkit for Kubernetes))
Notebook experience (Jupyter, Zeppelin, Databricks, etc.)
Group3:
Software Developer: DS + Design pattern + refactoring + DDD + Application Performance and Memory Management
Fluent Python >> High High Level programming
OOP+Solid+Pattern + DS + Algorithms
BlockChain
Java > Udacity >> Coursera OOP+Solid+Pattern
Android + iOS
C++ >> Udacity
APIs
REST APIs
Ideal Candidates Will Also Have
Proficiency in Python's asyncio and aio
TDD, DDD and refactoring skills
Experience in developing and supporting frameworks
Hands-on experience with Docker
Web full stack experience
Familiarity with Jenkins

https://www.coursera.org/specializations/advanced-app-android
Imperial College London

Group4:
Cloud Computing : GCP + AWS + Azure
Linux command line and shell scripting.

Group5:
data mining language (e.g., R, SAS, SPSS),

Group6:
Big Data:
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, Elasticsearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
 a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
 data automation and ETL tools ie: WhereScape, SSIS, Informatica
 analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, Couch

Group7:
Agile Scrum and/or kanban method
 Agile software development and tools ie: JIRA, Azure DevOps,

Group8:
 visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI

Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)

Group10:
Software Engineering : Hammad

Group11:
BlockChain
3G+4G+5G Planning & Optimization

Group12:
Business Finance
HackerRank Questions (interview)

Group13:
Optimization
PGM
Bayesian
Gaussian

Group14: Data science interview questions and answers

https://data-flair.training/blogs/data-science-interview-questions-and-answers/

More and More Time Management:

Bayesian Russia + Duke Bayesian Statistics very very important + University of California, Santa Cruz (mcmc-bayesian-statistic

Month-4
Time-ser udemy + Coursera 15-April
Johns Hopkins
Washington
Time series udemy + practical-time-series-analysis Feature Engineer + Feature Selection + Projects
ensemble project Packt
applied AI
https://courses.analyticsvidhya.com/courses/applied-machine-learning-beginner-to-professional/?utm_source=blog&utm_me
Projects >> Hisham >> Udacity + Udemy

Coursera
https://dzone.com/articles/5-best-reinforcement-learning-courses
Practical Reinforcement Learning (Russia Coursera)
Reinforcement Learning in Finance + advance (NYU ) Coursera

Month-5 >>NLP maybe start DL >> tensorflow >> pyto


Book > O'Reilly Hands-On Unsupervised Learning
NLP
recommended

PGM Coursera + CMU


http://www.cs.cmu.edu/~epxing/Class/10708-14/lecture.html

https://www.cs.cmu.edu/~epxing/Class/10708-20/lectures.html

https://www.cs.cmu.edu/~epxing/Class/10708-19/lectures/

https://www.youtube.com/playlist?list=PLoZgVqqHOumTqxIhcdcpOAJOOimrRCGZn

Apart from the MOOC by Daphne Koller as mentioned by Shimaa, you can look at the following courses on PGMs:

1. Machine Learning and Probabilistic Graphical Models by Sargur Srihari from University at Buffalo. You can find the video lec

2. Probabilistic Graphical Models by Andreas Krause from Caltech. You can find the slides at this link: http://courses.cms.caltec

3. Probabilistic Graphical Models by Eric Xing from CMU. Slides at this link: http://www.cs.cmu.edu/~epxing/Cl...

4. Probabilistic Graphical Models by David Sontag from NYU. Slides at this link: http://cs.nyu.edu/~dsontag/cours...

Projects

then >> Deep >> TensorFlow >> PyTorch >> Keras

Inferential Statistics (University of Amsterdam)


Stochastic processes (Coursera) >> Game theory

Data structures and algorithms


PGM
Convex
Game theory >> Linear Programming
Discrete optimization specialization
stochastic process

(Month Data >> MRR )


Monte Carlo + Markov chain
Gaussian Models >> LDA QDA
Bayesian statistics

Udacity 13-March or 15-March Finish


advance ML Coursera >> Kaggle win Coursera
applied Machine learning in python University of Michigan

Data Science Johns Hopkins Coursera YouTube


https://www.youtube.com/playlist?list=PL_onPhFCkVQgQ2-FHMii4Vr3fZlQ1XlRK
(Bob Trenwith)
>> Stop >> Time Series Udemy >> Coursera
IBM Coursera https://www.youtube.com/user/TheBigDataUniversity/playlists
appliedAI

after finish them >> Data Camp


Columbia University >> Edx + Harvard
Machine Learning: Algorithms in the Real World Specialization >> Alberta Coursera
============================
Udacity Projects >> Kaggle Projects

Deep ( Hisham parallel with andrew )

Lectures or more theory to understand the meaning deeply?


Projects Pro
Bayesian ( New Ideas )
Book >> Mastering
PGM Coursera
Mastering Probabilistic Graphical Models using Python
Udemy A-Z (R programming)
NLP udacity + Coursera ( New Ideas )
Reinforcement learning Udacity > Practical Coursera ( New Ideas )
Time series >> Coursera >> Udemy + DataCamp
DL >> TensorFlow + Keras + PyTorch ( understand NN in detail )
DataCamp All Career Track

Groups:

Group1: Basics
Math + Prob + Stat + Convex + Discrete + Linear Algebra
PGM ??
Group2: ML + DL + RL
ML
Deep Learning
NLP
Reinforcement Learning
Experience building end-to-end pipeline and deploying machine learning models
Time Series
TensorFlow
Keras
PyTorch
Experience with ML collaborative platforms/pipelines (MLflow, Neptune, Kubeflow, etc.)
Notebook experience (Jupyter, Zeppelin, Databricks, etc.)

Group3:
Software Developer: DS + Design pattern + refactoring + DDD + Application Performance and Memory Management
Fluent Python >> High High Level programming
OOP+Solid+Pattern + DS + Algorithms
BlockChain
Java > Udacity >> Coursera OOP+Solid+Pattern
Android + iOS
C++ >> Udacity
APIs
REST APIs
Ideal Candidates Will Also Have
Proficiency in Python's asyncio and aio
TDD, DDD and refactoring skills
Experience in developing and supporting frameworks
Hands-on experience with Docker
Web full stack experience
Familiarity with Jenkins

https://www.coursera.org/specializations/advanced-app-android
Imperial College London

Group4:
Cloud Computing : GCP + AWS + Azure
Linux command line and shell scripting.
deployment

Group5:
data mining language (e.g., R, SAS, SPSS),
Group6:
Big Data:
10+11+12
Garage >> Data Eng (Udacity >> Udemy >> Coursera)
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, Elasticsearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
 a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
 data automation and ETL tools ie: WhereScape, SSIS, Informatica
 analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, CouchDB, DynamoDB, HBase, Neo4j, Cassandra, Redis)

Group7:
Agile Scrum and/or kanban method
 Agile software development and tools ie: JIRA, Azure DevOps,

Group8:
 visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI

Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)

Group10:
Software Engineering : Hammad
Group11:
BlockChain
3G+4G+5G Planning & Optimization

Group12:
Business Finance
HackerRank Questions (interview)
LeetCode
System Design

Group13: Advanced Topics


Optimization
PGM
Bayesian
Gaussian

Group14: Data science interview questions and answers

https://data-flair.training/blogs/data-science-interview-questions-and-answers/

Group15: IELTS

Group16:application
Optimization
Trading AI

Udacity ML
Coursera Bayesian
Coursera NLP + Slideshare
[Master ML >> ML >> ML 3 best Course]
appliedAI Courses
DataCamp
Udemy A-Z >> R

IBM Data Science Professional Certificate 5 Courses


University of Michigan 3 Courses
Johns Hopkins >> Data Science: Statistics and Machine Learning Specialization 10 Courses
University of Washington >> Machine Learning Specialization 4 Courses
LinkedIn Courses
Kaggle

How to Win a Data Science Competition: Learn from Top Kagglers


Time Series
DS Coursera >> DS Udacity + Discrete Math
Udacity > Intro to TensorFlow > Intro to PyTorch
Udacity >> Reinforcement Learning in depth
>>python fluent >> OOP >> Design Pattern

>>SQL Expert: + Carnegie Mellon University


Coursera:Learn SQL Basics for Data Science Specialization 5 Courses (UCDAVIS)

>>Deep Learning
Coursera+Hisham >> TensorFlow in Practice

>> Data analyst + Data Engineering + Big Data


San Diego
Yandex
Cloudera
Udacity
Udemy All
Data Engineering, Big Data, and Machine Learning on GCP Specialization

DevOps: Docker + Kubernetes + Git

Torrent:
University of Michigan

Applied Machine Learning in Python


Applied Text Mining in Python
Applied Social Network Analysis in Python

very important Slides Advanced Machine


http://www.cs.cmu.edu/~tom/mlbook-chapter-slides.html

ruz (mcmc-bayesian-statistics) (Bayesian Statistics: From Concept to Data Analysis) + Improving your statistical inferences (Eindhoven University of Technology)

?utm_source=blog&utm_medium=comprehensive-guide-k-means-clustering
urses on PGMs:

o. You can find the video lectures and slides at this link: http://www.cedar.buffalo.edu/~sr...

nk: http://courses.cms.caltech.edu/c...

u/~epxing/Cl...

~dsontag/cours...

More and More Time Management:

Udacity 13-March or 15-March Finish


Advanced ML Coursera >> Kaggle win Coursera
Applied Machine Learning in Python (University of Michigan)

Data Science Johns Hopkins Coursera YouTube


https://www.youtube.com/playlist?list=PL_onPhFCkVQgQ2-FHMii4Vr3fZlQ1XlRK
(Bob Trenwith)
>> Stop >> Time Series Udemy >> Coursera

Senior Data Scientist
Senior Data Scientist (known internally as Sr Manager Data Scientist)

Rogers DAMT (Data Analytics and Marketing Tech) is looking for a Lead Data Scienti

Data Scientist will be expected to lead, define, and execute challenging projects tha

Data Scientists will be primarily responsible for interpreting and solving deep business prob
Manage customer churn
Acquire and retain customers
Build Likelihood To Recommend (LTR/NPS) clusters or models,
Increase revenue through cross sell and up sell products/services
Increase campaign effectiveness

Overall Role Requirements


Serve as a subject matter expert in Data Science and Data Engineering technology a
Work with engagement leads, program owners, and with leaders across divisions th
Understand business issues and decompose them into measurable data or modelin
Build predictive models leveraging advanced techniques ML/DL, tools and analytics
Perform model tuning and frequent training to maintain proper health of predictiv
Monitor the model performance and make ongoing corrections by executing exten
Prepare comprehensive model development and deployment documentation
Use advanced data mining techniques and perform data discovery to uncover the h
Develop and maintain an advanced knowledge of Rogers business and IT methodol
Deliver and track the value, and articulate it to the program owners

Skills/Qualifications
Bachelor's or Master's degree in a technology/analytical field such as Computer Scie
A minimum of five years of relevant professional experience in an Agile environme
Deep expertise in developing and maintaining the predictive models using advance
Excellent programming knowledge in any of the languages (SAS, Python, R, Spark/
Expertise in data extraction, manipulation, feature engineering and validations
Experience in building Deep neural networks (MLP, CNN, RNN) and use of AI/Deep
Experience in working with any of industry standard analytical/ML mod
Experience in managing and leading complex projects and/or multiple projects sim
Strong organizational skills and the ability to manage multiple tasks simultaneously
Exceptional interpersonal, persuasion, and communication skills in order to commu
Exposure to Reinforcement learning (DynaQ/Q+, SARSA, TD, Monte Ca

Schedule Full time

What is the opportunity?

As a Full-Stack Data Scientist, you will analyze, design, and implement AIOps solutio
What will you do?
Work on challenging and research-based initiatives using advanced machine learni
Provide analytics support to all TI pillars. This involves collaborating proactively wit
Prepare and integrate large and various types of data (structured/non-structured)
Implement machine learning models, data mining methods, and statistical analysis
Leverage visualization tools/packages to create powerful representations of results
Produce data-driven insights to help in informed decisions and actions by telling a c
Effectively communicate findings to business partners and executives
Collaborate with the development team to deploy production-scale solutions
Quickly learn new tools and technologies and use them in the daily analytics exerci
What do you need to succeed?

Must-have
Bachelor's, Master's, or PhD in Computer Science, Statistics, or relevant fields.
Expert in Python programming to write production-ready code
Strong data profiling, cleaning, mining and technical documentation skills
2+ years of experience in building machine learning models (Supervised/Unsupervi
2+ year(s) experience with NLP and text analytics methods and packages
Experience building end-to-end pipeline and deploying machine learnin
Experience with big data technologies - parallel processing techniques and Apache
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with custom Web interfaces, API calls, and systems integration
Familiar with Linux environment, shell scripting, and Git
Experience working in an agile environment

Nice-to-have
Software engineering background with a focus on statistics and/or analytics
Experience in deep learning methods for NLP in Tensorflow or Pytorch
Familiar with Technology Infrastructure

With a strong history of innovation for over 100 years, we are Canada’s largest div

Being part of the RACE21™ team, you’ll be leading the charge of a company-wide r

With the financial backing and commitment from our leadership, we envision a full
Renew, Automate, and Connect material, processes, equipment flows, and data s
Empower our employees by investing in digital skills and capabilities to enhance cre
We are world-class leaders in sustainability and safety – and we are building a bett

Reporting to the Manager of Technology, the Data Scientist, RACE21 brings deep u

Responsibilities
Be a courageous safety leader, adhere to and sponsor safety and environmental ru
Designs, develops, and implements end-to-end cloud based machine le
Ensures that data pipelines are scalable, repeatable, and secure, and can serve mu
Enables big data and batch/real-time analytical solutions that leverage emerging te
Collects, parses, manages, analyzes and visualizes large sets of data using multiple
Translates complex functional and technical requirements into detailed architectur
Codes, tests, and documents new or modified data systems to create robust and sc
Implements security and recovery tools and techniques as required
Works with fellow Data Scientists and Engineers to ensure that all data solutions ar
Ensures all automated processes preserve data by managing the alignment of data
Develops standards and processes for integration projects and initiatives
Qualifications
Master’s or PhD Degree in Information Technology, Computer Science, or a related
Minimum five years of experience in data science
Understanding of high performance algorithms and R statistical software
Experience in industry data science (e.g., machine learning, predictive maintenance
Capability to architect highly scalable distributed systems, using different tools
Demonstrated experience with object oriented design, coding and testing patterns
Expert knowledge of data modeling and understanding of different data structures
Strong understanding of Agile methodologies
Experience as a Data Scientist on an agile team or other rapid development metho
Excellent problem solving, critical thinking, and communication skills
Experience in developing presentations and communications to be shared with inte
Brings a high energy and passionate outlook to the job and can influence those aro
Able to build a sense of trust and rapport that creates a comfortable & effective wo
Passion for innovation and “can do” attitude
Ability to travel to our sites in British Columbia 20-40% of the time

Responsibilities

In this role, you would report to the VP Engineering. Responsibilities will include:
Conceiving and developing data driven solutions for clients
Creating and presenting results and findings to clients
Collaborating with developer resources to build data pipelines and operationalize s
Data cleaning & reshaping

Qualifications
Master's or Ph.D. in Econometrics, Financial Economics, Mathematics, Statistics, Phy
2-5 years of experience in a Data Science role
Experience in one or more programming languages (Preferably Python)
Experience with one or more data science frameworks (e.g. Numpy, Scikit-learn, Sc
Strong understanding of data and business
Ability to work autonomously and in a team
Working in a fast-paced environment
Ability to solve complex problems
Strong communication skills
Nice To Haves
Past experience with one or more business intelligence/analytics platfo
Analytical experience related to the food and beverage industry
Past experience working with time series data

Skills
Jupyter
Forecasting
Python
Statistical Modeling
Statistics
Interpreting Data
Problem Solving
Science

What You Will Do


Collaborate with a multi-disciplinary team of engineers and analysts on a wide rang
Bring analytical rigor and statistical methods to the challenges of measuring quality
Build innovative predictive analytics and data science solutions for a myriad of cybe
Think like an adversary.
Identify and articulate risks and remediation in a relevant and approachable manne
Identify data sources, collect, transform, and prepare large amounts of data for ana
Develop tools to help the data collection process as needed.
Use appropriate methods, tools, and algorithms to analyze data and create an impl
Validate the results of the data analysis to avoid errors.
Interpret results and identify value from the results.
Work with colleagues and customers, providing guidance on risks and limitations.
Monitor and continuously improve the data sources, usability, and analytical result

Education And Experience


Bachelor's or Master's degree in Computer Science, Statistics or similar quantitative
3-5 years of working experience in machine learning and data science projects
2-3 years of experience in working with large scale production data sets
Good understanding of the foundations of machine learning methods
Exceptional coding skills in R or Python

Must Have Skills Include


Experience with machine learning methods, models and frameworks
Familiarity with multiple programming and scripting languages (such as Java, Javasc
Familiarity with data visualization tools
Experience with passive and active measurement techniques
Experience with applying statistical modeling, machine learning, and data mining a

Highly Desirable
Experience in the cybersecurity industry
Experience with big data analytics databases is a plus
Knowledge of networking concepts, technologies and devices (Firewalls, Routers, S
Knowledge of network and related web protocols (such as TCP/IP, UDP, IPSEC, HTT
Knowledge of SQL
Linux System knowledge as user and administrator
Background in statistics

We might be a good match if...


You have expertise in Machine Learning and/or Natural Language Processing
You have familiarity working with the Google Cloud Platform and also want to help
You have deployed models to production with an Engineering team
You enjoy having diverse projects and stakeholders.
You believe data is most useful when it tells a story or addresses a business problem
You have a curiosity for patterns and problem solving.
You're both a self-starter and team player.
You love to think outside the box.
You take great pride in the quality of your work.
You have experience with machine learning, and scientific inquiry.

What will you do?


Work with a team of ambitious and passionate data scientists on a variety of projec
Work with the Client Success team to explore how recognition/engagement data re
Work with Product Managers and Development teams to develop Machine Learnin
Work with the Workforce Science team to push the boundaries on what is known a
Work with the Product team or other internal teams to analyse data such as featur
Bring an “A-Player” attitude: aiming for the happiness of your stakeholders; erring

Qualifications
Advanced degree in Statistics, Computer science, Behavioural Science or Mathema
4+ years of experience analyzing product and/or SaaS data, building ML and/or NLP
Demonstrated experience in leveraging data for actionable insights using different
Demonstrated experience working with both structured and unstructured data and
Have software engineering skills (scripting is important for quick prototyping, but k
Experience working with Google Cloud Platform
Experience building recommender systems and preferably NLP applications
Have deployed models to production with an engineering team
Excellent communication – written, conversational, presentation, and data-visualiz
Experience with both software engineering (ideally in an agile environment and wit
Experience in an HR space (e.g. People Analytics) or knowledge of Organizational P

Highly Desirable Qualifications


PhD in Machine Learning, Mathematics, Statistics, Computer Science or in another
Experience with Java and Spark
Experience in Predictive Analytics
Expertise in machine learning methods including Time series analysis,

Data Scientist

Job Description

As a Data Scientist, you are an experienced machine learning practitioner and pyth
Signal processing
Geo-spatial data processing
Natural Language processing
Social network analysis
Recommender system
Adaptive experimentation techniques
You will be part of our data science team and work in a dynamic and multi-cultural

Depending on your level of experience, you will lead your own projects and guide o

Job Description
You will help the team to improve upon current methods and models. With a practi
Your software development experience will allow you to closely collaborate with o
You will work in our headquarters in Antwerp (Belgium) or in Toronto (Canada)

Depending On Your Domain Of Expertise


You will work with big data (GPS fixes and sensor data) from hundreds of thousand
You will train algorithms related to activity detection (type of transports, home/wo
You will create machine learning pipelines capable of extracting insights from socia
You will design context-aware recommender systems

Requirements

Desired skills and expertise


You have a master's degree or PhD in computer science or a related field.
You are fluent in Python programming and its machine learning stack
You are an expert in machine learning and data modeling.
You have knowledge/experience with software engineering and deployment to pro
You have experience with at least one of the fields listed above.
You possess a deep understanding of clustering, manifold learning and predictive m
You can use big data tools such as Spark for development
You can work independently and take matters into your own hands.

Experience with any of the following is considered a plus:

Sensor data modeling from mobile devices or wearables


Data engineering technologies (Spark, Kafka, database,...)
Java
Mobile programming (Android or iOS)

Job Description:

Working closely together with our machine learning experts, you will design, imple

Your analytical skills and your profound interest in machine learning will push you t

Your software development knowledge will allow you to work closely with the engi

Requirements
Desired Skills and Expertise:

You have a master’s degree or a PhD in computer science or related field.

You have knowledge of software engineering

You have worked with main database technologies like SQL

You have experience in Python and its data analysis and machine learning stack

You have professional or academic experience in machine learning and data model

You have an analytical mindset and have a strong mathematical background

You are fluent in English. Dutch is a plus.

You can work independently and take matters into your own hands.

The ability to quickly learn new technologies and successfully implement them is es

Experience with any of the following is considered a plus:


Sensor data modeling from mobile devices or wearables

Data engineering knowledge (Spark, Kafka, database,...)

Experience/knowledge of Java or C++

Experience/knowledge of mobile programming (Android or iOS)

Knowledge in specialized fields of interest for Sentiance is considered a plus:

Signal processing

Geo-spatial data processing

Natural language processing

Social network analysis

Recommender system

Statistical analysis

Time series analysis

(Adaptive) experimentation techniques

Causal inference

Data Engineer
Job description
We are looking for the most ambitious and curious engineers in the field. You have

Sentiance aims to provide a resilient platform that serves as an insights engine on t


Bring a DevOps mentality
Attention to detail
Know the Reactive Manifesto by heart
Uphold best practices in engineering, security, and design
Enjoy working with a diverse group of people with different technical backgrounds
Task & Responsibilities

Integrate complex human behaviour models into a scalable production platform.


Develop software components and data processing pipelines.
Do performance and scalability engineering.
At Sentiance people come to have an impact and learn. You’ll be a part of an inter

Requirements

You have an academic degree (BSc or MSc) in computer science or related field.
Experience programming in Java and Python.
Practical understanding of software engineering best practices.
Know your way around the Linux operating system.
Work experience with Docker containers.
Work experience on distributed computation frameworks (Kafka, Spark, …) and No
You are fluent in English.
You can work independently and take matters into your own hands.
The ability to quickly learn new technologies and successfully implement them is es
Bonus points

Proven experience scaling to terabyte-sized datasets and managing pipelines to pro


Notions of and affinity for machine learning and data mining
Experience in statistical modeling
Experience programming in Go
Notions of functional programming

Your technology stack

You have mastered the SciPy stack (Python, pandas, numpy, scikit-learn).
You know your way around source control and reproducibility tooling (git, conda, p
Experience with Deep Learning frameworks (TensorFlow, PyTorch), an

Python Software Developer


What you'll do

You will work with our team of engineers to design, implement and deploy end-to-

Suggesting and implementing architecture designs that function as the backbone o


Designing and implementing APIs to expose Machine Learning services.
Promoting good Software Engineering practices to the team.
Improving response times through the use of correct platforms, frameworks and lib
Integrating DevOps pipelines to optimise workflows.

Your qualifications

You consider yourself an excellent Software Engineer.


You have a keen interest in Machine Learning and Artificial Intelligence.
You have a degree (Bachelor's or Master’s) in Computer Science or equivalent work
You have delivered at least one real-life project.
You have great communication skills and love to learn new things.

Your technology stack

You are skilled in Software Engineering with Python and have a strong grasp on sou
You know your way around web development frameworks (e.g. flask, d
You have a good overview of platforms and services that support production system
You master software design patterns and architecture design.
Experience with parallel computing libraries (dask, numba) and the SciPy stack (pan

Qualifications

BS / MS in Mathematics, Computer Science or an Engineering discipline from a top


2-5 years of experience in data science and/or statistics, using tools such as R or Py
Alternatively, a PhD focused on quantitative research and discovering insights from
2+ years of experience in SQL and relational databases (PostgreSQL preferred), and
Strong understanding of techniques in data science (machine learning algorithms, r
Strong data analysis skills, knowing what can / cannot be possible, and ability to vis
Good communication skills - written, visual (presentations), and verbal
Ability to deliver adequate results on deadline, and make compromises to enable s
Geo-spatial skills are a plus
Knowledge of transportation data / industry is a plus
Quick learner, and a strong team playe

Your Role Is To
Crack our business problems and come up with deployable machine learning mode
Interface with business to make sure they are asking the right questions
Sift through our data and find us some gems
Be a world-class hands-on deploy master

Charm Us With
Earned stripes in coding, in data handling, in statistics and in machine learning (
A good balance between theory and practice and a strong desire to learn and keep
Hands-on experiences and good understanding of various machine learning algorit
A good knowledge of what’s “Under the hood” of statistical methods
Coding, coding, coding (Python, Java, C++, Scala, R, …)
Some extra points on: SQL, NLP, image processing, recommendation system, busin

you should possess the creativity to invent and customize when necessary.
Analyze results and trends in order to assist in making appropriate recommendatio
Responsible for the development and insights of weekly, monthly, quarterly & ann
Collect performance data from internal and external data sources
Manage timeline for all digital reporting
Provide planning teams with key research and insights that help steer quality digita
Actively engage with industry partners to build relationships and grow understandi
Help to build key presentations to Client, Agency and Industry
Work with Mindshare Analytics team to evolve digital measurement process for bo
Qualifications
University degree in Information Systems, Statistics, Commerce or similar field or e
2+ years of relevant work experience
Strong attention to detail
Excellent analytical and problem solving skills
Ability to clean and dive into messy data to find insights
Intermediate knowledge of MS Office product suite
Intermediate to advanced experience with relational databases (e.g. SQL, MySQL)
Experience with media buying platforms, online advertising, SSPs, or DSPs is a plus
Intermediate experience with data visualization software (Tableau, Dato
Experience with open source programming languages (R, Python) is a plus
Experience with web analytics software (Google Analytics, Omniture, e
Expedia Career
About you:
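You'll dive into groundbreaking machine learning models, experiment and apply new ones, and apply analytics at scale in order to impact the business. Use the latest cloud and data technologies to train and deploy machine learning models at scale. If you can prove your approaches are good, they'll be quickly deployed to production.
You're passionate about asking and answering questions in large datasets, and you are able to communicate that passion.
You have a keen desire to tackle problems and live to find patterns and insights within structured and unstructured data.
You propose analytics strategies and solutions that challenge and expand the thinking of everyone around you.
You're looking for a role with diverse learning opportunities, growing and having fun while at it. You’re expected to stay ahead of the latest data science industry developments and coach the team on them.
Be enthusiastic in collaborating, developing relationships within the company, and finding new business applications of data science and coaching more junior team members.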
Your experiences:
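MSc or PhD degree in machine learning, or computer science/statistics/Physics with strong machine learning component
Knowledge of Python programming language
Good programming practices, ability to write readable, fast, object-oriented code
Expertise in machine learning: framing business problems as machine learning problems, building datasets, selecting and engineering features, building and optimizing algorithms
Good understanding of supervised, unsupervised and reinforcement learning (plus) machine learning algorithms
Experience with common data science toolkits, such as Scikit-learn, Spark ML, perfection in at least one of these is highly desirable
Very good understanding of data technologies: Hadoop, Spark, and standard relational database structures along with query languages (SQL); experience in the Cloud a plus
Experience working in a fast-moving commercial environment and excellent organisational skills - this is a fast-paced operation in a high-growth phase, so it's critical to be able to keep multiple balls in the air.
Ability to work collaboratively, as well as to manage workflow in accordance with project timelines is required
Good communication and team management skills to technical and business audiences, able to explain complex topics in simple terms, adapting communication technical depth to the audience
Strong business/commercial sense to combine with analytics to help drive recommendations; ability to demonstrate an understanding of the commercial impact of their activities
Working knowledge of statistics: hypothesis testing, confidence intervals and A/B tests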

What you will do?


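You will drive machine learning and optimization efforts to help improve how we i) collect and influence customer intent, ii) understand the relevant product options and iii) rank those options
You will work on key projects and apply machine learning, data mining and statistical modelling to design and implement mathematical models and algorithms to solve real-world applications
You will contribute to Expedia Group's wider data science efforts – an area of major focus for the company
You will run A/B test design, implementation and analysis on the website to find out the effectiveness of our efforts
Stay up to date with the latest data science / ML / AI technologies and techniques, identifying and advising how they can be utilized throughout the range of potential use cases.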
Who you are
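Extensive and relevant experience in data science and machine learning roles especially in Computer Vision, NLP or Recommender Systems
MS or PhD degree in a highly quantitative field such as Machine Learning/AI, Mathematics, Statistics, Computer Science, Economics or Engineering
Deep understanding of methods such as Learn to Rank, Recommender Systems, Personalisation, Classification, Deep Learning, Clustering, Regression, Predictive Modelling, Numeric Optimization etc.
Clear track record of turning ML into action
Experience with statistics/machine learning packages such as Keras, TensorFlow or Spark MLlib. Experience with programming in Python or Scala
Excellent written and oral communication skills, including an ability to communicate across business areas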

Responsibilities:

You own pricing and sort from end-to-end, including all aspects of analytics, test design and execution.
You develop consumer and pricing insights from sophisticated data-driven analysis that results in specific action plans.
You deploy, test, and manage pricing and sorting models.
You work closely with Product Managers and Engineers to design and execute site-wide monetization initiatives.
You provide management with market and/or trend information needed to make strategic decisions.
You lead the development and evolution of analytical models establishing objectives, key metrics, and test plans.
You manage cross-functional ad hoc research requests.
Qualifications:

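Master’s degree with 2-5 years of experience in a quantitative discipline (Computer Science, Machine Learning, Operations Research, Applied Mathematics, Industrial Engineering, Statistics), PhD degree preferred.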


Professional experience in e-commerce, with focus on supply/demand dynamics, price elasticity, customer behavior, and competitive strategies is a plus.
Prior analytics and data mining experience, focusing on segmentation strategies and predictive modeling.
Proven programming skills in Python with 1+ years hands-on coding experience.
Familiarity with Spark and Hadoop strongly desired.
Strong business sense and logical skills to balance data-driven decisions with intuition.
Ability to access, analyze, and draw insight from multi-terabyte data-sets.
Experience in statistical modeling (response models and predictive models a plus).
Desire and capacity to learn, develop, and lead.
Ability to understand complex systems.

Responsibilities include but are not limited to:

Develop machine learning model algorithm to optimize revenue strategy


Help automate machine learning model management to efficiently provide effective support to growing insurance portfolio and highly-relevant custom experiences while maximizing revenue opportunities
Report and analyze business performance from a product, marketing, and customer perspective
Deep-dive into product tests for insurance team to deliver additional insights
Identify business opportunities across different brands in our portfolio
Deliver impactful analysis based on robust economic, financial, and statistical analysis
Function independently, assessing existing and upcoming business needs
Requirements:

Master’s degree is required


At least 7 years of experience in insurance or data science area
Demonstrated ability to work through complex business problems and partner with internal clients using consultative approach
Strong analytical/modeling skills with ability to convert raw data into actionable insights
Strong hands-on experience in Python or R
Fluent in SQL
Great communication and presentation skills
Innovative

What you will do


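You will be developing state of the art algorithms to power various aspects of highly complex business models
You can articulate and understand a business problem, identify challenges, formulate the machine learning problem and provide/prototype solutions
You will provide technical leadership, identify and understand key business challenges and opportunities, and develop end-to-end software solutions using machine learning and optimization methods.
You will collaborate extensively with internal partners, program management, and engineering team to ensure that solutions meet business needs, permit valid inferences and have functional feasibility
You will collect and manipulate large volumes of data; build new and improved techniques and/or solutions for data collection, management, and usage
You will communicate results in a comprehensible manner to all levels of the company (field teams up to Snr. Management)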
You will brainstorm with other team members
Who you are
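PhD or MS in computer science or equivalent quantitative fields with background in both machine learning (PhD preferred) and software development. 3+ years industry experience and hands on experience building machine learning models.
Strong algorithm design and programming skills in at least one low level language like C++/Java/C, and scripting languages like Python/R/Scala
Experience with distributed computing (Hadoop/Spark)
Hands-on deep learning technologies are a huge plus
Passion for solving interesting and meaningful real-world problems using principled techniques and practices
2+ years’ experience of working with live data (data cleaning, data visualization and modeling)
Collaborates well in a team and sensitive to clients' needs, while developing warm relationships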
Experience with documenting, synthesizing, and communicating results.
Willingness to think through ambiguous problems with others
r Data Scientist)

looking for a Lead Data Scientist role to deliver Machine Learning Acceleration Project as part of Marketing Transformation initiative.

xecute challenging projects that provide the technology and processes to deliver data science models in a fast-paced, agile environment. D

and solve deep business problems by developing predictive models and analysis that make the best use of the wealth of Customer, Netw

ucts/services

Data Engineering technology and related processes


with leaders across divisions that can influence our Backlog and Roadmap
o measurable data or modeling requirements. Analyze data different sources and provide key insights throughout the strategy, design, de
ues ML/DL, tools and analytics frameworks to expose actionable insights in order to support Rogers businesses manage the customer base
ain proper health of predictive models by ingesting newer data sources
orrections by executing extensive model testing strategies
ployment documentation
ata discovery to uncover the hidden patterns in data and translate learnings into a business actionable outcome
gers business and IT methodologies and apply this knowledge to project work.

al field such as Computer Science, Management of Information Systems, Data Science, Machine Learning, Engineering, or other relevant te
erience in an Agile environment with excellent understanding of the underlying Statistical, Machine Learning theory and Predictive Model
edictive models using advance ML/DL techniques. Proficiency in building Supervised learning models [ (classification and regression) - tree
uages (SAS, Python, R, Spark/Scala)
ngineering and validations
NN, RNN) and use of AI/Deep Learning frameworks like MXNet, Caffe 2, Tensorflow, Theano, CNTK, and Keras is an added advantage
andard analytical/ML modern ML platforms like (SAS Viya, Cloudera, SageMaker, Azure ML)
s and/or multiple projects simultaneously, including (but not limited to) building descriptive, predictive and prescriptive solutions and dep
multiple tasks simultaneously
cation skills in order to communicate strategic and technical ideas to internal audiences to both inform and solicit buy-in from the end use
Q+, SARSA, TD, Monte Carlo) is preferred

, and implement AIOps solutions at RBC Technology Infrastructure (TI). Leveraging leading edge technologies and various data sets, you w
sing advanced machine learning methods focusing on tangible outcomes
s collaborating proactively with various business and technical units to identify business opportunities and designing innovative solutions t
a (structured/non-structured)
ethods, and statistical analysis
erful representations of results
sions and actions by telling a convincing story
s and executives
oduction-scale solutions
em in the daily analytics exercises

stics, or relevant fields.

documentation skills
models (Supervised/Unsupervised)
thods and packages
deploying machine learning models
essing techniques and Apache Spark, Hadoop ecosystem, NoSQL/SQL databases
, Kubernetes, Openshift, PCF
d systems integration

tistics and/or analytics


orflow or Pytorch

rs, we are Canada’s largest diversified natural resource company looking to embark on our next chapter.

e charge of a company-wide renewal of technology and infrastructure – a high-tech transformation of mining into the next generation.

r leadership, we envision a fully-integrated agile digital platform that will:


, equipment flows, and data systems to expand and enable a broad application of advanced analytics, robotics, and artificial intelligence
and capabilities to enhance creativity and innovation – increasing sustainability, safety, and performance in real time, unlocking resource b
y – and we are building a better world for the next generation with lasting competitive advantages, from 2021 and beyond.

ientist, RACE21 brings deep understanding of big data to the teams, and helps in building and enabling big data analytics solutions. They a

r safety and environmental rules and procedures


nd cloud based machine learning production pipelines (data exploration, sampling, training data generation, feature en
and secure, and can serve multiple users within the company
ons that leverage emerging technologies
ge sets of data using multiple platforms
ments into detailed architecture, design, and high performing software
ystems to create robust and scalable applications for data analytics
ues as required
nsure that all data solutions are consistent
anaging the alignment of data availability and integration processes
ojects and initiatives

Computer Science, or a related quantitative discipline

R statistical software
arning, predictive maintenance) preferred
ems, using different tools
n, coding and testing patterns as well as experience in engineering software platforms and largescale data
ng of different data structures

her rapid development methods preferred


munication skills
ications to be shared with internal and external stakeholders
b and can influence those around them
s a comfortable & effective workplace

% of the time

Responsibilities will include:

pipelines and operationalize solutions

s, Mathematics, Statistics, Physics, Computer Science, Data Science, Engineering or equivalent experience

Preferably Python)
ks (e.g. Numpy, Scikit-learn, Scipy, Pandas)
ntelligence/analytics platforms (Teradata, Microstrategy, Cognos)

rs and analysts on a wide range of cybersecurity problems.


hallenges of measuring quality, improving security products, and understanding the behavior of end-users, computer systems, and networ
solutions for a myriad of cybersecurity use cases.

vant and approachable manner with both technical and non-technical audiences.
e large amounts of data for analysis.

nalyze data and create an implementation plan to address business problems.

ance on risks and limitations.


usability, and analytical results.

tatistics or similar quantitative field


and data science projects
oduction data sets
earning methods

and frameworks
anguages (such as Java, Javascript, C/C++, Perl, etc.)

ne learning, and data mining algorithms to business problems.


d devices (Firewalls, Routers, Switches, and Load Balancers)
ch as TCP/IP, UDP, IPSEC, HTTP, HTTPS, DNS, SSH, routing protocols)

ral Language Processing


latform and also want to help shape our Data Science toolset and stack
gineering team

r addresses a business problem.

ntific inquiry.

scientists on a variety of projects. Use both your IQ and EQ to support yourself, your team, and your company in achieving more inside and
cognition/engagement data relates to customer data and business metrics.
ms to develop Machine Learning and Natural Language Processing algorithms and scripts that will be built into software products.
boundaries on what is known about the “how” and “why” of the ways people work.
to analyse data such as feature usage and adoption and uncover insights to guide future product investments or business directions.
s of your stakeholders; erring on the side of quick communication and deep collaboration; strong sense of integrity and respect toward yo

havioural Science or Mathematics preferred


S data, building ML and/or NLP algorithms or making product focused impact with Data Science.
onable insights using different techniques including AI and ML.
red and unstructured data and a deep understanding of ML algorithms, NLP and statistical modeling.
nt for quick prototyping, but knowledge in OOP is crucial for the long run), fluency in Python, experience in deep learning libraries such as

erably NLP applications

resentation, and data-visualization.


n an agile environment and with programming best practices) and empirical science.
nowledge of Organizational Psychology is highly desirable.

mputer Science or in another highly quantitative field


ding Time series analysis, State-space models, Mixed-effect models, Longitudinal data analysis, Hierarchical Bayes; a

learning practitioner and python programmer with a background in software engineering and in at least one of the following fields:

n a dynamic and multi-cultural environment, among the brightest minds sourced from 25 different countries

your own projects and guide other people.

hods and models. With a practical mindset, you will bring these models into a production environment.
u to closely collaborate with our Engineering team to improve our models and push them through our release process.
m) or in Toronto (Canada)

a) from hundreds of thousands of users.


(type of transports, home/work location, etc.) and human behavior (commuting, sporting, shopping, etc.).
extracting insights from social interactions between users on our platforms

ce or a related field.
ne learning stack

eering and deployment to production

ifold learning and predictive modeling techniques.

our own hands.


experts, you will design, implement, and maintain decision logic that supports our behavioral profiling product. You will integrate existing b

achine learning will push you to continuously educate yourself to become a hands-on data scientist.

u to work closely with the engineering team to deliver robust and scalable solutions while implementing state-of-the-art algorithms and pr

ence or related field.

and machine learning stack

chine learning and data modeling

thematical background

our own hands.

cessfully implement them is essential.


nce is considered a plus:

ngineers in the field. You have had at least 2 years of work experience and have a passion for building state-of-the-art innovative computin

rves as an insights engine on top of sensor data

fferent technical backgrounds


alable production platform.

rn. You’ll be a part of an international team brought together by a culture of technical excellence, grit and integrity. You’ll find our compe

ter science or related field.

works (Kafka, Spark, …) and NoSQL/SQL databases.

our own hands.


cessfully implement them is essential.

and managing pipelines to process them.

numpy, scikit-learn).
ducibility tooling (git, conda, pip, docker).
TensorFlow, PyTorch), and parallel computing libraries (dask, numba) is a plus.

mplement and deploy end-to-end Machine Learning solutions for ourselves and our clients. You’ll have the opportunity to work on challen

at function as the backbone of our services.


Learning services.
platforms, frameworks and libraries.

tificial Intelligence.
ter Science or equivalent work experience.

n new things.

nd have a strong grasp on source control and reproducibility tooling (git, conda, pip, docker).
t frameworks (e.g. flask, django).
that support production systems and know their advantages and disadvantages (e.g. aws s3 vs aws MongoDB)

umba) and the SciPy stack (pandas, numpy, scikit-learn) is a plus.

ineering discipline from a top university


tics, using tools such as R or Python
h and discovering insights from data is acceptable in lieu of a BS/MS plus experience
es (PostgreSQL preferred), and good understanding of relational concepts
machine learning algorithms, regression analysis, etc.)
t be possible, and ability to visualize data in the best way possible
tions), and verbal
make compromises to enable scalability and productization of algorithms

oyable machine learning models


the right questions

stics and in machine learning (2 years of working or research experience is a must)


trong desire to learn and keep up with the latest technologies (an academic quantitative degree, preferably a PhD, earns more points)
rious machine learning algorithms (e.g. Gradient Boosting, Random Forest, SVM, Deep Neural Networks, RNN, CNN, Transformer, etc.)
tistical methods

commendation system, business acumen, blog writing

mize when necessary.


g appropriate recommendations and present results / findings to stakeholders
kly, monthly, quarterly & annual digital reporting
data sources

ts that help steer quality digital strategies


onships and grow understanding of partner’s present and future vision from an analytics and measurement perspective

l measurement process for both client and company

Commerce or similar field or equivalent

databases (i.e. SQL, mySQL)


rtising, SSPs, or DSPs is a plus
on software (Tableau, Datorama, etc.)
s (R, Python) is a plus
gle Analytics, Omniture, etc.) is a plus

dels, experiment and apply new ones, and apply analytics at scale in order to impact the business. Use the latest cloud and data technolog
ons in large datasets, and you are able to communicate that passion.
o find patterns and insights within structured and unstructured data.
hallenge and expand the thinking of everyone around you.
unities, growing and having fun while at it. You’re expected to stay ahead of the latest data science industry developments and coach the
hips within the company, and finding new business applications of data science and coaching more junior team members.

science/statistics/Physics with strong machine learning component

e, fast, object-oriented code


lems as machine learning problems, building datasets, selecting and engineering features, building and optimizing algorithms
d reinforcement learning (plus) machine learning algorithms
as Scikit-learn, Spark ML, perfection in at least one of these is highly desirable
op, Spark, and standard relational database structures along with query languages (SQL); experience in the Cloud a plus
ironment and excellent organisational skills - this is a fast-paced operation in a high-growth phase, so it's critical to be able to keep multipl
workflow in accordance with project timelines is required
o technical and business audiences, able to explain complex topics in simple terms, adapting communication technical depth to the audien
analytics to help drive recommendations; ability to demonstrate an understanding of the commercial impact of their activities
confidence intervals and A/B tests

orts to help improve how we i) collect and influence customer intent, ii) understand the relevant product options and iii) rank those option
ning, data mining and statistical modelling to design and implement mathematical models and algorithms to solve real-world applications
ence efforts – an area of major focus for the company
alysis on the website to find out the effectiveness of our efforts
technologies and techniques and identifying and advising how they can be utilized throughout the range of potential use cases.

d machine learning roles especially in Computer Vision, NLP or Recommender Systems


as Machine Learning/AI, Mathematics, Statistics, Computer Science, Economics or Engineering
rn to Rank, Recommender Systems, Personalisation, Classification, Deep Learning, Clustering, Regression, Predictive

such as Keras, Tensorflow or Spark MLlib. Experience with programming in Python or Scala
ding an ability to communicate across business areas

all aspects of analytics, test design and execution.


histicated data-driven analysis that result in specific action plans.

ers to design and execute site-wide monetization initiatives.


nformation, needed to make strategic decisions.
l models establishing objectives, key metrics, and test plans.

antitative discipline (Computer Science, Machine Learning, Operations Research, Applied Mathematics, Industrial Engineering, Statistics), P
n supply/demand dynamics, price elasticity, customer’s behavior, and competitive strategies is a plus.
on segmentation strategies and predictive modeling.
ands-on coding experience.

ta-driven decisions with intuition.


ti-terabyte data-sets.
and predictive models a plus).

ze revenue strategy
ment to efficiently provide effective support to growing insurance portfolio and highly-relevant custom experiences while maximizing reve
duct, marketing, and customer perspective
eliver additional insights
ds in our portfolio
financial, and statistical analysis
ming business needs

ess problems and partner with internal clients using consultative approach
rt raw data into actionable insights

power various aspects of highly complex business models


m, identify challenges, formulate the machine learning problem and provide/prototype solutions
derstand key business challenges and opportunities, and develop end-to-end software solutions using machine learning and optimization
s, program management, and engineering team to ensure that solution meet business needs, permit valid inferences and have functional
a; build new and improved techniques and/or solutions for data collection, management, and usage
anner to all levels of the company (field teams up to Snr. Management)

ative fields with background both machine learning (PhD preferred) and software development. 3+ years industry experience and hands o
kills in at least one low level language like C++/Java/C, and scripting languages like Python/R/Scala

orld problems using principled techniques and practices


cleaning, data visualization and modeling)
eeds, while developing warm relationships
municating results.
Transformation initiative.

st-paced, agile environment. Data Scientist will be required to innovate and establish as a subject matter expert and a thought leader.

he wealth of Customer, Network and Channel information available across Rogers business units, channels and care and marketing platfor

ghout the strategy, design, development and deployment stages


ses manage the customer base and improve relevant KPIs and performance metrics

ngineering, or other relevant technology field


g theory and Predictive Modeling Lifecycle
fication and regression) - tree based (Random Forest, Stochastic Gradient Boosting and eXtreme Gradient Boosting, Decision Trees, Extra T

as is an added advantage

prescriptive solutions and deploying these solutions in the cloud and/or on premise systems

solicit buy-in from the end user community

s and various data sets, you will apply machine learning and statistical modelling techniques to facilitate informed decision-making and bu
esigning innovative solutions to optimize processes and promote informed decision-making.

g into the next generation.

cs, and artificial intelligence


real time, unlocking resource base knowledge and improving planning for optimal resource extraction
21 and beyond.

ata analytics solutions. They apply complex and most current modelling techniques to existing data sets in order to find optimization and
data generation, feature engineering, model building, and performance evaluation)
omputer systems, and network devices.
y in achieving more inside and outside the office.

o software products.

ts or business directions.
tegrity and respect toward your colleagues (and everyone) that you express with helpfulness – you’re a team-player

deep learning libraries such as PyTorch / Tensorflow / Keras – Bonus if you are proficient in Spark
sis, Hierarchical Bayes; and Learning techniques such as Decision Trees, Boosting, Random Forests, Deep Learning,

of the following fields:


ct. You will integrate existing business rules into a knowledge based inference system, support the team in its debugging and testing proce

e-of-the-art algorithms and processes.


of-the-art innovative computing infrastructures. You will be a part of our Data Engineering Team. Since we believe in great teamwork, you
ntegrity. You’ll find our compensation and rewards competitive and of course, we have all the start up essentials: free coffee, nuts, fruits, a

opportunity to work on challenging and rewarding problems, including:


a PhD, earns more points)
N, CNN, Transformer, etc.)

atest cloud and data technologies to train and deploy machine learning models at scale. If you can prove your approaches are good, they'll

developments and coach the team on them.


am members.

mizing algorithms

Cloud a plus
tical to be able to keep multiple balls in the air.
technical depth to the audience
of their activities

tions and iii) rank those options


o solve real-world applications

potential use cases.

ng, Regression, Predictive Modelling, Numeric Optimization etc.

strial Engineering, Statistics), PhD degree preferred.

riences while maximizing revenue opportunities


ine learning and optimization methods.
ferences and have functional feasibility

dustry experience and hands on experience building machine learning models.


ert and a thought leader.

and care and marketing platforms. Collaborate with business primes from Wireless and Residential business to develop range of predictive

oosting, Decision Trees, Extra Trees, Regularized Greedy Forests), Generalized Linear Models, Discriminant models (LDA, MDA, FDA and QD

ormed decision-making and business process optimization. Moreover, designing and implementing end-to-end machine learning products
rder to find optimization and or improvement opportunities relevant to the context of the product being developed.
m Forests, Deep Learning, Neural Networks
ts debugging and testing procedures, and help optimizing our machine learning models and decision logic.
elieve in great teamwork, you must be eager to learn and bring an energetic and creative approach to work. We are looking for someone l
tials: free coffee, nuts, fruits, a ping pong table in Antwerp, and often home baked goods. Better yet, expect an agile and flat structure, dy
r approaches are good, they'll be quickly deployed to production.
to develop range of predictive models using advance Machine Learning and AI techniques and make business recommendations to

odels (LDA, MDA, FDA and QDA), and Unsupervised learning models (Isolation Forest, Clustering algorithms)

nd machine learning products will be part of your journey.


We are looking for someone like you if you are convinced that a fast-paced, high growth working environment would fit your skills.
t an agile and flat structure, dynamic growth opportunities, flexibility, and an openness for the curious.
s recommendations to
ent would fit your skills.
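Bayesian Russia + Duke Bayesian Statistics very very important + University of California, Santa Cruz (mcmc-bayesian-statistics) (Bayesian Statistics: From Concept to Data Analysis) + Improving your statistical inferences (Eindhoven University of Technology)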

Month-4
Time series
Feature Engineer + Feature Selection
ensemble project Packt
Johns Hopkins
Projects >>

Month-5
PGM Coursera + CMU
http://www.cs.cmu.edu/~epxing/Class/10708-14/lecture.html

https://www.cs.cmu.edu/~epxing/Class/10708-20/lectures.html

https://www.cs.cmu.edu/~epxing/Class/10708-19/lectures/

https://www.youtube.com/playlist?list=PLoZgVqqHOumTqxIhcdcpOAJOOimrRCGZn

Apart from the MOOC by Daphne Koller as mentioned by Shimaa, you can look at the following courses on PGMs:

1. Machine Learning and Probabilistic Graphical Models by Sargur Srihari from University at Buffalo. You can find the video lectures and slides at this link: http://www.cedar.buffalo.edu/~sr...

2. Probabilistic Graphical Models by Andreas Krause from Caltech. You can find the slides at this link: http://courses.cms.caltech.edu/c...

3. Probabilistic Graphical Models by Eric Xing from CMU. Slides at this link: http://www.cs.cmu.edu/~epxing/Cl...

4. Probabilistic Graphical Models by David Sontag from NYU. Slides at this link: http://cs.nyu.edu/~dsontag/cours...
Cruz (mcmc-bayesian-statistics) (Bayesian Statistics: From Concept to Data Analysis) + Improving your statistical inferences (Eindhoven University of Technology)
Time series analysis (Penn State College of Science)
https://online.stat.psu.edu/stat510/lesson/11/11.1
https://medium.com/auquan/time-series-analysis-for-finance-arch-garch-models-822f87f1d755
https://www.twirpx.com/file/2941628/grant/
https://www.kodges.ru/
https://download.csdn.net/download/weixin_39516246/10944826
https://litmy.ru/knigi/programming/
Top Courses :
Udacity
Coursera
Udemy
Courses: Advanced Skills: YouTube Channels
Stanford 2018 (2020 ) Game theory (https://ozonm Saptarsi Goswami
CMU Discrete Optimization Hisham hallag Valeo
UoT Discrete Models Baghdad Student
BC Stochastic analysis Hourani
Learn from Data Information theory
Cornewall Optimization and applicatio Springboard India
Hisham algorithms Abhishek Thakur
Coursera Washin PGM INSAID
Applied AI distributed Computing Super Data Science
H2O practical machine learning Coursera Algorithmic game theory a 365 Data Science
Yandex Big Data for machine learning Modern statistics AI Engineering
https://courses.analyticsvidhya.com/ Uncertainty optimization an Krish Naik
Udacity ML Edureka
Advanced ML Coursera Russia HSE University Simplilearn
Michigan upGrad
Johns Hopkins Kevin Markham
IBM 3Blue1Brown
365 Data Science Very important StatQuest
Alerta Optimizing sentdex
Google Cloud Great Learning
New York for finance ?
Edureka
edX Columbia
edX Harvard
Udemy
Udacity Courses:
Data Analyst
Data Scientist
ML Done
Deep Learning
Self-Driving Car
Time series Udemy 365 Data Science
https://ozonmasters.ru/ml
Pluralsight very important Practical
Russia Course : ML | Russia Course : Big Data | RL
Introductory lecture | Course program | https://dzone.com/articles/5-best-reinforcement-learning
Course structure, reporting, lecturer, | HDFS Hadoop Distribute | Practical Reinforcement Learning (Russia Coursera)
| Basic principles of HDFS a | Reinforcement Learning in Finance + advanced (NYU ) Cour
Keywords: | NameNode and DataNode.
| The concept of a block.
Data Science | Replication and fault tolerance.
Statistics | The process of reading a file.
Artificial Intelligence | The process of writing a file.
Data Mining | Cluster topology and proximity concept.
Machine Learning | MapReduce
Big Data | Introduction to the MapReduce paradigm.
| MapReduce program using command line utilities.
Statement of the main tasks of machineMapReduce in Hadoop. The concept of mapper and reducer.
Teaching with a teacher (with markedData flow, data locality.
Target function Computing optimization, function combiners.
An object Hadoop streaming in Python.
Label Hive. Database Management System on Top of Hadoop
Classification Hive architecture and comparison with traditional DBMS.
Forecasting HiveQL query language.
Object Space Managed and external tables.
Feature space Partitions and bucket.
Feature extraction Storage formats.
Task visualization Custom Functions and UDF.
Error Functions Hive streaming in Python.
Empirical risk Introduction to Apache Spark
Training sample Why do we need Spark? What is the problem of Apache Hadoop?
Learning Optimization Tasks Spark components and a brief history of development.
Algorithm Model Spark and SparkContext architecture.
Algorithm Introduction to RDD. Resilient distributed dataset.
Training Two types of operations and lineage graph.
Generalization ability Caching
The scheme for solving the problem of Pair RDDs, joins and aggregations.
How are tasks solved Broadcast variables and accumulators.
Unsupervised learning (unlabeled data) Spark SQL
Semi-supervised learning The motivation for creating Spark SQL, recalling RDDs.
Transductive learning How to create a DataFrame?
Reinforcement learning Why do we need a schema?
Structured prediction Spark SQL overview. Projections and selections.
Active learning Built-in functions.
Online learning Joins and introduction to query plan analysis.
Transfer learning Counting aggregates and statistics.
Multitask learning Custom functions.
Feature learning Work with time and window functions.
Machine Learning Issues Spark program optimization
Examples of model problems Program execution model.
Shuffle, partitioning.
Mathematics in Machine Learning: A BrShuffle, serialization.
Occam's razor Optimization of user-defined functions.
No Free Lunch theorem Catalyst Query Optimizer.
Soccer oracle Optimization examples.
Tweaks Details of join algorithms.
Defining Distributions Optimization of joins in Spark.
Mean and deviation Spark ML. Classification and Regression
Conditional density, marginalization an Vector and matrix operations.
Point estimation Distributed matrices and SVD computation.
Maximum likelihood estimation ML pipeline, architecture and components.
Kullback-Leibler divergence Overview of Kaggle toxic comments challenge.
Covariance and correlation Building a baseline solution.
Density estimation Working with imbalanced samples.
Histogram approach Feature engineering in Spark ML.
Parzen window approach Calculation of quality metrics and cross-validation.
Normal distribution Spark ML. Clustering and ALS
Central Limit Theorem K-Means Algorithm in Spark.
Information theory K-means to improve Kaggle toxic comments challenge.
The curse of dimensionality Topic modeling, LDA, further improvement of the Kaggle toxic comments challenge.
Singular value decomposition (SVD) ALS as a least-squares method with latent variables.
Matrix differentiation Building a recommendations pipeline using ALS.
Optimization Industrial Spark ML
Unconstrained optimization methods Pipeline Overview.
Zero-Order Methods Estimators and transformers.
First order methods Custom estimator.
Second order methods Custom transformer.
Gradient descent Scikit-learn integration in the Spark ML pipeline.
Steepest gradient descent Integration of XGBoost in the Spark ML pipeline.
Stochastic gradient descent Distributed hyperparameter search.
Training: Batch, online, mini-batch Cross validation
Gradient descent in machine learning Structured streaming
Stationary points Introduction to streaming data processing.
Newton's method Distributed fault-tolerant Apache Kafka message bus.
Quasi-Newton methods Structured streaming in Apache Spark and data delivery semantics.
Constrained optimization Structured streaming and Spark ML. We are building an antifraud pipeline for real-time data.
NoSQL and Apache Cassandra
Metric algorithms Why NoSQL is needed, compare with relational databases.
Metric Algorithms (distance-based) Introduction to Apache Cassandra.
Nearest centroid algorithm The concept of a node.
Proximity Based Approach The concept of the ring.
kNN in the classification problem Keys for clustering and partitioning.
kNN in the regression problem Writing data.
Justification of 1NN Reading data.
Lazy and eager algorithms Gossip protocol.
Weighted generalizations of kNN Data compaction.
Various metrics: Minkowski, Euclidean Cassandra CQL query language.
Applications of the metric approach: fuzzy table matching, Lencore, in DL, classification of texts
Effective Nearest Neighbor Search Techniques
Nadaraya-Watson Regression
Quality control and model selection
Quality control problem
Model Selection in the broadest sense
Sampling Rules
Hold-out validation (held-out data, hold-out set)
Cross-validation
Bootstrap
Out-of-time validation (out-of-time control)
Local control
Learning Curves
Enumeration of parameters

Linear methods
Linear regression
Generalized linear regression
Matrix degeneracy problem
Regularization. The main types of regularization
Ridge Regression
LASSO (Least Absolute Shrinkage and Selection Operator)
Elastic net
Feature Selection
Weighted error
Robust regression
Linear scoring models in the binary classification problem
Logistic Regression
Probit Regression
Multiclass Logistic Regression
Linear Classifier
Perceptron
Bounding the error function with a smooth surrogate function
SVM

Nonlinear methods
Linearity problem
Polynomial model
Kernel methods (kernel trick)
Kernel examples
Usage in SVM
Regression Use
Kernelization
Mathematics of kernels
RBF, RBF networks

Decision trees
Decision Trees (CART)
Predicates / Branches
Tree answers
Splitting criteria in classification problems: misclassification error, entropy, Gini
Stopping criteria when building trees
Overfitting problem for trees
Pruning (post-pruning)
Classic algorithms for constructing decision trees: ID3, C5.0
Feature importance
Missing values
Categorical features
Comparison: trees vs linear models
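
Worked example (a minimal sketch, not taken from the course material): the three classification impurity criteria named in this section, plus the impurity decrease of a candidate split. All function names here are illustrative assumptions.

```python
from collections import Counter
from math import log2


def class_proportions(labels):
    """Return the share of each class among the labels in a node."""
    counts = Counter(labels)
    total = len(labels)
    return [count / total for count in counts.values()]


def misclassification(labels):
    """Misclassification error: 1 minus the share of the majority class."""
    return 1.0 - max(class_proportions(labels))


def entropy(labels):
    """Shannon entropy of the class distribution, in bits."""
    return -sum(p * log2(p) for p in class_proportions(labels))


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class shares."""
    return 1.0 - sum(p * p for p in class_proportions(labels))


def split_gain(parent, left, right, impurity):
    """Impurity decrease obtained by splitting `parent` into `left` and `right`."""
    n = len(parent)
    weighted = (len(left) / n) * impurity(left) + (len(right) / n) * impurity(right)
    return impurity(parent) - weighted


if __name__ == "__main__":
    node = ["a", "a", "b", "b"]
    print(gini(node), entropy(node), misclassification(node))
    print(split_gain(node, ["a", "a"], ["b", "b"], gini))
```

A tree builder would evaluate `split_gain` for every candidate predicate and keep the one with the largest decrease.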

Ensembles
Algorithm ensembles: examples and rationale
Committees (voting) / averaging
Bagging
Encoding / Transcoding Responses, ECOC
Stacking and Blending
Boosting: AdaBoost, Forward stagewise additive modeling (FSAM)
Manual methods
Homogeneous ensembles

Random forest
Universal methods
Random forest
OOB (out of bag)
Setting Method Parameters
Regions of stability
Feature importance
Boruta
ACE
RF-computed proximity
Extremely randomized trees

Gradient boosting
Gradient boosting over trees
Gradient boosting iteration
Steepest descent
Step-shortening heuristic: shrinkage
Stochastic gradient boosting
Advanced optimization techniques
Modern gradient boosting implementations
Built-in control methods
Gradient boosting options
Case: Scoring Task (TKS)
Calibration
Case: Predicting Answers to Questions

Algorithm complexity, overfitting, bias, and variance
Generalization problem
Overfitting
Underfitting
Algorithm complexity
Bias and variance
Ways to deal with overfitting

Bayesian approach
Bayes formula
The optimal solution to classification problems
Minimizing average risk
Naive Bayes
Bayesian Machine Learning Approach
Maximum likelihood method

Maximum likelihood method + Bayesian approach by example
MAP
Features of the Bayesian approach
Bayesian theory for linear regression
Logistic Regression
Bayesian point estimates
Bayesian Interval Estimates
RVM

Clustering
Clustering task, types of clustering
k-means (Lloyd's algorithm)
Generalizations of k-means
Clustering Model Problems
Affinity propagation: clustering by message passing between points
Mean Shift: Density Mode Detection
Hierarchical clustering
Linkage Types
Minimum Spanning Tree Clustering
Spectral clustering
DBSCAN
BIRCH
CURE
Generative Models
EM
Gaussian Mixture Model (GMM)

Unsupervised learning
UL Tasks
Dimensionality reduction
PCA
Nonlinear dimensionality reduction
Kernel PCA
t-SNE
Noise Reduction
Data Generation

Anomaly Detection
Outlier and novelty detection (anomaly detection).

The concepts of “outlier” and “novelty”, “noise”, “anomaly”.

Anomaly detection task applications.

Anomaly detection methods: statistical tests, model tests, iterative methods, metric methods, task substitution methods, machine learning methods (one-class SVM, isolation forest, ellipsoidal approximations), ensembles of algorithms.
Association rules
Basic terms

Apriori AP
Recommender Systems
Recommender systems

Personalization, online and offline recommendations

Content recommendation (content based methods)

Collaborative filtering: GroupLens algorithm, SVD, SVD ++, timeSVD ++, adaptation of SVD for social connections

One-class recommendation

Factorization machine, factorization machine with fields (FFM - field- aware factorization machine)

Simple Recommendation Methods: FPM - Frequent Pattern Mining

Knowledge-based Recommendations
Estimates of the mean, probability, and density. Weight schemes
Definition of the average: arithmetic mean, median, mode, average according to A.N. Kolmogorov, Cauchy average.

Multidimensional median, average as a solution to an optimization problem, handling outliers.

Minimum contrast estimation.

Probability estimate. Maximum Likelihood Method. Laplace smoothing.
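
Worked example (a minimal sketch, not taken from the course material): category probabilities estimated by maximum likelihood and with Laplace (additive) smoothing. The function name and the `alpha` parameter are illustrative assumptions; `alpha=0` gives the plain maximum likelihood estimate.

```python
from collections import Counter


def smoothed_probabilities(observations, categories, alpha=1.0):
    """Estimate P(category) as (count + alpha) / (n + alpha * number_of_categories).

    alpha = 0 is the maximum likelihood estimate (relative frequency);
    alpha = 1 is classic Laplace smoothing, which never assigns zero
    probability to a category that was not observed.
    """
    counts = Counter(observations)
    denominator = len(observations) + alpha * len(categories)
    return {c: (counts[c] + alpha) / denominator for c in categories}


if __name__ == "__main__":
    data = ["a", "a", "b"]
    print(smoothed_probabilities(data, ["a", "b", "c"], alpha=0.0))
    print(smoothed_probabilities(data, ["a", "b", "c"], alpha=1.0))
```

With smoothing, the unseen category "c" gets a small non-zero probability instead of zero, which matters whenever the estimates are later multiplied together (as in Naive Bayes).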

Weight schemes.

Case "zodiac scoring": sample sizes, stability patterns.

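Case "forecasting visits of supermarket buyers and the amounts of their purchases": matrix of visits, estimation of the probability of visits by recounting, weighted schemes in assessing the probability of visits, direct method for assessing the probability, ensemble.
Density estimation, weighted nonparametric methods, predicting the amount of purchases with their help, solving the joint estimation problem.

Case "traffic jam problem": two-dimensional averaging, Nadaraya-Watson smoothing.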
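
Worked example (a minimal sketch, not taken from the course material): one-dimensional Nadaraya-Watson smoothing, where the prediction at a query point is a kernel-weighted average of the observed targets. The Gaussian kernel, the function names, and the `bandwidth` parameter are assumptions chosen for illustration.

```python
from math import exp


def gaussian_kernel(u):
    """Unnormalized Gaussian kernel; the constant cancels in the ratio below."""
    return exp(-0.5 * u * u)


def nadaraya_watson(x_query, xs, ys, bandwidth=1.0):
    """Kernel-weighted average of ys, with weights decaying with distance to x_query."""
    weights = [gaussian_kernel((x_query - x) / bandwidth) for x in xs]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("all kernel weights are zero; increase the bandwidth")
    return sum(w * y for w, y in zip(weights, ys)) / total


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0]
    ys = [0.0, 1.1, 1.9, 3.2]
    print(nadaraya_watson(1.5, xs, ys, bandwidth=0.5))
```

A small bandwidth follows the data closely; a large one approaches the global mean of the targets.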


Visualization art
Rationale for visualization: Anscombe's quartet

History of data analysis and infographics: Joseph Priestley, William Playfair, Charles Joseph Minard, Florence Nightingale, John Snow.

What kind of data game?

Examples of poor visualizations: 3D graphics, pie charts.

Recommendations on choosing the scale of graphs and scales, explanatory text, color and style of images, presentation of numerical values, use of different visualization tools.

Visualization goals.

Descriptive statistics: average, characteristic elements, spread of values, absolute variations, relative variations, moments, standardized moments.

An example of visualization of descriptive statistics. The study of the parts of the sample (folds), visualization of the importance of features, primary actions in the analysis of a feature.

Visualization of individual features: charts, histograms, distribution densities, selection of the number of bins, transformation of features.

Visualization of categorical features: histograms, pie charts and areas, clarification of the nature of the feature.

Visualization of a pair of features: correlation, dependence of features, independence of features, typical values, outliers, clusters. Scatter chart. Using noise for visualization. Pivot tables, triangular dependencies.

Visualization of a “real-valued feature” vs. “categorical feature” pair.

Comparison with the benchmark.

Visualization “algorithm response” vs. “algorithm response”. Visualization “algorithm response” vs. “feature”. Deformations of responses and features. Residual plot.

Correlations. 3D renderings. Dummy-renderings. Verification of train-test compliance. Aggregation.


Error Functions / Quality Functionals
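Error functions in the regression problem: mean absolute error MAE (MAD), mean squared error MSE, its derivatives: RMSE, coefficient of determination R2, probabilistic and non-probabilistic justification of RMSE, Huber function, Logcosh, generalization of MAE and RMSE, percentage error functions (SMAPE, MAPE, PMAD), errors based on comparison with the benchmark (MRAE, REL_MAE, PB), normalized errors (MASE), asymmetric errors, errors accurate to the threshold, the use of error functions to generate features.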
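
Worked example (a minimal sketch, not taken from the course material): several of the regression error functions named above, written in plain Python. Function names are illustrative assumptions, and the SMAPE formula shown is one common variant among several in use.

```python
from math import sqrt


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the target."""
    return sqrt(mse(y_true, y_pred))


def mape(y_true, y_pred):
    """Mean absolute percentage error; assumes no true value is zero."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


def smape(y_true, y_pred):
    """Symmetric MAPE (variant with |t| + |p| in the denominator); assumes no 0/0 pair."""
    terms = (2.0 * abs(t - p) / (abs(t) + abs(p)) for t, p in zip(y_true, y_pred))
    return 100.0 * sum(terms) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot; assumes non-constant y_true."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    actual = [1.0, 2.0, 3.0]
    predicted = [1.0, 2.0, 5.0]
    for metric in (mae, mse, rmse, mape, smape, r2):
        print(metric.__name__, metric(actual, predicted))
```

A single large miss moves MSE and RMSE much more than MAE, which is the usual reason to compare both.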

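Quality in classification tasks:

“Confusion Matrix” (error / inconsistency matrix), Accuracy (MCE), Type I and Type II errors, recall (TPR), specificity (TNR), precision, FPR (False Positive Rate), F1 measure, Cohen's Kappa, Weighted kappa, Matthews correlation coefficient (MCC).

Quality in binary classification problems with the answer in the form of probability, scoring errors: logistic error function Log Loss, MSE, Misclassification Loss, Exploss; Hinge loss; AUROC, Gini (Lorenz curve).

Quality in multiclass problems: Hamming Loss, cross-entropy, Mean Probability Rate, MSE, MAE, averaging, generalization of F-measures, balanced accuracy. Different types of averaging of quality: macro, micro, weighted, per object.

Quality in recommendation tasks: precision on the first n elements, average precision on the first n elements, MAP, Concordant - Discordant ratio, Mean Reciprocal Rank (MRR), Cumulative Gain, Discounted Cumulative Gain (DCG), nDCG, Expected reciprocal rank (ERR).

Edit distance.
Quality in a task with interval target values: Jaccard coefficient, Szymkiewicz-Simpson coefficient, Braun-Blanquet coefficient, Sörensen coefficient, Kulczynski coefficient, Ochiai coefficient, inclusion measures.

Ways to configure specific error functions. Construction of a splitting criterion for optimizing AUC ROC. Tasks with an interval attribute. Minimizing Root Mean Square Percentage Error (RMSPE) with deformations. Derivation of gradient descent formulas for basic error methods and functions.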
Data preparation
Fundamental data properties.

Types of data.

Data preprocessing.

Data Cleaning: anomalies / outliers, missing values, noise, incorrect values.

Data Reduction: sampling, dimensionality reduction, feature selection.

Data Transformation: renaming features, objects, feature values, type conversion; coding of categorical variable values; sampling; normalization; smoothing; feature creation; aggregation; generalization; deformation of values.

Data integration.
Feature generation
Types of numerical features.

Contextual features.

Service features.

Data leakage.

Oddities in the data.

Using EDA to generate features.

Real-valued features.

Temporal features (characteristics of time points, interaction of a pair of features, use for other features, use to generate features, use to refine the task).

Geographical (spatial) features: Spatial Variables. (projections on different axes, clustering, identification, binding, characteristics of the neighborhood, analysis of trajectories, deanonymization of data, use of context and the study of oddities, distance generation and use for other features).

Processing categorical features


Advanced skills: Coursera + University + Book + YouTube
Game theory (https://ozo
Discrete Optimization
Discrete Models
Stochastic analysis
Information theory
Optimization and applications 2
algorithms
PGM
distributed Computing
Algorithmic game theory and mechanism design
Modern statistics
Uncertainty optimization and risk modeling
I created a roadmap for exploring machine learning in 10 days. Of course, you would want to dig deeper into each of the topics listed to have a working knowledge of them:

Day 1:

Basic terminology:

a. Most common settings: Supervised setting, Unsupervised setting, Semi-supervised setting, Reinforcement learning.

b. Most common problems: Classification (binary & multiclass), Regression, Clustering.

c. Preprocessing of data: Data normalization.

Concepts of hypothesis sets, empirical error, true error, complexity of hypotheses sets, regularization, bias-variance trade-off, loss functions, cross-validation.

Day 2:

Optimization basics:

a. Terminology & Basic concepts: Convex optimization, Lagrangian, Primal-dual problems, Gradients & subgradients, ℓ1 and ℓ2 regularized objective functions.

b. Algorithms: Batch gradient descent & stochastic gradient descent, Coordinate gradient descent.
c. Implementation: Write code for stochastic gradient descent for a simple objective function, tune the step size, and get an intuition of the algorithm.
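
One possible starting point for the implementation exercise above (a minimal sketch, not part of the original roadmap): stochastic gradient descent on a one-dimensional least-squares fit y ≈ w·x + b. The objective, the function name, and the default `step_size` and `epochs` values are assumptions chosen for illustration.

```python
import random


def sgd_linear_fit(xs, ys, step_size=0.05, epochs=200, seed=0):
    """Fit y = w * x + b by SGD on the per-sample loss 0.5 * (w * x + b - y) ** 2."""
    rng = random.Random(seed)  # fixed seed so that runs are repeatable
    w, b = 0.0, 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # visit the samples in a new random order each epoch
        for i in indices:
            error = (w * xs[i] + b) - ys[i]
            # Gradient of the per-sample loss: d/dw = error * x, d/db = error.
            w -= step_size * error * xs[i]
            b -= step_size * error
    return w, b


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0, 4.0]
    ys = [2.0 * x + 1.0 for x in xs]  # noise-free line with slope 2, intercept 1
    for step in (0.001, 0.05):
        print(step, sgd_linear_fit(xs, ys, step_size=step))
```

Comparing a very small step size with a moderate one shows the trade-off the exercise asks about: too small converges slowly, while a step that is too large for the scale of x makes the updates diverge.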

Day 3:

Classification:
a. Logistic Regression

b. Support vector machines: Geometric intuition, primal-dual formulations, notion of support vectors, kernel trick, understanding of hyperparameters, grid search.

c. Online tool for SVM: Play with this online SVM tool (scroll down to “Graphic Interface”) to get some intuition of the algorithm.

Day 4:

Regression:

a. Ridge regression

Clustering:

a. k-means & Expectation-Maximization algorithm.

b. Top-down and bottom-up hierarchical clustering.
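
To make the k-means item above concrete, here is a minimal sketch of Lloyd's algorithm in plain Python (not part of the original roadmap). The function names, the random initialization from the data points, and the iteration cap are illustrative assumptions.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(points, k, iterations=100, seed=0):
    """Lloyd's algorithm: alternate assignment and centroid update until stable."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialize with k distinct data points
    assignment = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_assignment = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_assignment == assignment:
            break  # no point changed cluster, so the algorithm has converged
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = tuple(sum(coord) / len(members) for coord in zip(*members))
    return centroids, assignment


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
    print(k_means(data, 2))
```

The Expectation-Maximization algorithm for Gaussian mixtures follows the same two-step pattern, with soft responsibilities in place of hard assignments.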

Day 5:

Bayesian methods:

a. Basic terminology: Priors, posteriors, likelihood, maximum likelihood estimation and maximum-a-posteriori inference.

b. Gaussian Mixture Models

c. Latent Dirichlet Allocation: The generative model and basic idea of parameter estimation.
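
To illustrate the difference between maximum likelihood estimation and maximum-a-posteriori inference from item a, here is a minimal coin-flip sketch (not part of the original roadmap). The Beta(a, b) prior and the function names are assumptions chosen for illustration.

```python
def bernoulli_mle(heads, tosses):
    """Maximum likelihood estimate of P(heads): the observed frequency."""
    return heads / tosses


def bernoulli_map(heads, tosses, a=2.0, b=2.0):
    """MAP estimate of P(heads) under a Beta(a, b) prior.

    The posterior is Beta(heads + a, tosses - heads + b); its mode is
    (heads + a - 1) / (tosses + a + b - 2), valid when both posterior
    parameters are greater than 1.
    """
    return (heads + a - 1.0) / (tosses + a + b - 2.0)


if __name__ == "__main__":
    # Three heads in three tosses: MLE claims the coin always lands heads,
    # while the prior pulls the MAP estimate back toward 0.5.
    print(bernoulli_mle(3, 3), bernoulli_map(3, 3))
```

With a uniform prior (a = b = 1) the MAP estimate coincides with the MLE, and the influence of any fixed prior fades as the number of tosses grows.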

Day 6:
Graphical models:

a. Basic terminology: Bayesian networks, Markov networks / Markov random fields.

b. Inference algorithms: Variable elimination, Belief propagation.

c. Simple examples: Hidden Markov Models. Ising model.

Days 7–8:

Neural Networks:

a. Basic terminology: Neuron, Activation function, Hidden layer.

b. Convolutional neural networks: Convolutional layer, pooling layer, Backpropagation.

c. Memory-based neural networks: Recurrent neural networks, Long short-term memory.

d. Tutorials: You can follow a tutorial for one deep learning framework (e.g. TensorFlow-Examples or pytorch/examples).

Day 9:

Miscellaneous topics:

a. Decision trees

b. Recommender systems

c. Markov decision processes

d. Multi-armed bandits
Day 10: (Budget day)

You can use the last day to catch up on anything left from previous days, or learn more about whatever topic you found most interesting / useful for your future work.
I think the best three books for doing research on machine learning:

1. Machine Learning: a Probabilistic Perspective by Kevin Patrick Murphy

2. Pattern Recognition and Machine Learning by Christopher Bishop

3. The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani & Jerome Friedman https://web.stanford.edu/~hastie...

Also, there are other books which may also be helpful to understand and support research on machine learning and related fields:

1. Convex Optimization by Stephen Boyd and Lieven Vandenberghe

2. Data Mining: Concepts and Techniques by Jiawei Han, Micheline Kamber & Jian Pei

3. Introduction to Data Mining by Pang-Ning Tan, Michael Steinbach, Anuj Karpatne & Vipin Kumar

4. Introductory Lectures on Convex Optimization: A Basic Course by Yurii Nesterov

https://hackernoon.com/the-best-machine-deep-learning-books-e1bcec2c0a17

https://blog.floydhub.com/

https://blog.floydhub.com/a-pirates-guide-to-accuracy-precision-recall-and-other-scores/

https://blog.floydhub.com/guide-to-hyperparameters-search-for-deep-learning-models/
1- action oriented 4
2-self development 4
3-customer focus 3.5
4-plan & aligns 3
5-communication effective 3.5

Presentation Skills: Speechwriting, Slides and Delivery Specialization


Write Professional Emails in English
personal development
PMP
Global Telecom Business
mini-MBA >> Executive MBA

How to become a Data Scientist?

How to take a Master's in ML at Imperial College London.

Canada :
experience + 3G+4G+5G + Data Eng + ML + Big Data
P.E
Master
MBA
Start :
initiative 1-Hisham Asem
T.T 2- udacity
Ok 3-
Ok 4-
A.I Master
ML
Deep Learning
Time series Analysis
IoT
Blockchain
Agenda of Meeting
strategic vision for 5G
5G innovations and vision
5G Large Scale Deployment in MENA
5G Core and network Slicing
RAN Evolution
Roadmap to mobile 5G
technical difference between 4G and 5G Slicing
5G Business Models & Monetisation
5G IoT baseband Software developer
C + C++ + Python
MWC Los Angeles Topics
Asia Singapore Topics
AI + ML + DL 3G + 4G
1-Helwan important Math Topics + Old man 3 Lecture Alex 3G ready
2-Andrew ML Alex 4G ??
3-Andrew DL
4- DataCamp Time Series Analysis
5-Udemy Time Series Analysis
6- Hisham Asem
7-Seraj
8-LinkedIn Data Scientist
9-Mutaz
10-Husam Hourani
11-Big Data
12-Stanford & MIT ( machine learning )
MIT >> Math
Walid Yousef >> probability
ML Andrew
ML >> Yaser Abu-Mostafa
Optimization and Algorithms

Hesham + Hourani + Mutiz >> datacamp (All Python Courses)


https://www.datacamp.com/tracks/data-scientist-with-python https://www.datacamp.com/courses/tech:python
111 Course ?? https://www.datacamp.com/tracks/data-analyst-with-python
data_manipulation
data visualization Data Scientist
probability and statistics Data Analyst

ML
Reporting
case studies

ML Path Study :: DL Path Study

ML Hisham andrew
ML Udacity Udacity
ML Udemy Udemy
Applied Machine Learning Washington
Projects & Idea ( Optimization )

1- Easy to more advance


Traffic with SQI & Congestion & Events
Power Control
Handover analysis
EDGE Throughputs

data manipulation
Web Scraping in Python
5G software Developer
C
C++
Data Structure Walid Yousef
algorithms
Statistics
Data analysis
learn Data

datacamp-course-roadmap
Career tracks
Data Scientist Data Analyst
Introduction to Python Introduction to Data Science in Python

Intermediate Python for Data Science Intermediate Python for Data Science
Python Data Science Toolbox (Part 1) Python Data Science Toolbox (Part 1)
Python Data Science Toolbox (Part 2) Intro to SQL for Data Science

Importing Data in Python (Part 1) Joining Data in SQL

Importing Data in Python (Part 2)


Introduction to Relational Databases in SQL
Cleaning Data in Python Importing Data in Python (Part 1)

pandas Foundations Importing Data in Python (Part 2)


Manipulating DataFrames with pandas Cleaning Data in Python
Merging DataFrames with pandas pandas Foundations
Analyzing Police Activity with pandas Manipulating DataFrames with pandas
Intro to SQL for Data Science Merging DataFrames with pandas
Introduction to Relational Databases in SQL
Analyzing Police Activity with pandas

Introduction to Data Visualization with Python


Introduction to Data Visualization with Python
Interactive Data Visualization with Bokeh Statistical Thinking in Python (Part 1)
Statistical Thinking in Python (Part 1) Statistical Thinking in Python (Part 2)
Statistical Thinking in Python (Part 2)
Joining Data in SQL
Introduction to Shell for Data Science
Conda Essentials
Supervised Learning with scikit-learn
Machine Learning with the Experts: School Budgets
Unsupervised Learning in Python
Machine Learning with Tree-Based Models in Python
Deep Learning in Python
Network Analysis in Python (Part 1)
Object-Oriented Programming in Python
Machine Learning Case Studies

Clustering Methods with SciPy


Data Science with Python Big Data
SQL For Data Analytics https://www.simpliv.com/sql/sql-masterclass-sql-for-data-analytics?fbclid=IwAR36GakfmQrMoMrpq0ec2Okw3MmAPXsUnCewF8BgKfjQxRRBhmDQq7LnTz0
datacamp (All Python Courses)
coursera (Python for Data Science and AI)

Skill Projects Coursera


advance Topics Python Track GET REAL LIFE E Applied Dat
Machine Learning with Apache Spark

Streamlined Data Ingestion with pandas

Forecasting Using ARIMA Models in Python


sentiment-analysis-in-python-by-violeta-misheva
probability & statistics

very important pluralsight


https://www.coursera.org/specializations/data-science-python Building Machine Learning Models in P
There are 5 Courses in this Specialization Machine Learning with XGBoost Using scikit-learn i

Introduction to Data Science in Python Understanding Machine Learning with Python


Applied Plotting, Charting & Data Representation in Python Getting Started with Natural Language Processing w
Applied Machine Learning in Python Python Fundamentals

Applied Text Mining in Python advance python

Applied Social Network Analysis in Python


Using Python to Access Web Data

Using Databases with Python

TensorFlow in Practice Specialization

This course is part of the Applied Data Science with Python Specialization
Johns Hopkins University 10 Courses

Excel to MySQL: Analytic Techniques for Business Specialization 5 Courses

Business Analytics Specialization 5 Courses

Advanced Machine Learning Specialization

Creating Dashboards and Storytelling with Tableau


Data Visualization with Tableau Specialization

Big Data Specialization https://www.coursera.org/specializations/big-data


Big Data for Data Engineers Specialization (Yandex) https://www.coursera.org/specializations/big-data-engineering
Data Mining Specialization

Cloud Computing Specialization

Tableau
Fundamentals of Scalable Data Science (IBM)

cloudera KhanAcademy Kaggle Learn Microsoft stanford


intro to hadoop and mStatisitics and probability NOSQL (EDX) SPARK
GBoost Using scikit-learn in Python Analytics storytelling Deep Learning

earning with Python


ral Language Processing with Python

TensorFlow in details
Mit Udacity analyticsvidhya
Predictive Analytics for Business applied-machine-learning-beginner-to-prof
Deep Learning Become a Data Scientist natural-language-processing-nlp

Become a Machine Learning EngiCertified Machine Learning Master's Progr


Become a Deep Reinforcement Learning Expert
Become a Natural Language ProceLoan Prediction Practice Problem (Using Python)

relational database Creating Time Series Forecast using Python


FREE COURSE
Intro to Descriptive
Statistics
Become a Data Analyst

Intro to Inferential Statistics


Making Predictions from Data
Udemy statquest courses.anal
Unsupervised Deep Learning in Python Full Year Roa
Cutting-Edge AI: Deep Reinforcement Learning in Python

Recommender Systems and Deep Learning in Python


Natural Language Processing with Deep Learning in Python

Machine Learning A-Z™: Hands-On Python & R In Data Science

Python for Data Science and Machine Learning Bootcamp


Deep Learning A-Z™: Hands-On Artificial Neural Networks

Learn MongoDB : Leading NoSQL Database from scratch


Data Science A-Z™: Real-Life Data Science Exercises Included
The Ultimate Hands-On Hadoop - Tame your Big Data!
HDFS, Spark, Flink, Hive, HBase, MongoDB, Cassandra, Kafka
Apache Spark 2 with Scala - Hands On with Big Data!
Python A-Z™: Python For Data Science With Real Exercises!

Statistics for Data Science and Business Analysis


Taming Big Data with MapReduce and Hadoop - Hands On!
Interactive Python Dashboards with Plotly and Dash
Bayesian Machine Learning in Python: A/B Testing
Power BI A-Z: Hands-On Power BI Training For Data Science!
Powerful Reports and Dashboards with Microsoft Power BI
Cluster Analysis and Unsupervised Machine Learning in Python
Elasticsearch 7 and the Elastic Stack - In Depth & Hands On!
Python for Statistical Analysis
The Complete Machine Learning Course with Python
Python for Data Science and Machine Learning Bootcamp
Machine Learning Masterclass: From Beginner to Advanced
Excel Map Dashboard with VBA
Practical statistics for data and business analysis
Building Recommender Systems with Machine Learning and AI
Cornwell Machine LeaMit Onlin Edx Yaser abu mostafa Math Book
theory in details 2300$ https://www.edx.org/microsoft-professional-program-data-science
Data Algorithms (Mahmoud Parsian)
ADVANCEDMicroMaster Mit Statistics and Data Science
ML + DL
ML + DL
Path Roadmap Study
ML + DL Practical
Time series Analysis
Social network Analysis
NLP
Computer Vision
Projects
Deep Learning
Reinforcement Learning
Feature Engineering & Feature Selection

Mathematics for Data Science Specialization


Probability
advanced Probability ( Part1 : The fundamentals , Par
Statistics I
Statistics II
descriptive statistics
Statistical Inference
A/B Testing
Bayesian statistics I
Bayesian statistics II
Markov models MCMC
Hidden Markov Model for Time series Analysis
Gaussian Process
Bayesian Inference, Markov Chain Monte Carlo and Metropol
Variational inference and Dirichlet process and Gaussian pr

Probability Graphical Models & Bayesian methods


advanced probability Graphical Model
Linear Algebra
Multivariate Calculus
information theory
Convex Optimization
XGBoost
Bagging
Stacking
StackNet H2o
Discrete Mathematics ( Graph Theory )
OOP + Functional + competitive + reactive programming
Data Structures and Algorithms
Design Pattern + DDD
Fluent python ( Optimize Code )
Refactoring Python Code ( Packtpub )
Functional Programming Scala
Cloud Computing
Web Scraping
Data Analyst and Data Engineering
C++
Blockchain Developer
Java Developer
Software Architecture

Parallel Programming
Concurrent Programming
Distributed Programming
Android Developer
iOS Developer

Data Engineer with Python (DataCamp)
Amazon + LinkedIn Courses
business intelligence
"Data Science": metro-rider snapshots
R + SAS + Scala
Clean and missing preprocessing
Scaling & Normalization
Feature Engineering and Selection
SkLearn
TensorFlow
Keras
Pytorch
Theano deep learning framework
Caffe, Caffe2 deep learning framework
MXNet deep learning framework
H2O
The Microsoft Cognitive Toolkit
Cake
Microsoft
tidyverse >> R Programming Visualization
Keras
Chainer – A flexible framework of neural networks
StanfordNLP
NumPy
pandas
Visualization
CNTK
Weka sted open source machine learning software that can be accessed through a graphical user interface, standard term
dplyr dplyr is a grammar of data manipulation, providing a consistent set of verbs that help you solve the most common da
Business Objects,
Tableau,
ChartIO, Create powerful dashboards from your company’s data and quickly discover insights with Chartio
JMP Statistical analysis as SAS
SPSS
Scipy
Recommender System
nltk, spacy NLP
, Sockeye, and NLP
OpenNMT NLP
Kibana, Kibana lets you visualize your Elasticsearch data and navigate the Elastic Stack so you can do anything from trackin
Zeppelin >> Web-based notebook that enables data-driven,interactive data analytics and collaborative documents with SQ
text mining techniques such as sentiment analysis, topic modeling
Power BI,
QlikView > end-to-end data integration and data analytics solutions and the expertise you need to build a data-driven ente
Cognos, IBM Cognos Analytics
Looker Analytics & BI Platforms
QSAR modelling
Deepchem democratize deep learning for science
Domo Analytics & BI Platforms
AutoML tools > Splunk , DataRobot , H2O , Rapid Miner , Big
Julia , Rust , Go
Graph algorithms: Neo4j, OR-Tools
software engineering standard methodologies such as code-reusability, modularity, use of repos
Data Warehouse concepts, ETL strategies and best practices.
Bayesian Inference in Python + Causal inference
Statistical Learning Theory and Applications
Kaggle, demos, hackathons)
AWS infrastructure (SQS, SageMaker, Lambda)
data visualization tools (Spotfire, Tableau, Qlik)
algorithms, data structures,
web scraping
Shiny (build interactive web apps straight from R.), Flask , Django
Linux command line and shell scripting.
data mining language (e.g., R, SAS, SPSS),
 programming language (e.g., Python, Scala, Java)
 Big Data development tools: Hadoop, Spark, PySpark or SparkR
 visualization tools (e.g., Power BI, Tableau, Shiny)
Sound knowledge of data science development tools (Jupyter, RStudio), ideally on a platform such as IBM Watson Studio, Anaconda, Databricks or GCP
Graph Theory

IoT
5G

Expertise on Spark in general and MLlib specifically (PySpark, Scala, SparkR)

Information Retrieval

Cloud Technology, APIs and Databases

especially GIS/spatial analysis; graph theory/network analysis; or natural language processing (NLP).

Working experience with non-Excel BI tools such as Tableau, Looker, Superset, PowerBI, etc.

soft skills
writing email

Improve Your English Communication Skills Specialization

Business English Communication Skills Specialization

Learn English: Advanced Academic Speaking and Listening Specialization


Academic English: Writing Specialization
Big Data

Apache Spark vs. Hadoop MapReduce

MLlib

Spark/MLlib

Postgres, Mongo, SQL, NoSQL

pySpark Python Spark Certification Training using PySpark

Agile Scrum and/or kanban method


Git
API access of complex event data

Cassandra
MongoDB
ElasticSearch
Docker
Agile experience
CouchDB
cloud services (AWS, GCP, Azure)

REST APIs
Docker, Kubernetes
Spark in general and MLlib specifically (PySpark, Scala, SparkR)
ETL tool like SSIS
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
 a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
 data automation and ETL tools ie: WhereScape, SSIS, Informatica
 analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
 Agile software development and tools ie: JIRA, Azure DevOps,

Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
LinkedIn Courses

BlockChain
Agile
Strong data profiling, cleaning, mining and technical documentation skills

2+ year(s) experience with NLP and text analytics methods and packages

Experience building end-to-end pipeline and deploying machine learning models


Experience with container-type environment: Docker, Kubernetes, Openshift, PCF

Experience with custom Web interfaces, API calls, and systems integration

Familiar with Linux environment, shell scripting, and Git


Experience working in an agile environment

causal inference

Familiarity with Google BigQuery

data analyst

AI
IBM Applied AI Coursera

Data Mining Specialization


Analyze Text, Discover Patterns, Visualize Data. Solve real-world data mining challenges.
data mining Online Courses
Data Visualization

Text Retrieval and Search Engines

Text Mining and Analytics


Pattern Discovery in Data Mining

Cluster Analysis in Data Mining

Data Mining Project

Mining Massive Datasets

NLP

Software Engineering
Book: Hands-On ML >> repeat with code + summary notes
Hisham Asem + Hands-On again + Kaggle

Udemy + Experfy + Udacity + Coursera [Michigan + Washington] + Edureka


data-Camp + Udemy Very Very Very important Course
ФКН ВШЭ (HSE Faculty of Computer Science) YouTube channel
https://www.kdnuggets.com/2019/10/10-free-top-notch-courses-natu

Hisham + Ahmed Fathi Math + Andrew + fast.ai


Packtpub (Practical ) + Book (Theory )
Google Cloud

HSE University Coursera >> https://www.coursera.org/specializatio


walid + Mit + jamal505 (Arabic Youtube Channel )
https://ocw.mit.edu/resources/res-6-012-introduction-to-probabilit
https://ocw.mit.edu/courses/mathematics/18-650-statistics-for-applic

Udacity
Udacity
data-camp
step by step Book
Chieh Wu + BayesGroup.ru
https://mlwhiz.com/blog

http://www2.stat.duke.edu/~rcs46/bayes18.html
AAILab Kaist Youtube Channel >> https://aai.kaist.ac.kr/xe2/

CMU
https://sites.google.com/site/cs228tspring2012/
Ahmed Fathi + Walid + Mit Gilberton + Visual Draw Youtube
youtube channel >> intrigano : https://www.youtube.com/watch?
first 3 chapter Entropy
Walid + ahmed bazi + stanford + NPTEL + AlROOMI Arabic

https://dzone.com/articles/xgboost-a-deep-dive-into-boosting
walid yousef

Videos + Book Python ( Walid Yousef ) + Book + Slides Google Drive al

pandas + NumPy + Matplotlib + SciPy (Mastering)

Udacity
Udacity

Udacity

Udacity

Coursera Software Design and Architecture Specialization

Coursera Parallel, Concurrent, and Distributed Programming in Java Specialization

Walid Yousef

Done from Hand-on + Hisham + Udemy A-Z


Hisham + Book + Packtpub + Course Stacking Loai

Intro to Machine Learning with PyTorch


Matplotlib + seaborn + Plotly Dash


Udemy


https://analyticsindiamag.com/10-popular-automl-tools-developers-can-use/

PyCon Sweden YouTube channels ( PyData ….) (Montreal-Python) (Enthought) (PyOhio) (Max Planck Institute for Intelligent Systems)
YouTube channel >> Center for Brains, Minds and Machines (CBMM)


https://www.coursera.org/specializations/improve-english



https://www.udacity.com/course/sql-for-data-analysis--ud198


Advanced Courses

Data Mining Specialization


https://www.classcentral.com/subject/data-mining
Arnab Dey ( this is Youtube Channel take Courses )

Artificial Intelligence - All in One ( Youtube Channel )

Artificial Intelligence - All in One ( Youtube Channel )


https://www.youtube.com/watch?edufilter=NULL&list=PLxfZ4sD3Aoe3DymDFe3-AS0tXyp-r6-qZ&v=f3KWjjEq2xQ

https://www.youtube.com/watch?edufilter=NULL&list=PLyGDRqxp7CWmcw1igY3nF0qjuUXLE0jAi&v=KPDubbl_WWE

Artificial Intelligence - All in One ( Youtube Channel )

Two Courses Artificial Intelligence - All in One ( Youtube Channel )

Hammad Books
Object-Oriented Analysis, Design and Implementation: An Integrated Approach
Microsoft SQL Server 2012 T-SQL Fundamentals
Domain-Driven Design Distilled
Berthold Vöcking
Foundations of Programming Languages
Introduction to Parallel Computing
Packtpub
Nando de Freitas Youtube Videos
Hands-On ML (book) + again with code in detail, with all topics step by step from internet

Theory Explain (Xoviabcs ) Haindi NPTEL + mathematicalmonk Youtube Channel (Math


advance applied Time series Analysis (https://nwfsc-timeseries.github.io/atsa
Michigan Coursera + DataCamp
Udacity expert

IBM AI Enterprise Workflow Specialization

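4 Courses: 1- Discrete Math and Analyzing Social Graphs 2- Calculus and Optimization for Machine Learning 3- First Steps in Linear Algebra for Machine Learning 4- Probability Theory, Statistics and Exploratory Data Analysis
Ben1994 (YouTube channel)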
Probability - The Science of Uncertainty and Data (EDX)
khan-academy

Introduction to Statistics: Descriptive Statistics Edx


Statistical Inference (Coursera) Youtube [Johns Hopkins]

Bayesian sts by hackers Book


Bayesian Data Analysis book by Andrew Gelman,

徐亦达 (Xu Yida) (YouTube channel)

http://nlp.chonbuk.ac.kr/
Coursera (Imperial College YouTube >> george soilis ) https://www.youtube.c
PCA >> https://www.youtube.com/watch?edufilter=NULL&list=PL2jykFOD1AWa-
Game Theory Stanford Youtube (Freemium Courses Channel )
Applied Optimization for Wireless- (IIT Kanpur July 2018)

Coursera very important Mathematics for Data Science Specialization ( 4 courses )

Udacity ( intro + data )

https://www.coursera.org/learn/cloudera-big-data-analysis-sql-queries
https://www.coursera.org/specializations/data-mining
Ryo Eng (Youtube Channel ) >> Arnab Dey ( this is Youtube Channel take Courses )

Ryo Eng (Youtube Channel )

andro theory very important + walid + yasser (theory) + a intro to st (http://faculty.marshall.usc.edu/gareth-
Coursera, also YouTube (Andrew)

Andreas Mueller Applied Machine Learning 2019 YoutubeAbdul


Ch Bari alqorithms + Ngoc Ha + Machine Learnin
Time Series Intel Course Coursera Practical TS Course Open Suny

Coursera Russia Advance udemy Hand-on

Probability for Data Scientists Book MIT RES.6-012 Introduction to Probability, Spring
https://ocw.mit.edu/courses/electrical-engineering-and-c
http://nlp.chonbuk.ac.kr/
(IIT Kanpur July 2018) Descriptive Statistics Using "R" Udacity (stat > Desc >> Infer )>> profe leanod >> s

Descriptive statistics | Probability and Statistics | Khan Academy


Improving your statistical inferences (Coursera ) Very Important

Coursera Advanced ML (Bayesian MethodS ) Bayesian Statistics Coursera University of C


Jordan Boyd-Graber Coursera Full Course

Max Planck Institute for Intelligent Systems Book


Carnegie Mellon University
https://brilliant.org/courses
Mit Course

a. Terminology & Basic concepts: Convex optimization, Lagrangian, Primal-dual problems, Gradients & subgradients, ℓ1ℓ
Zero sum game in game theory

7 Coursera San Diego Coursera Algorithms I & II Princeton University
business
Experience with developing business requirements, use cases and user stories in a data analytics context
Experience with data warehousing and business intelligence tools and platforms
Exposure to best practices in data management and data governance practices

Ability to deliver projects with complex requirements and a strong customer focus

Ability to influence and communicate with a diverse group of stakeholders from multiple levels of management
Ability to succeed in a team-oriented environment under very dynamic conditions
Project Management and People Management experience
Bonus Skills

Notebook experience (Jupyter, Zeppelin, Databricks, etc.) to perform data analysis and algorithm development using Python

Experience with NoSQL databases, such as MongoDB, Cassandra, HBase

Experience with graphics/visualization programming (CUDA, cuDNN)
Cloud computing experience
Experience with ML collaborative platforms/pipelines (MLflow, Neptune, Kubeflow, etc.)
Docker and/or Kubernetes exposure

Knowledge of big data technologies (Microsoft Azure, Google Cloud Platform, Cassandra, Spark, etc.) and advanced analytics (Azure Machine Learning, Google Cloud Machine Learning, SAS Grid, Splunk, etc.) (asset)

Experience with Version Controlled data pipelines (Pachyderm)

At least 2 years of experience working with one or more data mining tools such as R, Python, Scala or SAS
Hands-on experience working with Big Data technologies such as Spark, Cassandra, and/or Hadoop
Hands-on experience writing complex SQL queries and working with relational databases such as Oracle, DB2 or SQL Server
Hands-on experience constructing and manipulating JSON and XML documents and working with NoSQL databases such as MongoDB and CouchDB
Good understanding of microservices architecture and hands-on experience working with REST APIs
Expertise with Agile development methodology and Continuous delivery best practices
Knowledge of RPA automation tools such as UIPath or Blue Prism

Fluent in implementing logistic regression, random forest, XGBoost, Bayesian and ARIMA in Python/R
Experience in user path navigation with Markov Chain, Stan Bayesian analysis for A/B testing
Familiarity with Sentiment Analysis (NLP) and LSTM AI models
Experience in full AI/ML life-cycle from model development, training, deployment, testing, refining and iterating.

Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Knowledge of AWS Redshift, Snowflake or similar databases
Data mining experience working with Relational, NoSQL and Graph databases
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Experience in software engineering / algorithms is an asset
Experience with time-series is an asset
Experience with process mining is an asset
Experience in behavior economics and industrial psychology is an asset

Bonus points:
Some experience with analyzing time-series data
Familiarity with analyzing big datasets using tools such as Apache Spark
Some experience with R and related machine learning libraries
Additional experience with deep learning frameworks such as TensorFlow and PyTorch
Experience with accessing data from other datastores in the AWS ecosystem such as ElasticSearch, S3, and DynamoDB
Exposure to working with software development teams and using tools such as Git, GitHub, and Zenhub

Languages: Java, Scala, Python


Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty

Databases: MySQL, Snowflake, S3/Parquet

Amazon Web Services to keep everything running

Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies (HDFS, HBase, Spark, Kafka)
In-depth knowledge of pattern recognition models such as Fourier transform, Wavelet transform, and change point analysis

Knowledge in six sigma methodology an asset

You’ve worked with different types of data infrastructures (e.g., data warehouses or data lakes)

Experience with Version Controlled data pipelines (Pachyderm)

Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);

Familiarity of software engineering tools (JIRA, git);

Strong command of SQL and working with multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)


Qualifications
Advanced degree in Computer Science or Computer Engineering with a focus on ML or NLP (PhD preferred)
Minimum of 3 years of applied experience in ML, deep learning and NLP
Have expert understanding of machine learning and NLP tasks such as classification, feature engineering, information extraction, structured prediction, sentiment analysis, Q/A, NER and topic modelling
Fully understand different neural networks (LSTM, CNN, RNN, seq2seq, BERT etc.), different word embedding models and transfer learning.

Proficient in Python or C/C++

Knowledge of packages such as Tensorflow, Pytorch, Keras, Scikit Learn, Pandas, NLTK, Gensim, spaCy and XGBoost
Have experience with embeddings, transfer learning, and interpretability methods
Have working knowledge of Hadoop ecosystem (e.g. Spark)
element of st Pattern Recognition and Machine Introduction to Statistical Machine Learning
Nando de Freitas

Applied Text Mining in Python Michigan



(Harvard University) UCI Open Actuarial Path

Practical Statistics for Data Scihttps://www.classcentral.com/subj


Statistics Making Sense of Data ( To

bayesian stat in R Duke Coursera Medium + Youtube Video

edureka!
Coursera Knowledge of uncertainty quantification metho

Operations Research
UDEMY Master the Coding Intervie
Coursera Stanford University Pluralsight

Lawrence Leemis

Inferential Statistics (Duke) Coursera

PGM Youtube >> kayhan batmang10708-18Spring instructors-team


DataCamp
2020
2 + 2 = 4 Day 30 * 4 = 120 Hour >> 100 Hour

Plan : 30 Day
Month 4 Time series Udemy
Time series Coursera
Time series DataCamp
Machine Learning Washington Univ
Regression

Feature Engineering & Feature Selection

Month 5 Machine Learning Washington Univ


Clas Clustering
NLP 3 Course study in details understand
R-programming udemy
Delay after Finish everything Applied AI

Start DL Hisham + ahmed Fathi + with andro + Riad Almadani important Lectures
Month 6 / 4 - 5 Hour / Day DL Hisham with Andro
TensorFlow Coursera + Book
NN Design
Pytorch
Keras
Recommender System Udemy & Coursera
Geoffrey Hinton NN Coursera
Month 7 Reinforcement Learning
Udemy SVM Course
Ensemble Projects
Ensemble Bagging XGBoost Stacking HandBook
Unsupervised Handbook
Packt Hand-On
ensemble analyticsvidhya path 2020
Month 8 DataCamp
Data Scientist Career Track
Machine Learning Career Track
SQL + R
all Projects
Hisham Projects >> Udemy projects 1 + 2 >> Udacity Projects >> DataCamp Projects >> analytic

Advanced topics to study if time is available:


Inferential Statistics (University of Amsterdam) Books ML + DL + …..
Stochastic processes (Coursera) >> Game theory edX ML Columbia + Harvard + DL
Udacity DL
Data structures and algorithms Udacity Self-Driving Car
PGM NPTEL lectures, understand in depth
Convex Johns Hopkins
Game theory >> Linear Programming Advanced Russia Very Important
Discrete optimization specialization
stochastic process
Advanced High Level :
Data Mining Specialization Coursera
(Month Data >> MRR ) Data structures and algorithms
Monte Carlo + Markov chain PGM
Gaussian Models >> LDA QDA Convex
Bayesian statistics Game theory >> Line
Discrete optimization
stochastic process

Month 9 PGM CMU + Coursera
Gaussian Process & Bayesian & HMM
Bayesian Inference in Python + Causal inference
Duke Bayesian Statistics very very important + University of California, Santa Cruz
Inferential Statistics (University of Amsterdam)

Month 10 Data Structures and Algorithms + Discrete Math (Graph Theory)


Python in depth: Factoring + OOP + Design Pattern + Functional programming
software engineering
web scraping
TDD, DDD and refactoring skills

REST APIs


Month 11 Flask , Django
Shiny >> build interactive web apps straight from R.
Problem Solving python ( Hani sayed )

Month 12 Convex Optimization in Details

Theano
Caffe, Caffe2
MXNet
H2O
Weka
CNTK
Chainer
Recommender System
deeplearning4j Deep Learning for Java
text mining techniques such as sentiment analysis, topic modeling
Deepchem
AutoML tools > Splunk , DataRobot , H2O , Rapid Miner , Big ML
Julia , Rust , Go
AWS infrastructure (SQS, SageMaker, Lambda)
tidyverse >> Visualization R-Programming
dplyr >> Grammar
Shiny >> build interactive web apps straight from R.
IBM Watson
GIS/spatial analysis; graph theory/network analysis
Git


Experience with ML collaborative platforms/pipelines:
MLflow (an open source platform for machine learning)
Kubeflow (The Machine Learning Toolkit for Kubernetes)
Neptune
Notebook experience (Jupyter, Zeppelin, Databricks, etc.)

Groups:

2021

Big Data + DB in Details 15-day 15-day 15-day


Month 1 SQL 100 Pages LinkedIn Projects WorkFlow Cloud Developmen
Start BigData Engineering with Moustafa + Agile AWS Jira
CMU>> intro to DB system + Advanced Scrum GCP , Git,
NPTEL kanban Azure Bitbucket
Book :: Design Waterfall , Jenkins
DataBase
Data Analyst + Data Scientist Udacity
Month 2

Month 3

Path:
>> Data analyst + Dat
Again to refresh your knowledge ML+DS San Diego
Month 4 Search Internet All Topics in Details Yandex
Yasser abu Mostafa Learn from Data Cloudera
A/B testing, Udacity
InterView Questions Udemy All
Udemy ML A-Z but R-programming Data Engineering, Big
Fast.ai (DL Projects )
Udemy Coursera
Month 5 Quora questions for interview
CMU ML University Lectures Theory (from syllabus)
DL + ML all Video + material (Pluralsight)
DS + ML Interview Questions

Month 6

Month 7 Projects WorkFlow


Cloud
Development Tools
Deploy container-type environment
Month 8 Analysis
Visualization

Month 9

Month 10 IELTS Study Start

Month 11

Month 12
2021

15-day 15-day 15-day


Deploy container- Analysis Visualization DB in Depth Big Data
Docker, SAS Business Objects, Neo4j Graph AData Warehouse concepts, ETL str
Kubernetes). SPSS Tableau, Postgres, Mongo, SQL, NoSQL
Openshift, Scala ChartIO  Big Data development tools: Hadoop, Spark, PySpark or SparkR
PCF JMP Cloud Technology, APIs and Databases
Kibana
Zeppelin
Power BI,
QlikView
Cognos, IBM Cognos Analytics
Looker Analytics & BI Platforms
Domo Analytics & BI Platforms

data visualization tools (Spotfire, Tableau, Qlik)


 visualization tools (e.g., Power BI, Tableau, Shiny)
Apache Superset
D3
>>SQL Expert: + Carnegie Mellon University
Coursera:Learn SQL Basics for Data Science Specialization 5 Courses (UCDAVIS)

Group6:
Big Data:
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, ElasticSearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
 a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
 data automation and ETL tools ie: WhereScape, SSIS, Informatica
 analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, CouchDB, DynamoDB, HBase, Neo4j, Cassandra, Redis)

Group7:
Agile Scrum and/or kanban method
 Agile software development and tools ie: JIRA, Azure DevOps,

Group8:
 visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI

Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)
Programming
Java problem Solving
C/C++
BlockChain
Web Developer
JavaScript

Jobs :: book LinkedIn Jobs need Canada remote work
data analyst Real-World Machine Learning
fdsfsd procom Loay Egypt Team
Data scientist statistics + probability Python/R/Scala abdallah
ML (Hadoop/Spark
Big data Experience with AWS
ML frameworks/ libraries (e.g. Scikit-lear
Hadoop, Spark, Redis, HBase, Kafka, etc.

Signal processing
Geo-spatial data processing
Natural Language processing
Social network analysis
Recommender system
Adaptive experimentation techniques
Experience with data visualisation tools, such as D3.js, GGplot, Tableau, TIBCO Spotfire.
Proficiency in using query languages such as SQL, Hive, Pig
Experience with NoSQL databases, such as MongoDB, Cassandra, Hbase
personal projects, Kaggle, publications, or presentations at meetups/conferences
NLP frameworks/libraries e.g. Gensim, SpaCy is a plus
Caffe, Tensorflow, PyTorch
Experience with analytical tools supporting data analysis (eg. Tableau)
AWS services including SageMaker

udemy :

Machine Learning A-Z™: Hands-On Python & R In Data Science


The Data Science Course 2019: Complete Data Science Bootcamp
Python for Data Science and Machine Learning Bootcamp
Machine Learning, Data Science and Deep Learning with Python
Deep Learning A-Z™: Hands-On Artificial Neural Networks
The Complete Machine Learning Course with Python
Unsupervised Machine Learning Hidden Markov Models in Python

Cutting-Edge AI: Deep Reinforcement Learning in Python

Unsupervised Deep Learning in Python

Bayesian Machine Learning in Python: A/B Testing

Machine Learning and Data Science Hands-on with Python and R


https://www.udemy.com/course/machine-learning-become-kaggle-master/
https://www.udemy.com/course/neural-networks-fundamentals-in-python/
https://towardsdatascience.com/important-topics-in-machine-learning-you-need-to-know-21ad02cc6be5
Online master's
Illinois
Imperial
MIT
Try in Canada
Toronto or Waterloo
Ryerson

Plan :: 3-Year >> P.E + IELTS + Prepare for Master
month-9-2019 >> chapter 8 >> Hisham
month-10-2019 >>8+9+10+11>> all Practical + revision
month-11-2019>>11+12 >> All Practical (Udemy) +(Hourani)
month-12-2019>>Practical DataCamp + Hands-On book (Mostafa El-Hosseini) + applied (Michigan) + Seraj
2020 >> Start Python ML Book + Pluralsight (very important TensorFlow)
all Time series Analysis from DataCamp

1>> Udacity + Project Follow-up Continue


2>Udacity

3>Udacity
ML Michigan University
ML Washington University
ML IBM
ML pluralsight + advanced Python

ML Edureka
ML LinkedIn
ML mlcourse.ai. Lecture 0. Introduction

Data Engineering & Big Data


GCP, AWS, Azure, Databricks, Snowflake
Hadoop & Spark & Scala in details
San Diego
Yandex

Edureka (https://www.edureka.co/masters-program/big-data-architect-training)

Udemy

GCP Data Engineering

Data Engineer Udacity


PySpark Certification Training (Edureka), very important

Data Structure and Algorithms


1-Walid Yousef and Data Science course
2-Dasouqi + masrCode
3-Udacity + Coursera ( 6 Courses in this Specialization)

Deep Learning
Hisham +andrew +ahmed Fathi
Mutiz saad
Geoffrey Hinton NN Coursera
Udacity + Udemy
NVIDIA Courses
David Barber’s Bayesian Reasoning and Machine Learning

Statistics and probability


Introduction to statistical learning
The Elements of Statistical Learning
Machine learning: a Probabilistic Perspective
Learning From Data
Bishop's Pattern Recognition and Machine Learning
information theory ( Mohammed Hussin)
new book 2019 >> very important ML probabilities
specific topics
https://www.coursera.org/specializations/probabilistic-graphical-models
4-Pattern recognition: Bishop's Pattern Recognition and Machine Learning
5-Convex Optimization
Graphical Modelling Certification Training (Edureka is very important)

Cloud Computing
AWS
salesforce
Cloud Architect Master Program (Edureka)
OpenStack

DevOps Engineer Masters Program


Puppet, Nagios, Chef, Docker, Git & Jenkins
Linux, Python, Docker, AWS DevOps Certification Training and Splunk
Plan: what to study? Roadmap Yousef
[ML+DL+RL]+Big Data+[Math+Probability+Statistics+Programming+Advanced Math]
P.E + IELTS + Prepare for Master


Data Science Specialist with experience of Data Analysis and Machine Learning projects.
Solid knowledge of Pandas, Numpy, Scikit-learn, Spark, SQL, Tableau, and AWS
York University Big Data Analytic

WeCloudData + Brainstation
Adaptable data analysis skills with statistical business intelligence
Strong programming skills with Python and its toolkits (Pandas, NumPy, SciPy, etc.)
Proficiency in database management system: MySQL, NoSQL
Scalable data processing platforms: Hadoop, MapReduce, Spark, Hive

Machine Learning Tools: TensorFlow, Sklearn, SparkML


Agile team reporting/Data visualization: Jira, Tableau, Apache Superset 
My technical skills include the complete package: Python programming with Structured an

• Programming: Python (Numpy, Scipy, Pandas, Scikit-Learn, pyspark), C/C++, Linux Command Line Interface (CLI)
• Machine Learning: Linear Regression, Logistic Regression, SVM, Decision Trees, Random Forest, KNN, AdaBoost, Gradient Boosting, Neural Networks (Multi-Layer Perceptron), K-means clustering, PCA
• Database: MySQL
• Data Collection: Web Scraping (requests and Beautiful Soup) or API request
• Big Data: Hadoop, Hive, Spark, Databricks, AWS (S3, EC2, EMR, RedShift, DynamoDB, IAM), Azure (data lake, SQL database, pipeline in data factory)
• Data Visualization: Tableau, Matplotlib, Seaborn
Key Skills:
• Data collection, cleansing, and storing: SQL, OpenRefine, Hadoop, Bash, Git.
• Advanced big data analytics: Python, SAS, R, statistical modeling, A/B testing, clustering.
• Machine learning: reinforcement learning, neural networks, TensorFlow, PyTorch, Spark.
• Data visualization and presentation: Tableau, Shiny, D3, PowerPoint, creation of KPIs (key performance indicators), dashboards, and reports (including automated reporting).
• Strong knowledge of MS Office suite: advanced Excel, Visio.
• Solid experience with conducting statistical analysis in Minitab.
• Lean Six Sigma Green Belt.

needed for Jobs

Skills : Done
Sklearn on-Going

TensorFlow, PyTorch, Theano, Caffe, Caffe2, or MXNet


sklearn, XGBoost, tidyverse, Tensorflow, pytorch, Kafka, Spark, Elasticsearch etc.

Scikit-Learn, H2O, Keras, TensorFlow


(numpy, scipy, pandas, scikit-learn, tensorflow/keras/pytorch, etc.)
linear/logistic regression, discriminant analysis, bagging, random forest, Bayesian model, SVM, neural netw

statistics skills, such as distributions, statistical testing, regression, etc.

Advanced pattern recognition and predictive modeling experience


analyze A/B tests
Experience building shallow or deep learning models (GBDT, CNN, RNN, LSTM), toolkits e.g. Matlab, RStudi
TensorFlow, Spark/MLlib or

Apply statistical inference to draw conclusions from data


Create essential performance metrics.
Experience utilizing both qualitative analysis (e.g., content analysis, phenomenology, hypothesis testing) an
(i.e. Classification, ranking, segmentation, multivariate regression and/or pattern recognition techniques
Professional experience using unsupervised learning techniques - clustering, word embeddings, dimension
Knowledge of uncertainty quantification methods (Bayesian methods, confidence intervals, probabilistic gr
Python (numpy, pandas, sklearn, xgboost, TensorFlow)

Design hypothesis testing, backtesting, model validation, and data visualization systems

algorithms and data structures

SageMaker

social network analysis methods


Understanding of statistics (e.g., hypothesis testing, regression, signal-to-noise ratio, confidence bounds)
Understanding of programmatic modeling (e.g., mathematical optimization, confidence
statistical software (ex. dplyr, Pandas)
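Machine learning (supervised and unsupervised methods) and exploratory/statistical data analysis (such as linear models, multivariate analysis, predictive modeling and stochastic models). Experience extracting and cleaning data sets.
Domain experience in math and statistical methods such as hypothesis testing, confidence intervals, and various probability distributions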
graph algorithms and semantic Web
navy mission systems
Knowledge of Machine Learning concepts: e.g., cross-validation, regularization, boosting, bootstrapping, etc.
Experience with one or more Business Intelligence and visualization tools (Business Objects, Tableau, ChartIO, JMP, etc.) is a plus
Working knowledge of one or more statistical analysis packages: R, SPSS, SAS, Numpy/Scipy, etc.
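Experience in statistical techniques such as classification, clustering, regression, statistical inference, collaborative filtering, and natural language processing, experimental design, social networking analysis, feature engineering, etc.
Experience using one or more of the following software packages: scikit-learn, numpy, pandas, jupyter, matplotlib, scipy, nltk, spacy, keras, tensorflow
Run experiments with the latest deep learning models using frameworks such as TensorFlow and PyTorch, and toolkits such as Tensor2Tensor, Sockeye, and OpenNMT
A minimum of 2 to 4 years experience with mathematical model creation and/or evaluation, hypothesis testing and experimental design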
Use data visualization tools (e.g., Tableau, Kibana, Zeppelin) to share ongoing insights.
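Expertise in data mining algorithms and statistical modeling techniques such as clustering, classification, regression, decision trees, neural nets, support vector machines, ensemble modeling and text mining techniques such as sentiment analysis, topic modeling and entity extraction
AWS, Azure, GCP, Cloudera, Databricks, Snowflake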
Hadoop, Spark, Kafka
relational SQL and NoSQL databases, including Postgres and Cassandra.
stream-processing systems: Storm, Spark-Streaming,
Knowledge of VBA, SQL, MariaDB, and MCG IAM Web Harness software mandatory
Experience with data visualization tools, such as Power BI, QlikView
Knowledge of one of the following: Business Objects, Tableau, Cognos, Looker, Power BI
Big Data technologies such as Hadoop, Kubernetes, Cassandra
Experience of data wrangling and data munging, using Big Data technologies
Hands-on experience with the following platforms/tools - Spark, Redshift / Postgres, AWS, Linux, Hive / Presto, Git
e.g., S3, EC2, EMR, Redshift, DynamoDB, Kinesis, etc.
Advanced knowledge of data visualization techniques and software tools (eg. R, Qliksense, Qlikview, HTML5, D3)
Working knowledge of ETL tools (eg. Informatica, Ab Initio, Talend).
Experience with causal inference techniques, experimental design and/or A/B testing
Experience with creating visuals and dashboards in BI tools (e.g. Looker, Tableau, Power BI, Google Data Studio)
visualization tools (e.g. Chartio, Looker, Tableau)
Experience working with microservice architectures/Docker/Containerization
e SQL, Hadoop, Spark, BigTable or DynamoDB
Experience with distributed data processing systems (e.g. Spark, Redshift)
Big Data Platforms and tools (e.g. Cloudera, Hortonworks, MapR, Hadoop, Pig, Hive, etc.)
Impala, and SQL – for queries
NoSQL databases (ex. HBase, MongoDB,ArangoDB, Neo4J) – for behavioral analysis
Data formats – ex. JSON, flat files, Parquet, ORC files, Avro
Extract-Transform-Load (ETL) processes
Data visualization tools (ex. Tableau, Qlik, IPython, etc.)
Kubernetes
HTTP, JSON, REST.
Postgres, Mongo, SQL, NoSQL.
Working knowledge of one or more SQL languages: Oracle, MySQL, PostgreSQL, Redshift, etc.
A passion to automate everything
Hands-on experience scripting and programming skills
Excellent understanding of machine learning techniques and algorithms,
such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
Proficiency in using query languages such as SQL, Hive, Pig
Experience with databases, such as MongoDB, Cassandra, MSSQL, Oracle
Experience in scripting with Bash, Perl, YAML, or other scripting languages
Data visualization tools such as Tableau, ggplot, d3.js and Matplotlib
Experience analyzing data from 3rd party providers: Google Analytics, Facebook Insights, etc.
Experience visualizing/presenting data for stakeholders using: Data Studio, Datalab, Redshift, etc
Experience with Amazon Web Services (EC2, S3, RDS, EMR, ELB, etc.)
Experience with web services using REST in Java
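Experience with database management systems, data platforms (e.g. Streaming Data Platform, Kafka, Logstash, Teradata, Hadoop, etc.), and data formats (e.g. SQL, NoSQL, AWS S3, JSON, Redis, neo4j, etc.) for efficient ML feature extraction and data transformations.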
Experience in AWS or GCP using tools such as EMR, S3, EC2, Deep Learning AMI’s, SageMaker
Above average capabilities with cloud computing techniques or tools such as S3, EC2, EMR, SageMaker, ECS, Docker, Gitlab CI, Python packaging, command-line executions and shell scripting
Data and Model pipeline and workflow management tools: Azkaban, Luigi, Airflow, Dataiku, etc.

GitHub or Kaggle
git, mercurial, jenkins, travis, jira, asana, etc.
Source control experience, preferably GIT
Weka
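Experience with popular DevOps tools such as (Puppet, Chef, Ansible, Vagrant, Docker, Jenkins, Maven, Selenium, or Jira), provisioning, infrastructure as code, and other DevOps concepts and tools is desirable.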

Experience with high-performance Deep Learning frameworks such as TensorFlow, PyTorch, Theano, Caffe, Caffe2, or MXNet
Experience with popular machine vision frameworks in python and other languages such as Keras, Caffe, TensorFlow, Lasagne or Torch
using technologies like; git, mercurial, jenkins, travis, jira, asana, etc.
Published work in Data Science related journals or conferences such as ICML, NIPS, JML, KDD, and INFORMS
project available in a public forum like GitHub or Kaggle
Contributing to influential Open Source Projects like sklearn, XGBoost, tidyverse, Tensorflow, pytorch, Kafka, Spark, Elasticsearch etc.
project management skills
a quantitative field
Agile environment >> Scrum
ETL processes
Experience constructing SQL queries (using Postgres or a similar platform)
visualization tools such as Tableau, PowerBI,
Framework Projects + Coding + R reinforcement Learning
TensorFlow OOP Python DQN Learning
Keras advance advance Python sentdex
Pytorch 2G system Project KPIs Ahmed Arabic
TENSORFLOW 2.0 PRACTIPower Control Coursera Practical RL
https://www.udemy.com/course/tensorflow-2-practical/
Handover udacity deep
TENSORFLOW 2.0 PRACT Traffic simoninithomas important

https://www.udemy.com/course/tensorflow-2-practical-advanced/
DYMA Half rate Reinforcement learning with Tensorflow 2.0
BCCH https://rubikscode.net

udemy: Cutting-Edge AI: Deep Reinf
Artificial Intelligence: Reinforcemen
Advanced AI: Deep Reinforcement L
Deep Reinforcement Learning: A Hands-on Tutorial in Python
https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
Hands-On Reinforcement Learning With Python
https://www.mlq.ai/deep-reinforcement-learning-for-trading-with-tensorflow-2-0/
https://www.theschool.ai/courses/move-37-course/lessons/markovdecisionprocesses/
Book + Video: Reinforcement Learning with TensorFlow & TRFL [Video]
https://blog.varunajayasiri.com/ml/dqn.html
https://www.quora.com/What-are-some-of-the-good-Reinforcement-Learning-libraries
http://videolectures.net/rldm2015_silver_reinforcement_learning
https://adventuresinmachinelearning.com/category/reinforcement-learning/
https://www.freecodecamp.org/news/an-introduction-to-deep-q-learning-lets-play-doom
https://www.mlq.ai/deep-reinforcement-learning-twin-delayed-ddpg-algorithm/
https://github.com/adik993/reinforcement-learning-sutton
https://github.com/vinhvu200/MazeAI
https://github.com/rabieifk/Prison_Break_Machine_Learning
https://arxiv.org/pdf/1810.09967.pdf
https://mc.ai/choosing-a-deep-reinforcement-learning-library/
https://modelzoo.co/category/reinforcement-learning
https://deeplizard.com/learn/video/FU-sNVew9ZA
https://www.youtube.com/channel/UC7ZVvEo7-B7lA6LY2MVX72A/playlists
https://github.com/dennybritz/reinforcement-learning/tree/master/DQN
https://www.endtoend.ai/rl-weekly/
https://joshgreaves.com/reinforcement-learning/introduction-to-reinforcement-learning/
https://www.youtube.com/watch?v=Pka0DC_P17k
https://dzone.com/articles/trading-strategies-using-deep-reinforcement-learni
University of Alberta
Big Data and Data Engineering
Hadoop+Spark
Scala,
NoSQL databases and unstructured/semi-structured dat
https://maropost.breezy.hr/p/1e186b8808be
Scala and Spark for Big Data and Machine Learning (Udemy)
Spark and Python for Big Data with PySpark (Udemy)



Data Engineering

Udacity
master Big Data
https://www.edureka.co/all-courses

https://www.quora.com/As-a-fresher-should-I-learn-Ha
#1 Big Data Hadoop Certification Training-Edureka
#2 Big Data Specialization by UC San Diego-Coursera
#3 Big Data Architect-Simplilearn
#4 Become a Data Engineer – Udacity
#5 Big Data Hadoop Certification Training-Simplilearn
#6 The Ultimate Hands-On Hadoop – Tame your Big Dat
#7 Taming Big Data with MapReduce and Hadoop – Ha
#8 Taming Big Data with Apache Spark and Python – Hands On!-Udemy
#9 Learn Big Data: The Hadoop Ecosystem Masterclass-Udemy (https://www.udemy.com/course/hands-on-hadoop-masterclass-tame-the-big-data/)
#10 Hadoop MAPREDUCE in Depth | A Real-Time course on Mapreduce-Udemy
#11 Learn By Example: Hadoop, MapReduce for Big Data problems-Udemy
#12 Yandex
#13 Cloudera CCP Spark and Hadoop Developer certificat
#14 Data Science Council of America (DASCA)
#15 Big Data University
#16 https://data-flair.training/
#17 https://www.npntraining.com
#18 Pluralsight
#19 LinkedIn: Architecting Big Data Applications: Batch Mode Application Engineering

SQL
Managing Big Data with MySQL by Coursera
Beginner’s Guide to PostgreSQL by Udemy
High-Performance MySQL

Hive
Accessing Hadoop Data using Hive by Big Data University
Learning Apache Hadoop Ecosystem Hive by Udemy
Apache Hive Documentation

Pig
Apache Pig 101 by Big Data University
Programming Hadoop with Apache Pig by Udemy

Apache Storm
Amazon Kinesis Documentation
Amazon Kinesis Streams Developer Resources by Amazon Web Services
Apache Spark
Data Science and Engineering with Apache Spark by edX
Apache Spark Documentation
Book – Learning Spark

Apache Spark Streaming


Apache Spark Streaming Documentation

HDFS
Big Data and Hadoop Essentials by Udemy
Big Data Fundamentals by Big Data University
Hadoop Starter Kit by Udemy
Apache Hadoop Documentation

Cloud
Big Data Technology Fundamentals by Amazon Web Services
Big Data on AWS by Amazon Web Services

Apache Kafka
https://www.analyticsvidhya.com/blog/2017/03/big-data-learning-path-for-all-engineers-and-data-scientists-out-there/
The complete Apache Kafka course for beginners by Udemy
Learn Apache Kafka Basics and Advanced topics by Udemy
Apache Kafka Documentation
Book – Learning Apache Kafka

Apache Zookeeper
Apache Zookeeper Documentation
Book – Zookeeper

Impala

Scala
Functional Programming in Scala Specialization Coursera

Google
Cloudera
IBM



ML+DL Plan ML Arabic
1-Hisham + revise all notes and summarize them ML Hisham
2-A-Z udemy ML Hisham Fealout + DL
3-Project udemy ML Irag important
4-https://github.com/tuanavu/coursera-university-of-washington/tree/master/machine_learning (University of Washington)
5-medium + github Projects Topics
6-revision: inferential + descriptive statistics

7-Udacity Machine Learning Projects


8- powerful Course

9- ahmed Fathi ML + DL + Math


10- Deep Learning Hisham
11- udemy A-Z DL
12- udemy
13- udemy

14- udacity Deep Learning


15- Motaz Saad all Deep Learning
16- AI for everythin (Hindi ) 25G
https://www.coursera.org/learn/tensorflow-sequences-time-series-and-prediction

17-Data Camp Courses


18-Husam Hourani Full Course
self driving Car udacity
https://github.com/alberduris/The_Math_of_Intelligence
Andrew E. Khalel Reinforcement Learning
Topics Waleed Yousef
data structures and algorithms

Riad Almadani important Lectures

Udemy : TensorFlow 2.0: A Complete Guide on the Brand New TensorFlow

https://www.eduonix.com/learn-machine-learning-by-building-projects

https://www.eduonix.com/deep-learning-neural-networks-python-keras-for-dummies
Python Machine Learning Book
Pattern Recognition and Machine Learning Book + Walid Yousef
https://www.udemy.com/course/time-series-analysis-in-python/
Khan Academy Math
The Complete Python Course for Machine Learning Engineers (master python for ML Engineer ) Udemy
AWS Machine Learning: A Complete Guide With Python
https://aws.amazon.com/certification/certified-machine-learning-specialty/
fast.ai
Machine Learning EDX
Two Excellent Book Companions: Introduction to Statistical Learning, Hands-On Machine Learning with Scikit-Learn and TensorFlow
IBM
University of Washington
Advanced Machine Learning Specialization — Coursera
LinkedIn Specialization topics (very important)
AWS Machine Learning Certificate
Google ??
https://randlow.github.io/categories/cat_machine-learning/ ( important project ) Credit risk
https://www.edureka.co/masters-program
https://www.dezyre.com/article/top-10-machine-learning-projects-for-beginners/397
Pytorch
https://github.com/imhgchoi/pytorch-implementations
Advanced topics: Machine Learning from YouTube
https://www.youtube.com/watch?edufilter=NULL&list=PLqJm7Rc5-EXFv6RXaPZzzlzo93Hl0v91E&v=ZT8LszMo0D4
https://www.youtube.com/watch?edufilter=NULL&list=PLqJm7Rc5-EXFUOvoYCdKikfck8YeUCnl9&v=XLHB-Aktxw0
https://machinelearningmastery.com/start-here/ (Jason Brownlee)
CBCSL teaching YouTube channel: explains the math of machine learning
https://www.packtpub.com/data/machine-learning
Bhavesh Bhatt ( Youtube Channel ) very important

ML
Coursera 10 Courses:

Udacity
Udemy 10 Courses:

Edx 5 Courses:
YouTube Channels: Top 20
DL NLP
Coursera (5 Courses ) + ahmed Fathi understand theory
Udemy A-Z
Udacity
IBM Coursera
Projects Practical python
Udacity inferential and descriptive Statistics

Free lessons: MATH ADVANCED - Multivariate Calculus for Machine Learning by Samuel J. Cooper, Imperial College London
https://github.com/umer7/Deep-Learning

Neural Networks for Machine Learning by the University of Toronto (taught by Geoffrey Hinton) via Coursera
Creative Applications of Deep Learning with TensorFlow by Kadenze I + II + III (https://www.kadenze.com/courses/creative-applications-of-deep-learning-with-tensorflow/info)
Reinforcement Learning Specialization (4 courses), University of Alberta, Coursera
https://medium.com/free-code-camp/dive-into-deep-learning-with-these-23-online-courses-bf247d289cc0

Applied AI with DeepLearning IBM


TensorFlow in Practice Specialization (4 courses) deeplearning.ai
https://github.com/NirantK/awesome-project-ideas

http://course18.fast.ai/index.html
http://course18.fast.ai/part2.html
https://course.fast.ai/videos/?lesson=5
https://www.edx.org/professional-certificate/ibm-deep-learning
https://cs230.stanford.edu/

Full from A-Z in details Deep Learning ( Youtube ) [Ahlad Kuma]

https://www.youtube.com/watch?fbclid=IwAR0HxtFrUjsVtUYl_q2VE_xQ8CN5H-k-5LHpYrQcZ-zCgQN1f7FNVoh_FBA&edufilter=NULL&list=PLdxQ7SoCLQANQ9fQcJ0wnnTzkFsJHlWEj&v=Bn_jRbQcmV4

Youtube Channel :: Graphical Models [Max Planck Institute for Intelligent Systems]

https://www.youtube.com/watch?fbclid=IwAR02Ix6gO-x4PpCfyH_ZbqodCuzEPyaM5IU44NJRhaL_pfC94DxwVt5z4lc&index=1&edufilter=NULL&list=PLeyrCtKXMqvbafdWvljuORVu__CPlPEaB&v=ju1Grt2hdko
https://blog.floydhub.com/best-deep-learning-courses-updated-for-2019/
Deep Learning For Coders by Jeremy Howard, Rachel Thomas, Sylvain Gugger - fast.ai
CS224n: Natural Language Processing with Deep Learning by Christopher Manning, Abigail See - Stanford
CS231n: Convolutional Neural Networks for Visual Recognition by Stanford
MIT Deep Learning by MIT
Computer Vision
interview Data scientist & Machine learning
very very important
Computer Vision Sp
Nvidia Courses https://github.com/vlgiitr/DL_Topics
http://nitin-panwar.github.io/Top-100-Data-science-interview-questions/?utm_campaign=News&utm_med
https://www.quora.com/profile/Prasoon-Goyal

https://www.coursera.org/specializations/tensorflow-in-practice


unsupervised Key Conce https://www.mlq.ai/guide-to-deep-reinforcement-learning/
Math
https://www.mlq.ai/stock-market-clustering-with-k-means/
The concepts we will https://www.mathsisfun.com/data/index.html
Convex Opti
Markov Decision Processes (MDP)
Dynamic Programming
Monte Carlo Methods
Temporal Difference Learning

Approximation Methods for Rein


Deep Learning

Introduction to OpenAI: CartPol


RBF Neural Networks
TD Lambda
Policy Gradient Methods
Deep Q-Learning

A3C: Asynchronous Advantage Actor-Critic


Summary: Deep Reinforcement Learning

Lecture 1: Introduction to Reinforcement Learning

Lecture 2: Markov Decision Processes


Lecture 3: Planning by Dynamic Programming

Lecture 4: Model-Free Prediction

Lecture 5: Model-Free Control

Lecture 6: Value Function Approximation

Lecture 7: Policy Gradient Methods

Lecture 8: Integrating Learning and Planning

Lecture 9: Exploration and Exploitation

Lecture 10: Case Study: RL in Classic Games


Statistics master
Intro to Descriptive Statistics (Udacity course) iliouns
Intro to Inferential Statistics (Udacity course) imperial
Elements of Statistical Learning (Free ebook)
https://medium.com/swlh/how-to-become-a-data-scientist-2a02ed565336
Khan Academy https://uwaterloo.ca/data-science/graduate-programs/master-data-science-and-artificial-intelligence/tuition
Walid Yousef Vector Institute Scholarships.

EDX
AnalyticsVidhya

Bayesian Statistics: From Concept to Data Analysis Coursera


Statistics with Python Specialization (University of Michigan) Coursera
theEducationalNest GeoSociety ( Youtube) Practical Probability
LEARN & APPLY: Lean and Six Sigma ( Practical Probability )
StatQuest



Certificates R-Programming
ML Andrew Price: $80
DL Andrew Price
Machine Learning Specialization, University of Washington, 4 courses, all topics in detail, $50/month https://github.com/tuanavu/coursera-university-of-washington/tree/master

ML Udacity Price: $1077 / 3 months
DL Udacity (https://github.com/amirkeren/deep-learning) $1436
Applied AI with Deep Learning by IBM

Machine Learning for Analytics MasterTrack™ Certificate, University of Chicago ($4000)

Big Data MasterTrack™ Certificate, Arizona ($4500)

Data Engineering, Big Data, and Machine Learning on GCP Specialization 5 Courses
IBM Data Science Professional Certificate 8 courses
Applied Data Science with Python Specialization, University of Michigan, 5 Courses (https://github.com/AdityaGupta030697/Applied-Machine-Learning-Coursera) (https://github.com/villeristi/applied-machine-learning-in-python)

Machine Learning: Algorithms in the Real World Specialization, Amii, Alberta

Machine Learning and Reinforcement Learning in Finance Specialization (tandon school of university)
Data Science: Statistics and Machine Learning Specialization 5 courses Johns Hopkins
Advanced Machine Learning Specialization 7 Courses, computer vision and NLP (https://github.com/SJD1882/MOOC-Coursera-Advanced-Machine-Learning) (https://github.com/jiadaizhao/Advanced-Machine-Learning-Specialization)
IBM AI Engineering Professional Certificate 6 Courses
Practical Machine Learning on H2O (deep learning)
all Coursera (https://github.com/shenweichen/Coursera)

Data Engineer Nanodegree Program ($1795 / 5 months)
Data Analyst Nanodegree ($1436 / 4 months)
Data Structures and Algorithms Nanodegree ($1436 / 4 months)

Artificial Intelligence Specialist (CV+NLP+DRL+AI+Data + algorithms) very difficult, Udacity
CV $1077/3M
NLP $1077/3M
DRL $1436/4M
AI $1077/3M
AI for Trading $2154/6M
Python A-Z Java ( 2-Month )
best answer how to learn programming
https://www.quora.com/q/zkywxwosxcrxtdwy/All-in-one-CS-CP-resources
C++ / C ( 1-Month ) Javascript ( Client-Side )
Learn Advanced C++ Programming ReactJs
Learn C++ Programming -Beginner to Advance-Angular
Mastering Data Structures & Algorithms using C and C++
C++ Udacity
C++ Coursera
Design Patterns in Modern C++

Intermediate C++ Edx


Advance C++ Edx

https://channel9.msdn.com/Tags/stephan-t-lavavej
C++: From Beginner to Expert – Udemy
Beginning C++ Programming – From Beginner to Beyond – Udemy
NPTEL
The C++ Programming Language, 4th Edition

HTML + CSS + XML + JSON
https://www.levels.fyi/?compare=Microsoft,Facebook&track=Software%20Engineer
https://hackernoon.com/deeplearning-101-coursera-vs-udemy-vs-udacity-b4eb3de06dbe
Time Plan
Building an Effective Machine Learning Workflow with scikit-learn By Data School 129$
https://www.analyticsvidhya.com/blog/2020/01/learning-path-data-scientist-machine-learning-2020/
SQL 100 Pages LinkedIn
https://www.analyticsvidhya.com/blog/2020/01/learning-path-nlp-2020/
TensorFlow 2 Keras API ( Udemy )
RanJ Hindi >> Mathematical Concepts with example
Hisham ML
Hand-on Book ML
Month 1 -2020
probability Waleed https://end-to-end-machine-learning.teachable.com/cour
Stat Waleed statQuest
probability for DS Book jbstatistics.com
Brandon Foltz https://projects.iq.harvard.edu/stat110/youtube
linear algebra waleed

Deeply prob + sts try to understand more more

Month 2 -2020
linear algebra waleed
Linear algebra Gilbert
Linear Algebra from Ahmed Fathi
information theory ( entropy ,cross entropy, KL ) aim:
STS Udacity des + infer 1-all everything about ML in parall
Johns Hopkins STS 2-Expanded my knowledge
StatQuest YouTube Channel Bayesian >> Gaussian >> MCMC >
MCMC + Bayesian + Gaussian 3-DL >> NLP
Discrete Math >> Graph theory 4-Big-Data + Data Engineering
PGM 5-IELTS
Linear algebra Ahmed Fathi All ML (curricula) Udacity
Month 3 -2020 stop all >> start ML Udacity
Udacity Machine Learning
Win Kaggle

Discrete Mathematics ( Graph Theory ) then >> Bayesian >> ML (Coursera download )
DS with algorithms Waleed + Coursera or Udacity IBM Data Science Coursera
SQL Query DataCamp data scientist + ML
Calculus including maximizing and minimizing algebraic equati PGM
Introduction to Version Control git,github time-series
OOP Python + refactoring Hand-On all Topics with details
udemy > learn R
Month 4 -2020 NLP

Bayesian
Applied Michigan
ML > Johns Hopkins
appliedAI
Alberta Machine Intelligence Institute
Feature Engineering & Feature Selection ( check Correlation matrix Topic )
R-Programming A-Z
Projects > Udacity + Udemy + Hisham Kaggle + Analytics Vidhya 27 Projects + Udemy Kaggle master
Time Series
RL
DL Hisham + Ahmed Fathi >> Coursera Andrew + Russia IBM AI Engineering Professional Certificate
Geoffrey Hinton
TensorFlow + Keras + PyTorch
NLP >> Udacity + Coursera
PGM + Optimization
Advanced Python deep dive + Django + Flask + web scraping + Design Patterns + Data Structures and Algorithms

Learn SQL Basics for Data Science Specialization Coursera

Hand-on Book ML in details for each Topics Reinforcement Learning


udemy A-Z (Python + R ) ML michigan
numpy + Data Analysis ( DataCamp ) road to become Data Scientist
Book from scratch
Hisham Kaggle Projects
[Udacity] AI Programming With Python
Nanodegree
Intro to Machine Learning with TensorFlow
Intro to Machine Learning with PyTorch
ML Udacity Certificate
DL Udacity CV NLP
C++ Self Driving Car + robot + flying Car
ML Coursera certificate
IBM Data Science Coursera
ML Edureka
probability
Sts > des > infer Udacity > Johns Hopkins
information theory 3 Topics
Generative

Gaussian
Bayesian
Frequentist statistics
A/B test
Markov Models
>> Linear Algebra
>> Discrete Math >> Graph theory
Probability Graphical models
then ML Theory Topics in details
DL >> RL

1-all everything about ML in parallel (DS and algorithm) Plan-2021 ( IELTS Exam ) + ( Canada P.E ) + ( Coursera Certificate )
2-Expanded my knowledge 1+2+3-Month Big Data Course Edureka >> Coursera >> >>
Bayesian >> Gaussian >> MCMC >> PGM >> University Lectures 4 Blockchain Course + Quantum Computing
5 Software Engineering Skill ( Hammad )
4-Big-Data + Data Engineering

AICourse


DRL AI
Plan-2020
Book ML A Probabilistic Perspective + element
all Topics of Lectures syllabus
Plan for ML Practical
1+2 Prob + STS + Linear
maybe 3 for PGM
4+5 ML revision Hands-on with Topics in details from all
6+7+8 DL Hisham with Andrew >> Udacity
Book Ian + illustrated + TensorFlow + PyTorch + Keras
9 Fully Reinforcement Learning
10 NLP + (R + SAS + SPSS)
11+12 ?? ML Time-Series Analysis
11+12 ?? Projects ************
OOP + DS with algorithms + refactoring Python
cloud services (AWS, GCP, Azure) + Kubernetes + Docker

Travel and work Canada I Can Buy Car ?
prepare all Documents 3400 1
when you apply month 6/2021 Picanto or Opel Astra or Polo 2
30 old >> then certificate >> Master 3
IELTS + P.E + Certificate + BootCamp +Master ( ML + Big-Data ) + Expert python 4
how much money need ? 5
air tickets 6
Somaya prepare for Dentals Exams 7
Ibwave L3 >> in-building 8
9
LinkedIn Certificate :: ?? Ready Account 10
11
12

Plan 2020-2021
ML Coursera
Deep Coursera
Alberta RL Coursera
Data structures and algorithms certificate
PGM Coursera
IBM Data Science Coursera + IBM AI Coursera
GCP Coursera
ML Udacity
DL Udacity
Self driving Car Udacity
practical Time series Certificate Coursera
Big Data San Diego Coursera
course “Strategic Thinking”! #strategicthinking #leadership
GCP Google
Advanced Machine Learning Coursera

Relation with Connections

RF
2G Full
3G
4G VoLTE

5G

Ibwave L3 >> in-building


Book 2Book/Month
Eating Fish Make Time: How to Focus on What Matters Every Day , Jake Knapp, John Zeratsky

You suggested books:


https://brohrer.github.io/mindfulness_reading_list.html
Flow: The Psychology of Optimal Experience , Mihaly Csikszentmihalyi
Rapt: Attention and the Focused Life , Winifred Gallagher

Peace is Every Step: The Path of Mindfulness in Everyday Life, Thich Nhat Hanh
Thinking, Fast and Slow , Daniel Kahneman
Make Time: How to Focus on What Matters Every Day , Jake Knapp, John Zeratsky
Essentialism: The Disciplined Pursuit of Less , Greg McKeown
Subliminal: How Your Unconscious Mind Rules Your Behavior , Leonard Mlodinow
Deep Work: Rules for Focused Success in a Distracted World , Cal Newport
Team Leader Book
2019 Data Science Bowl
Uncover the factors to help measure how young children learn
https://www.kaggle.com/c/data-science-bowl-2019/discussion/127469
Topics: Instructor: Sargur Srihari, Department of Computer Sc
1. Introduction
1. Machine Learning-Overview(28MB)
2. Python and ML Frameworks(13.9MB) Code
3. Linear Algebra(4.5MB) Code
4. Example: Curve Fitting(934KB)
5. Probability Theory(4.9MB) Code

6. Numerical Computation(1.4MB)
7. Decision-Theory(488KB)
8. Information Theory(715KB)
2. Probability Distributions

1. Discrete Distributions(1MB)Code

2. Gaussian Distribution(833KB) Code


3. Gaussian Bayesian Networks(738KB)
3. Linear Models for Regression
1. Regression with Basis Functions(3MB) Code
2. Gradient Descent(1.3MB)
3. Bias-Variance(950KB)
4. Bayesian Regression(3.9MB)
5. Bayesian Model Comparison(478KB)
6. Evidence Approximation(746KB)
7. Example: Computer Science Ranking(126KB)
4. Linear Models for Classification
0. Overview(209KB)
1. Discriminant Functions(1.7MB)
2. Probabilistic Generative Models(1.2MB)
3. Probabilistic Discriminative Models
1. Fixed Basis Functions(124KB)
2. Logistic Regression(1.2MB) Code
3. Iterative Reweighted Least Squares(4MB)
4. Multiclass Logistic Regression(4.6MB)
5. Probit Regression(356KB)
6. Canonical Link Functions(263KB)
4. Laplace Approximation (1MB)
5. Bayesian Logistic Regression(855KB)
6. Variational Bayesian Logistic Regression(500KB)
5. Neural Networks

1. Biology(4.5MB)
2. Feed-forward Network Functions(669KB)
3. Network Training(2.6MB)
4. Backpropagation(8.7MB)
5. The Hessian Matrix(562KB)
6. Regularization in Neural Networks(1.2MB)
1. Convolutional Networks(4.9MB)
2. Soft Weight Sharing(1.2MB)
7. Mixture Density Networks (634KB)
8. Bayesian Neural Networks(716KB)
9. Deep Learning Overview(5.2MB)
10. See course on Deep Learning
6. Kernel Methods
1. Kernel Methods(6.3MB)
2. Radial Basis Function Networks(812KB)
3. Gaussian Processes(6.8MB)
7. Sparse Kernel Machines
1. Support Vector Machines(5.4MB)
2. SVM for Overlapping Distributions(1.3MB)
3. Multiclass SVMs (1.4MB)
4. Relation to Logistic Regression (446KB)
8. Probabilistic Graphical Models

See Course on Probabilistic Graphical Models


9. Mixture Models and EM
0. Unsupervised Learning(1.9MB)
1. K-means Clustering(1.4MB)Code
2. Gaussian Mixture Models(1.5MB) Code
3. Latent Variable View of EM(1.1MB)
4. Bernoulli Mixture Models(3.1MB)
5. Theoretical Basis of EM(693KB)
10. Approximate Inference

1. Approximate Inference(180KB)
2. Variational Inference(3.3MB)
3. Variational Mixture of Gaussians(1MB)
11. Sampling Methods
1. Need for Sampling (6.6MB)
2. Basic Sampling Methods(2.5MB)
3. Markov Chain Monte Carlo Sampling(815KB)
4. Gibbs Sampling(1.2MB)
12. Continuous Latent Variables
1. Principal Components Analysis
See Section 3.2 of course on Data Mining
2. Nonlinear Latent Variable Models

13. Sequential Data


1. Markov Models(2.5MB)
2. Hidden Markov Models(3.1MB)
1. Maximum Likelihood for the HMM(8.5MB)
2. The forward-backward algorithm(15.9MB)
3. Extensions to HMMs(287KB)
3. Linear Dynamical Systems(217KB)
4. Conditional Random Fields(1.6MB)
14. Combining Models
1. Combining Models(pdf, 1.7MB)
2. Bagging(pdf, 675KB)
3. Boosting(pdf, 1.1MB)
4. Tree Models
1. Decision Trees(pdf, 1.9MB)
2. Learning Trees(pdf, 596KB)
5. Random Forests(pdf, 3.4MB)
15. Reinforcement Learning
1. Reinforcement Learning Overview(pdf 4MB)
2. The Learning Task (pdf 1MB)
3. Q-Learning (pdf 6MB)
4. Nondeterministic Q-Learning (pdf 4.9MB)
5. Deep Reinforcement Learning (pdf 3.5MB)
16. AI Ethics
1. AI Ethics(pdf 8.8MB)
17. Trustworthy AI
1. Trustworthy AI(pdf 2.9MB)
2. Explainable AI(pdf 24MB)
1. Explanation by Example(pdf 3.4MB)
2. Deep Explanation(pdf 14.6MB)
3. Causal Explanation(pdf 16.4MB)
18. Concept Learning
1. Hypothesis Space (pdf, 111KB)
2. Candidate Elimination (pdf,236KB)
19. Computational Learning Theory
1. PAC Learning(pdf, 98KB)
2. VC Dimension(pdf, 321KB)
3. Mistake Bound(pdf, 51KB)
Deep Learning Pattern Recognition Class (2012) UniHeidelberg Youtube Channel
1. Overview Syllabus:
1. Introduction to Deep Learning(19MB) 1. Introduction

2. Overview of Deep Learning(15MB) 1.1 Applications of Pattern Recognition


3. Depth in Deep Learning(2MB) 1.2 k-Nearest Neighbors Classification
4. Historical Trends in Deep Learning(9.9MB) 1.3 Probability Theory
1.4 Statistical Decision Theory
I. Applied Math and Machine Learning
Basics
2. Linear Algebra 2. Correlation measures, Gaussian Models
1. Linear Algebra for Machine Learning(2.1MB) 2.1 Pearson Correlation
3. Probability and Information
Theory 2.2 Alternative Correlation Measures
1. Probability and Information Theory(2.1MB) 2.3 Gaussian Graphical Models
4. Numerical Computation and
Gradient-based Optimization 2.4 Discriminant Analysis
1. Numerical Computation(228KB)
2. Gradient-based Optimization(6.6MB) 3. Dimensionality Reduction
5. Machine Learning Basics 3.1 Regularized LDA/QDA
1. Definition of Learning (5.3MB) 3.2 Principal Component Analysis (PCA)
2. Capacity:Underfitting,overfitting(1.7MB) 3.3 Bilinear Decompositions
3. Hyperparameters and Validation Sets(490KB)
4. Estimators, Bias and Variance(741KB) 4. Neural Networks
5. Maximum Likelihood Estimation(367KB) 4.1 History of Neural Networks
6. Bayesian Statistics(128KB) 4.2 Perceptrons
7. Supervised Learning Algorithms(350KB) 4.3 Multilayer Perceptrons
8. Unsupervised Learning Algorithms(299KB) 4.4 The Projection Trick
9. Stochastic Gradient Descent(366KB) 4.5 Radial Basis Function Networks
10. Building a Machine Learning Algorithm(259KB)

11. Challenges Motivating Deep Learning(2.4MB) 5. Support Vector Machines


12. Software Libraries for Deep
Learning 5.1 Loss Functions
1. Python Libraries (130KB) 5.2 Linear Soft-Margin SVM
2. Tensorflow 5.3 Nonlinear SVM
3. Fizzbuzz in Tensorflow(130KB)
II. Deep Networks: Modern Practice 6. Kernels, Random Forest
6. Deep Feed-forward Networks 6.1 Kernels
1. Feed-forward Networks(2.1MB) 6.2 One-Class SVM
2. Gradient-based Learning(2.3MB) 6.3 Random Forest
3. Hidden Units(1MB) 6.4 Random Forest Feature Importance
4. Architecture Design(3.2MB)
5. Backward Propagation and
Differentiation 7. Regression
1. Forward/Backward Propagation(2.3MB) 7.1 Least-Squares Regression
2. Computational Graphs(2.4MB) 7.2 Optimum Experimental Design
3. Chain Rule in Backprop(543KB) 7.3 Case Study: Functional MRI
4. Symbol-symbol Derivatives(203KB) 7.4 Case Study: Computer Tomography
5. General Backprop(437KB) 7.5 Regularized Regression
6. Other Differentiation Algorithms(74KB)
7. Regularization 8. Gaussian Processes
0. Regularization: Overview(107KB) 8.1 Gaussian Process Regression
1. Parameter Penalties(431KB) 8.2 GP Regression: Interpretation
2. Norm Penalties as Constrained Optimization(127KB) 8.3 Gaussian Stochastic Processes
3. Regularization and Underconstrained Problems(1 8.4 Covariance Function
4. Data Augmentation(1.5MB)
5. Noise Robustness(51KB) 9. Unsupervised Learning
6. Semi-Supervised Learning(61KB) 9.1 Kernel Density Estimation
7. Multi-Task Learning(168KB) 9.2 Cluster Analysis
8. Early Stopping(361KB) 9.3 Expectation Maximization
9. Parameter Tying and Parameter Sharing(104KB) 9.4 Gaussian Mixture Models
10. Sparse Representations(218KB)
11. Bagging(321KB) 10. Directed Graphical Models
12. Dropout(425KB) 10.1 Bayesian Networks
13. Adversarial Training(3.5MB) 10.2 Variable Elimination
8. Optimization for Training Deep
Models 10.3 Message Passing
0. Optimization for Training Deep Learning(2.5MB) 10.4 State Space Models
1. How Learning Differs from Pure Optimization(628KB)
2. Challenges in Neural Network Optimization(2.6MB) 11. Optimization

3. Basic Algorithms: SGD and Momentum(3.2MB) 11.1 The Lagrangian Method


4. Parameter Initialization Strategies(84KB) 11.2 Constraint Qualifications
5. Adaptive Learning Rates: RMSProp, Adam(1MB) 11.3 Linear Programming
6. Approximate Second-Order Methods: Newton, B 11.4 The Simplex Algorithm
7. Optimization Strategies and
Meta-Algorithms
1. Batch Normalization(302KB) 12. Structured Learning
2. Coordinate Descent(92KB) 12.1 structSVM
3. Pretraining(623KB) 12.2 Cutting Planes
9. Convolutional Networks
0. Convolutional Networks Overview(1.4MB)
1. The Convolution Operation(693KB)
2. CNN-Motivation(2.9MB)
3. Pooling(918KB)
4. Convolution-Pooling as Strong Prior(416KB)
5. CNN Variants(1MB)
6. Structured Output(163KB)
7. Capsule Networks(1.4MB)
10. Sequence Modeling: Recurrent
and Recursive Nets
0. Sequence Modeling(2.3MB)
1. Unfolding Graphs(1.8MB)
2. Recurrent Neural Networks
1. Recurrent Neural Networks(618KB)
2. Teacher Forcing(1.4MB)
3. RNN Gradients(1.3MB)
4. RNNs as PGMs(1.8MB)
5. RNNs Conditioned in Context(1.3MB)
3. Bidirectional RNNs(253KB)
4. Encoder-Decoder Architecture(1.4MB)
5. Deep Recurrent Networks(419KB)
6. Recursive Neural Networks(167KB)
7. Long-Term Dependencies(214MB)
8. Leaky Units(87KB)
9. Long Short-Term Memory(2.1MB)
11. Practical Methodology
1. Practical Design Process(53KB)
2. Performance Metrics(237KB)
3. Default Baseline Models(53KB)
4. Whether to gather more data(333KB)
5. Selecting hyperparameters(763KB)
6. Debugging Strategies(54KB)
12. Applications
1. Large Scale Systems(9.2MB)
2. Computer Vision(1.2MB)
3. Natural Language Processing
0. NLP Overview(384KB)
1. N-grams(809KB)
2. Neural Language Models(3.7MB)
3. High Dimensional Outputs(989KB)
4. Neural Machine Translation(372KB)
5. Attention Models(6.3MB)
4. Recommender Systems(2.8MB)
5. Knowledge Representation, Reasoning & QA(466KB)
6. Knowledge Graphs(3.5MB)
III. Deep Learning Research
13. Linear Factor Models
1. Linear Factor Models(11.9MB)
14. Autoencoders
1. Autoencoders(5.2MB)
15. Representation Learning
0. Overview of Representation Learning(10.1MB)
1. Greedy Layer-wise Unsupervised Pretraining(436KB)
2. Transfer Learning and Domain Adaptation(981KB)
3. Semi-supervised Disentangling of Causal Factors(4.2MB)
4. Distributed Representation(2MB)
5. Exponential Gains from Depth(65KB)
6. Providing Clues to Discover Underlying Causes(65KB)
16. Structured Probabilistic Models
for Deep Learning
1. Modeling Unstructured Data (6.5MB)
2. Using Graphs to Represent Model Structure(2.9MB)
3. Sampling from PGMs(183KB)
4. Advantages of Structured Modeling(62KB)
5. Learning about Dependencies(125KB)
6. Inference and Approximate Inference(88KB)
7. The Deep Learning Approach to PGMs(728KB)
17. Monte Carlo Methods
1. Monte Carlo Methods(153KB)
2. Importance Sampling(3.4MB)
3. Markov Chain Monte Carlo Methods(2.6MB)
4. Gibbs Sampling(419KB)
5. The Challenge of Mixing between Separated Modes(1.9MB)
18. The Partition Function
1. The Partition Function(117KB)
2. The Log-likelihood Gradient(167KB)
3. Training an RBM: Contrastive Divergence(865KB)
19. Approximate Inference
0. Approximate Inference Overview(285KB)
1. Inference as Optimization(807KB)
2. Expectation Maximization(232KB)
3. MAP Inference and Sparse Coding(232KB)
4. Variational Inference and Learning(583KB)
5. Learned Approximate Inference (232KB)
20. Deep Generative Models
0. Deep Generative Model Overview(153KB)
1. Boltzmann Machines(5MB)
2. Restricted Boltzmann Machines(447KB)
3. Deep Belief Networks (DBNs)(277KB)
4. Deep Boltzmann Machines (DBMs)(991KB)
5. Directed Generative Nets
1. Sigmoid Belief Networks(68KB)
2. Differentiable Generator Nets(605KB)
6. Variational Autoencoders
(VAEs)
1. VAE: PGM View(11.7MB)
2. VAE: Neural Network View(12.1MB)
3. VAE Applications(9.6MB)
7. Generative Adversarial
Networks (GANs)
1. GAN Motivation(26MB)
2. GAN Theory(7.7MB)
3. GAN Mode Collapse(4.7MB)
4. Wasserstein GAN(18.8MB)
5. GAN Variants(25MB)
Youtube Channel
Syllabus: www.cs.ubc.ca/~murphyk
Data visualization, probabilistic models, MLE

Basic concepts
Linear regression
Linear algebra, Ridge regression
Logistic regression

MVN, LDA/QDA
Naive Bayes; Beta-Binomial model
Bayesian concept learning; Beta-Binomial; Dirichlet-Multinomial

Bayesian parameter estimation for Gaussians, generative classifiers, linear and logistic regression
Decision theory ; model selection

Midterm
Feature selection
L1 regularization

Mixture models, EM, non-parametric models


Guest lecture by Matt Brown on applications of non-parametric regression
Directed graphical models
Conditional mixture models, sparse Bayesian learning, EM as bound optimization
Remembrance day
Kalman filters
PCA
Markov models
HMMs
MCMC
Syllabus: Buffalo Univ
Introduction

Machine Learning-Overview(28MB)
Python and ML Frameworks(13.9MB) Code
Linear Algebra(4.5MB) Code
Example: Curve Fitting(934KB)

Probability Theory(4.9MB) Code


Numerical Computation(1.4MB)
Decision-Theory(488KB)

Information Theory(715KB)
Probability Distributions

Discrete Distributions(1MB)Code
Gaussian Distribution(833KB) Code
Gaussian Bayesian Networks(738KB)

Linear Models for Regression


Regression with Basis Functions(3MB)Code
Gradient Descent(1.3MB)
Bias-Variance(950KB)
Bayesian Regression(3.9MB)
Bayesian Model Comparison(478KB)
Evidence Approximation(746KB)
Example: Computer Science Ranking(126KB)
Linear Models for Classification
Overview(209KB)
Discriminant Functions(1.7MB)

Probabilistic Generative Models(1.2MB)

Probabilistic Discriminative Models


Fixed Basis Functions(124KB)
Logistic Regression(1.2MB) Code
Iterative Reweighted Least Squares(4MB)
Multiclass Logistic Regression(4.6MB)
Probit Regression(356KB)
Canonical Link Functions(263KB)
Laplace Approximation (1MB)
Bayesian Logistic Regression(855KB)
Variational Bayesian Logistic Regression(500KB)

Neural Networks
Biology(4.5MB)
Feed-forward Network Functions(669KB)
Network Training(2.6MB)
Backpropagation(8.7MB)
The Hessian Matrix(562KB)
Regularization in Neural Networks(1.2MB)
Convolutional Networks(4.9MB)
Soft Weight Sharing(1.2MB)
Mixture Density Networks (634KB)
Bayesian Neural Networks(716KB)
Deep Learning Overview(5.2MB)
See course on Deep Learning
Kernel Methods
Kernel Methods(6.3MB)
Radial Basis Function Networks(812KB)
Gaussian Processes(6.8MB)
Sparse Kernel Machines
Support Vector Machines(5.4MB)
SVM for Overlapping Distributions(1.3MB)
Multiclass SVMs (1.4MB)
Relation to Logistic Regression (446KB)

Probabilistic Graphical Models


See Course on Probabilistic Graphical Models
Mixture Models and EM
Unsupervised Learning(1.9MB)

K-means Clustering(1.4MB)Code
Gaussian Mixture Models(1.5MB) Code
Latent Variable View of EM(1.1MB)
Bernoulli Mixture Models(3.1MB)

Theoretical Basis of EM(693KB)


Approximate Inference
Approximate Inference(180KB)
Variational Inference(3.3MB)
Variational Mixture of Gaussians(1MB)
Sampling Methods
Need for Sampling (6.6MB)
Basic Sampling Methods(2.5MB)
Markov Chain Monte Carlo Sampling(815KB)
Gibbs Sampling(1.2MB)
Continuous Latent Variables
Principal Components Analysis
See Section 3.2 of course on Data Mining

Nonlinear Latent Variable Models


Sequential Data
Markov Models(2.5MB)
Hidden Markov Models(3.1MB)
Maximum Likelihood for the HMM(8.5MB)
The forward-backward algorithm(15.9MB)
Extensions to HMMs(287KB)
Linear Dynamical Systems(217KB)
Conditional Random Fields(1.6MB)
Combining Models
Combining Models(pdf, 1.7MB)
Bagging(pdf, 675KB)
Boosting(pdf, 1.1MB)
Tree Models
Decision Trees(pdf, 1.9MB)
Learning Trees(pdf, 596KB)
Random Forests(pdf, 3.4MB)
Reinforcement Learning
Reinforcement Learning Overview(pdf 4MB)
The Learning Task (pdf 1MB)
Q-Learning (pdf 6MB)
Nondeterministic Q-Learning (pdf 4.9MB)
Deep Reinforcement Learning (pdf 3.5MB)
AI Ethics
AI Ethics(pdf 8.8MB)
Trustworthy AI
Trustworthy AI(pdf 2.9MB)
Explainable AI(pdf 24MB)
Explanation by Example(pdf 3.4MB)
Deep Explanation(pdf 14.6MB)
Causal Explanation(pdf 16.4MB)
Concept Learning
Hypothesis Space (pdf, 111KB)
Candidate Elimination (pdf,236KB)
Computational Learning Theory
PAC Learning(pdf, 98KB)
VC Dimension(pdf, 321KB)
Mistake Bound(pdf, 51KB)
Course Schedule
Week 1   Class overview: Class organization, topics overview, software etc.
         Introduction: what is ML; Problems, data, and tools; Visualization; Matlab (I)
Week 2   Linear regression; SSE; gradient descent; closed form; normal equations; features
         Overfitting and complexity; training, validation, test data, and introduction to Matlab (II)
Week 3   Classification problems; decision boundaries; nearest neighbor methods
Week 4   Probability and classification, Bayes optimal decisions
         Naive Bayes and Gaussian class-conditional distribution
Week 5   Linear classifiers
         Bayes' Rule and Naive Bayes Model
Week 6   Logistic regression, online gradient descent, Neural Networks
         Decision tree and Review for Mid-term
Week 7   Mid-term
Week 8   Ensemble methods: Bagging, random forests, boosting
         A more detailed discussion on Decision Tree and Boosting
Week 9   Unsupervised learning: clustering, k-means, hierarchical agglomeration
Week 10  Advanced discussion on clustering and EM
Week 11  Latent space methods; PCA.
Week 12  Text representations; naive Bayes and multinomial models; clustering and latent space models
Week 13  VC-dimension, structural risk minimization; margin methods and support vector machines (SVM)
Week 14  Support vector machines and large-margin classifiers
         Time series; Markov models; autoregressive models

BC University https://www.cs.ubc.ca/~schmidtm/Courses/LecturesOnML/
1. Supervised Learning
Overview
Exploratory Data Analysis
Decision Trees (Notes on Big-O Notation)
Fundamentals of Learning (Notation Guide)
Probabilistic Classifiers (Probability Slides, Notes on Probability)
Non-Parametric Models
Ensemble Methods
2. Unsupervised Learning
Clustering
More Clustering
Outlier Detection
Finding Similar Items
3. Linear Models
Linear Regression (Notes on Calculus, Notes on Linear Algebra, Notes on
Nonlinear Regression
Gradient Descent
Robust Regression
Feature Selection
Regularization
More Regularization
Linear Classifiers
More Linear Classifiers
Multi-Class Linear Classifiers
Kernel Methods
Stochastic Gradient


Maximum Likelihood (Notes on Max and Argmax)
MAP Estimation
4. Latent-Factor Models
Principal Component Analysis
More PCA
Sparse Matrix Factorization
Recommender Systems
Nonlinear Dimensionality Reduction
5. Deep Learning

Neural Networks
More Neural Networks
Even More Neural Networks
Convolutional Neural Networks
More CNNs, Boosting
Part 2: Data Science 573 and 575
The second set of notes are from courses I've taught in UBC's Master of Data Science (MDS) program in 2017 and 2018, which could naturally follow the topics above.
Structure Learning
Sequence Mining
Semi-Supervised Learning
PageRank
Markov Chains and Monte Carlo
Part 3: Computer Science 540
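The third set of notes is from the January-April 2019 offering CPSC 540, a graduate-level course on machine learning. Related readings and assignments are available from the course homepage. This course is intended as a continuation on CPSC 340 and the notation in this course is almost the same, except that we switch to using superscripts to refer to
Videos covering the first month of material in the 2016 offering are available here. Note that the material has gone through some substantial improvement since then.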

A. Fundamentals
340 Overview
Fundamentals of Learning
Convex Optimization (Notes on Norms)
B. Large-Scale Machine Learning
Gradient Descent Convergence

Rates of Convergence (Notes on Convexity Inequalities, Notes on Implementing Gradient Descent)


Subgradients
Proximal Gradient
Structured Regularization

Coordinate Optimization
Stochastic Subgradient
SGD Convergence Rate
Stochastic Average Gradient

Kernel Methods and Fenchel Duality


C. Density Estimation
Density Estimation
Multivariate Gaussians
Mixture Models
Expectation Maximization (Notes on EM)
Kernel Density Estimation
Probabilistic PCA, Factor Analysis, Independent Component Analysis
D. Graphical Models
Markov Chains
Monte Carlo Methods
Message Passing
Hidden Markov Models

DAG Models
More DAGs
Undirected Graphical Models
Approximate Inference
Log-Linear Models
Boltzmann Machines
E. Discriminative Models
Conditional Random Fields
Structured SVMs
Deep Structured Models
Fully-Convolutional Networks
Recurrent Neural Networks
Long Short Term Memory
F. Bayesian Learning
Bayesian Statistics
Empirical Bayes
Hierarchical Bayes
Topic Models
More Approximate Inference
Non-Parametric Bayes
VAEs and GANs
Part 4: Machine Learning Reading Group
The final set of notes are topics that I have not covered in a formal course, but where I've given overviews in our machine learning reading group.
Parallel and Distributed Machine Learning
Online, Active, and Causal Learning
Reinforcement Learning
Overview of Other Large/Notable Topics
Syllabus

Topics covered include: Algorithmic models of learning. Learning classifiers, functions, relations, grammars, probabilistic models, value functions, behaviors and programs from experience. Bayesian, maximum a posteriori, and minimum description length frameworks. Parameter estimation, sufficient statistics, decision trees, neural networks, support vector machines, Bayesian networks, bag of words classifiers, N-gram models; Markov and Hidden Markov models, probabilistic relational models, association rules, nearest neighbor classifiers, locally weighted regression, ensemble classifiers. Computational learning theory, mistake bound analysis, sample complexity analysis, VC dimension, Occam learning, accuracy and confidence boosting. Dimensionality reduction, feature selection and visualization. Clustering, mixture models, k-means clustering, hierarchical clustering, distributional clustering. Reinforcement learning; Learning from heterogeneous, distributed, data and knowledge. Selected applications in data mining, automated knowledge acquisition, pattern recognition, program synthesis, text and language processing, internet-based information systems, human-computer interaction, semantic web, and bioinformatics and computational biology.
Course Material
Machine learning and statistics tie into many different fields, in
Overarching Themes
Five important problems in data mining: classification, clusteri
The "top 10 algorithms in data mining"
Frameworks for knowledge discovery (CRISP-DM, KDD)
Priors in statistics
Concepts
Training and testing, cross-validation
Overfitting/underfitting, structural risk minimization, bias/varia
Regularized learning equation
Conjugate priors and exponential families
Algorithms (some covered in more depth than others)
Apriori (for association rule mining)
k-NN (for classification)
k-means (for clustering)
Naive Bayes (for classification)
Decision trees (for classification)
Perceptron (for classification)
SVM (for classification)
AdaBoost and RankBoost (classification and ranking)
Hierarchical Bayesian modeling (for density estimation), i
History
Selected topics from the history of machine learning and statis
Theory
SVM derivation: convex optimization, Hilbert spaces, repr
Large deviation bounds and generalization bounds: Hoeff

Self Notes on ML and Stats.
Contents
Basics
Supervised, unsupervised, reinforcement
Bias-variance trade-off
Overfitting, underfitting
Gradient descent: batch, stochastic
Resampling methods
Bootstrapping
Cross-Validation
Linear discriminant analysis (LDA)
Principal Component Analysis (PCA)
Learning Vector Quantization (LVQ)
Regularization methods: Ridge, LASSO
Kernel smoothing methods
Ensemble learning: Bagging (bootstrap aggr
Ordinary least squares
Partial least squares
Kernel density estimation
Radial basis functions
Multi co-linearity: Detection (Tolerance
AIC, BIC
K-fold cross validation
Generalized Additive Models (GAMs)
Multivariate Adaptive Regression Spl
Gradient boosting

NLP
Word sense disambiguation
Pronoun resolution
Machine translation
Tokenization
Regular expressions
Stemming
Lemmatization
Evaluation metrics

AUC
Precision
Recall
Specificity
Mean absolute percentage error
Root mean square error
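
As a quick reference for the metrics listed above, here is a minimal sketch in plain Python. The function names and the 0/1 label encoding are assumptions made for illustration; AUC is left out because it needs ranked scores rather than hard predictions.

```python
import math

def confusion_counts(y_true, y_pred):
    """Count true/false positives/negatives for binary labels encoded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

def precision(y_true, y_pred):
    tp, _, fp, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp)  # share of predicted positives that are correct

def recall(y_true, y_pred):
    tp, _, _, fn = confusion_counts(y_true, y_pred)
    return tp / (tp + fn)  # share of actual positives that were found

def specificity(y_true, y_pred):
    _, tn, fp, _ = confusion_counts(y_true, y_pred)
    return tn / (tn + fp)  # share of actual negatives that were found

def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)

y_true = [1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0]
print(precision(y_true, y_pred), recall(y_true, y_pred), specificity(y_true, y_pred))
```
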
Algorithms
Linear regression: Usually performed through OLS
Logistic regression
Naive Bayes
K-Nearest Neighbors
K means clustering
Classification and regression trees(CARTs)
Support vector machines
AdaBoost
Random forest
ARIMA
Decision Trees
ID3
CHAID
C4.5, C5.0
Hierarchical Clustering

Miscellaneous
Curse of dimensionality
No free lunch theorem
Occam's razor

Deep Learning
Neural Networks
Bayesian neural nets
Deep Boltzmann Machine(DBM)

Deep Belief Networks(DBN)


Convolutional Neural Networks
Stats
Hypothesis testing
Confidence intervals
P-value
different types of tests
Types of Sampling
Scedasticity
2) Gradient descent

Gradient descent is an optimization algorithm used to find the values of parameters (coefficients) of a fu

Batch Gradient Descent


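One iteration of the algorithm is called one batch and this form of gradient descent is referred to as batch gradient descent. Batch gradient descent is the most common form of gradient descent described in machine learning.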

Stochastic gradient descent

In this variation of gradient descent, the update to the coefficients is performed for each training instance, rather than at the end of the batch of instances.
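As a rough illustration of the difference between the two variants, the sketch below fits a one-input linear model with a squared-error cost. The function names, learning rates, epoch counts and toy data are assumptions chosen for this example, not part of the original notes.

```python
import random

def batch_gd(xs, ys, lr=0.05, epochs=2000):
    """Batch gradient descent: one coefficient update per pass over all instances."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradient of the mean squared error, accumulated over the whole batch.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def sgd(xs, ys, lr=0.01, epochs=2000, seed=0):
    """Stochastic gradient descent: one coefficient update per training instance."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    data = list(zip(xs, ys))
    for _ in range(epochs):
        rng.shuffle(data)  # visit the instances in a new random order each epoch
        for x, y in data:
            err = w * x + b - y
            w -= lr * 2 * err * x
            b -= lr * 2 * err
    return w, b

# Hypothetical noise-free toy data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w_batch, b_batch = batch_gd(xs, ys)
w_sgd, b_sgd = sgd(xs, ys)
```

On this toy data both variants should approach a slope near 2 and an intercept near 1; the only structural difference is where the update happens relative to the loop over instances.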

3 a)Bootstrapping

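In the context of machine learning, bootstrapping means drawing random samples, with replacement, from another sample to generate a new sample that has a balance between the number of samples per class. This is useful when we'd like to model against a dataset with highly unbalanced classes.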
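A minimal sketch of class-balanced resampling, assuming sampling with replacement within each class; the helper name `balanced_bootstrap` and its parameters are hypothetical.

```python
import random
from collections import defaultdict

def balanced_bootstrap(rows, labels, per_class, seed=0):
    """Draw `per_class` rows with replacement from every class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    new_rows, new_labels = [], []
    for label in sorted(by_class):
        for _ in range(per_class):
            # rng.choice samples with replacement, so rows may repeat.
            new_rows.append(rng.choice(by_class[label]))
            new_labels.append(label)
    return new_rows, new_labels

# Hypothetical unbalanced dataset: 8 rows of class "a", 2 rows of class "b".
rows = list(range(10))
labels = ["a"] * 8 + ["b"] * 2
new_rows, new_labels = balanced_bootstrap(rows, labels, per_class=5)
```

The resampled set has the same number of rows for each class, at the cost of repeating rows from the minority class.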

3 b) Cross Validation

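Cross-validation (also called rotation estimation) is a method to estimate how well a model trained on a training dataset generalizes to unseen data. In cross-validation we split the training dataset into N splits and then separate the splits into training and test groups. We train on the training group of splits and then test the model on the test group of splits. We rotate the splits between the two groups until we've exhausted all the variations.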
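The rotation of splits can be sketched as index bookkeeping. The function name `k_fold_splits` is an assumption for illustration, and the sketch leaves out shuffling, which is usually applied first.

```python
def k_fold_splits(n_samples, k):
    """Return k (train_indices, test_indices) pairs; each index is tested exactly once."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    splits = []
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        splits.append((train_idx, test_idx))
        start += size
    return splits

splits = k_fold_splits(10, 3)
```

Each pair would be used to train on `train_idx` and score on `test_idx`, and the k scores averaged.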

4 ) LDA

Parametric, linear classification algorithm used to classify more than two categories.

If you have more than two classes, then Linear Discriminant Analysis is the preferred linear classification technique.

22 a) Linear Regression

It is a linear, parametric, regression algorithm.

Different techniques can be used to prepare or train the linear regression equation from data, the most common of which is called Ordinary Least Squares.

OLS:

When we have more than one input we can use Ordinary Least Squares to estimate the values of the coefficients. The Ordinary Least Squares procedure seeks to minimize the sum of the squared residuals.
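For intuition, the single-input case has a simple closed form. The sketch below assumes one input variable (the multi-input case needs matrix algebra), and the name `ols_fit` is hypothetical.

```python
def ols_fit(xs, ys):
    """Closed-form least squares for y = slope * x + intercept (one input)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def sum_squared_residuals(xs, ys, slope, intercept):
    """The quantity that Ordinary Least Squares minimizes."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))

slope, intercept = ols_fit([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```

Any other line through the same points would give a sum of squared residuals at least as large as the fitted one.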

Gradient Descent:

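When there are one or more inputs you can use a process of optimizing the values of the coefficients by iteratively minimizing the error of the model on your training data. This operation is called Gradient Descent.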

Ridge and LASSO:

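There are extensions of the training of the linear model called regularization methods. These seek both to minimize the sum of the squared error of the model on the training data (using Ordinary Least Squares) and to reduce the complexity of the model (like the number or absolute size of the sum of all coefficients in the model).

Two popular examples of regularization procedures for linear regression are:

Lasso Regression: where Ordinary Least Squares is modified to also minimize the sum of the absolute values of the coefficients (called L1 regularization).

Ridge Regression: where Ordinary Least Squares is modified to also minimize the sum of the squared coefficients (called L2 regularization).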
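The two penalties differ only in the term added to the squared error. The sketch below writes both cost functions out; the penalty weight `alpha` and the function names are assumptions for illustration, and the intercept is left unpenalized, which is a common convention.

```python
def sum_squared_error(coefs, intercept, X, y):
    """Sum of squared residuals of a linear model on rows X with targets y."""
    total = 0.0
    for row, target in zip(X, y):
        prediction = intercept + sum(c * v for c, v in zip(coefs, row))
        total += (target - prediction) ** 2
    return total

def lasso_cost(coefs, intercept, X, y, alpha):
    """Squared error plus the L1 penalty: alpha * sum of |coefficient|."""
    return sum_squared_error(coefs, intercept, X, y) + alpha * sum(abs(c) for c in coefs)

def ridge_cost(coefs, intercept, X, y, alpha):
    """Squared error plus the L2 penalty: alpha * sum of coefficient squared."""
    return sum_squared_error(coefs, intercept, X, y) + alpha * sum(c * c for c in coefs)

# Hypothetical toy problem that the coefficients fit exactly (zero squared error).
X = [[1.0, 0.0], [0.0, 1.0]]
y = [3.0, -4.0]
coefs = [3.0, -4.0]
l1 = lasso_cost(coefs, 0.0, X, y, alpha=0.5)
l2 = ridge_cost(coefs, 0.0, X, y, alpha=0.5)
```

With zero squared error, what remains is the penalty alone: the L1 term grows linearly with coefficient size, the L2 term quadratically.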

Assumptions for linear regression:


Linear Assumption: Linear regression assumes that the relationship between your input and output is linear.
Remove Noise
Remove Collinearity
Gaussian Distributions: Linear regression will make more reliable predictions if your input and output variables have a Gaussian distribution.
Rescale Inputs: Linear regression will often make more reliable predictions if you rescale input variables using standardization or normalization.
22 b) Logistic regression

It is a linear, parametric classification algorithm. One-vs-Rest logistic regression can be used to classify multiple categories.

Logistic regression is named for the function used at the core of the method, the logistic function, also called the sigmoid function.

Logistic regression models the probability of the default class.

Logistic regression is a linear method, but the predictions are transformed using the logistic function.
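A small sketch of that transformation: a linear combination of the inputs is passed through the logistic (sigmoid) function to give a probability. The coefficient values and function names are hypothetical.

```python
import math

def sigmoid(z):
    """Logistic function, written to avoid overflow for large negative z."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(coefs, intercept, x):
    """Probability of the default class for one input row x."""
    z = intercept + sum(c * v for c, v in zip(coefs, x))
    return sigmoid(z)

p = predict_proba([1.0, -2.0], 0.5, [1.5, 1.0])
```

Here the linear part evaluates to 0, so the predicted probability sits exactly on the 0.5 decision boundary.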

22 c) Naive bayes

Classification algorithm

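It is called naive Bayes or idiot Bayes because the calculation of the probabilities for each hypothesis is simplified to make it tractable. Rather than attempting to calculate the joint probability of the attribute values P(d1, d2, d3|h), they are assumed to be conditionally independent given the target value and calculated as P(d1|h) × P(d2|h) and so on. This is a very strong assumption that is most unlikely in real data.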
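The independence assumption turns the calculation into a product of per-attribute probabilities. In the sketch below the priors and conditional probabilities are made-up numbers, and the function name is hypothetical.

```python
def naive_bayes_scores(priors, conditionals):
    """Score each hypothesis h as P(h) * P(d1|h) * P(d2|h) * ...

    priors:       {h: P(h)}
    conditionals: {h: [P(d1|h), P(d2|h), ...]} for the observed attribute values
    """
    scores = {}
    for h, prior in priors.items():
        score = prior
        for p in conditionals[h]:
            score *= p
        scores[h] = score
    return scores

# Hypothetical two-class example with two observed attributes.
priors = {"spam": 0.4, "ham": 0.6}
conditionals = {"spam": [0.8, 0.5], "ham": [0.1, 0.3]}
scores = naive_bayes_scores(priors, conditionals)
best = max(scores, key=scores.get)
```

The scores are unnormalized; dividing each by their sum would give posterior probabilities, but the class with the largest score is the same either way.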

22 d )KNN

The k-Nearest Neighbors (KNN) algorithm is a non-parametric method for classification and regression.

When KNN is used for regression problems, the prediction is based on the mean or the median of the K most similar instances.
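A minimal sketch of KNN regression using the mean of the neighbours, assuming Euclidean distance as the similarity measure; `knn_regress` is a hypothetical name.

```python
import math

def knn_regress(train_X, train_y, query, k=3):
    """Predict the mean target of the k training points closest to `query`."""
    by_distance = sorted(
        (math.dist(x, query), y) for x, y in zip(train_X, train_y)
    )
    nearest = [y for _, y in by_distance[:k]]
    return sum(nearest) / len(nearest)

# Hypothetical one-dimensional training data.
train_X = [(0.0,), (1.0,), (2.0,), (10.0,)]
train_y = [0.0, 1.0, 2.0, 10.0]
prediction = knn_regress(train_X, train_y, (1.0,), k=3)
```

The distant point at 10 is ignored because only the three closest instances contribute to the prediction.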

22 f) CART

It is a nonlinear algorithm

Classically, this algorithm is referred to as decision trees

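For classification the Gini cost function is used, which provides an indication of how pure the leaf nodes are (how mixed the training data assigned to each node is).

The most common stopping procedure is to use a minimum count on the number of training instances assigned to each leaf node.

You can use pruning after learning your tree to further lift performance. The complexity of a decision tree is defined as the number of splits in the tree. Simpler trees are preferred. They are easy to understand.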

22 g) SVM

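Non-parametric
The support vector machine is a generalization of a simple and intuitive classifier called the maximal margin classifier.

The support vector machine is a further extension of the support vector classifier in order to accommodate non-linear class boundaries.

A hyperplane is a subspace whose dimension is one less than that of its ambient space.

The maximal margin hyperplane (also known as the optimal separating hyperplane) is the separating hyperplane that is farthest from the training observations. That is, we can compute the (perpendicular) distance from each training observation to a given separating hyperplane; the smallest such distance is the minimal distance from the observations to the hyperplane, and is known as the margin.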
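The margin of a given hyperplane can be computed directly from that definition. The sketch assumes the hyperplane is written as w · x + b = 0; the weights, offset and points are made-up values.

```python
import math

def distance_to_hyperplane(w, b, x):
    """Perpendicular distance from point x to the hyperplane w . x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / norm

def margin(w, b, points):
    """Smallest distance from any training observation to the hyperplane."""
    return min(distance_to_hyperplane(w, b, x) for x in points)

# Hypothetical hyperplane 3*x1 + 4*x2 - 5 = 0 and two observations.
w, b = (3.0, 4.0), -5.0
points = [(3.0, 4.0), (1.0, 1.0)]
m = margin(w, b, points)
```

The maximal margin classifier searches over w and b for the separating hyperplane that makes this value as large as possible; the sketch only evaluates one fixed candidate.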

22 k) Decision Trees

It is a Non-parametric, nonlinear algorithm

There are many good ways to decide the variable which should be used for splitting. Below are a few:

Gini Index (used for the popular CART algorithm)


A perfect separation results in a Gini score of 0
Formula for Gini Index:
Gini = 1 - Σ (pi)^2
where pi is the probability of an object being classified to a particular class.
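As a quick check of the index, the sketch below computes Gini impurity as one minus the sum of squared class proportions, estimating each pi from the label counts in a node; the function name is an assumption.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a node: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

pure = gini(["a", "a", "a", "a"])
mixed = gini(["a", "a", "b", "b"])
```

A node holding a single class scores 0, and an even two-class mix scores 0.5, the worst case for two classes.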

Chi Square (used for CHAID algorithm)


CHAID (Chi-square Automatic Interaction Detector) analysis is an algorithm used for discovering relationships between a categorical response variable and other categorical predictor variables.
Information Gain/Entropy (used for ID3 algorithm)
Reduction in Variance

Q and A

1)Is rotation necessary in PCA?

Yes, the rotation is necessary because it maximizes the differences between the variance captured by the components.

2)What happens if the components are not rotated in PCA?

If the components are not rotated, the effect of PCA diminishes, and many more components are needed to explain the variance in the data set.

3)Explain why Naive Bayes is so Naive?

It assumes that all the features in the data set are important, equal and independent.

4)What is the difference between stochastic gradient descent (SGD) and gradient descent (GD)?

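Both algorithms are methods for finding a set of parameters that minimize a loss function by evaluating parameters against data and then adjusting.

In standard gradient descent, you'll evaluate all training samples for each set of parameters. This is akin to taking big, slow steps toward the solution.
In stochastic gradient descent, you'll evaluate only 1 training sample for the set of parameters before updating them. This is akin to taking small, quick steps toward the solution.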

5)What is the Box-Cox transformation used for?

The Box-Cox transformation is a generalized "power transformation" that transforms data to make the distribution more normal.
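For reference, the one-parameter Box-Cox transform is (x^λ - 1) / λ for λ ≠ 0 and ln(x) for λ = 0, defined for positive x. The sketch applies it for a fixed λ; choosing λ (usually by maximum likelihood) is not shown.

```python
import math

def box_cox(x, lam):
    """One-parameter Box-Cox transform of a single positive value."""
    if x <= 0:
        raise ValueError("Box-Cox requires strictly positive data")
    if lam == 0:
        return math.log(x)
    return (x ** lam - 1.0) / lam

# Hypothetical right-skewed sample, transformed with lambda = 0.5.
data = [1.0, 4.0, 9.0, 100.0]
transformed = [box_cox(x, 0.5) for x in data]
```

With λ = 0.5 the transform behaves like a shifted and scaled square root, which pulls in the long right tail.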

6)What are 3 ways of reducing dimensionality?

1. Removing collinear features.

2. Performing PCA, ICA, or other forms of algorithmic dimensionality reduction.

3. Combining features with feature engineering.

7)How can you choose a classifier based on training set size?

If the training set is small, high bias / low variance models (e.g. Naive Bayes) tend to perform better because they are less likely to overfit.

If the training set is large, low bias / high variance models (e.g. Logistic Regression) tend to perform better because they can reflect more complex relationships.

8)Explain Principal Component Analysis (PCA).

PCA is a method for transforming features in a dataset by combining them into uncorrelated linear combinations.

These new features, or principal components, sequentially maximize the variance represented (i.e. the first principal component has the most variance, the second principal component has the second most, and so on).

As a result, PCA is useful for dimensionality reduction because you can set an arbitrary variance cutoff.
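The variance cutoff can be sketched as a cumulative sum over the per-component variances. The sketch assumes those variances have already been computed by a PCA routine; the function name and the numbers are illustrative.

```python
def components_for_cutoff(variances, cutoff=0.95):
    """Smallest number of leading components whose variance share reaches `cutoff`."""
    total = sum(variances)
    running = 0.0
    ordered = sorted(variances, reverse=True)  # largest variance first
    for count, variance in enumerate(ordered, start=1):
        running += variance
        if running / total >= cutoff:
            return count
    return len(ordered)

# Hypothetical per-component variances from a four-feature dataset.
variances = [4.0, 3.0, 2.0, 1.0]
k_half = components_for_cutoff(variances, cutoff=0.5)
k_most = components_for_cutoff(variances, cutoff=0.85)
```

Here two components already explain 70% of the variance and three explain 90%, so a 0.85 cutoff keeps three of the four components.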

9)What is the advantage of performing dimensionality reduction before fitting an SVM?

A Support Vector Machine performs better in the reduced space. It is beneficial to perform dimensionality reduction before fitting an SVM if the number of features is large when compared to the number of observations.

10)How will you find the correlation between a categorical variable and a continuous variable?

You can use the analysis of covariance technique to find the correlation between a categorical variable and a continuous variable.

11)Difference between factor analysis and PCA?

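12)Explain p-value. Present it as if talking to a client.

13)Explain what a local optimum is and why it is important in a specific context, such as K-means clustering. What are specific ways of determining if you have a local optimum problem? What can be done to avoid local optima?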

14)What are feature vectors?


15)What is: collaborative filtering, n-grams, cosine distance?

16)Explain the difference between Gaussian Mixture Model and KMeans.

17)How would you deal with an imbalanced dataset?

18)How do you deal with sparse data?

19)Explain bootstrapping as if you’re talking to a non-technical person.

20)What do you understand by statistical power and how do you calculate it?

21)What’s the Central Limit Theorem and what are its practical implications?

22)Tell me what heteroscedasticity is and how to solve it?

If the variability of true values along the regression line is not constant, then this condition is known as heteroskedasticity.

Heteroscedasticity means unequal scatter.

23)Difference between Kmeans and KNN?

Good resources:

https://remicnrd.github.io./the-...
cs tie into many different fields, including decision theory, information theory, functional analysis (Hilbert spaces), convex optimization, and probability. We will cover introductory material from most or all of these areas.
ata mining: classification, clustering, regression, ranking, density estimation
scovery (CRISP-DM, KDD)
tural risk minimization, bias/variance tradeoff
ntial families

Chapters Available as Individual PDFs
Shannon Theory
Fourier Transforms
Linear Mesh Processing
Wavelets
Multiresolution Mesh Processing
Linear and Non-linear Approximation
Compression
Denoising
Variational Priors and Regularization
Inverse Problems
Sparse Regularization
Convex Analysis
Gradient Descent Methods
Non Smooth Optimization
Theory of Sparse Regularization
Compressed Sensing
Machine Learning
Deep-Learning

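Syllabus
Introduction to different paradigms of machine learning
Linear prediction, Regression
Maximum Likelihood, MAP, Bayesian ML
Regularization, Generalization, Cross Validation
Basics of Optimization
Linear Classification, Logistic Regression, Naïve Bayes
Support Vector Machines
Kernel Methods
Neural Networks, Backpropagation
Convolutional Neural Networks
Unsupervised Learning, Clustering, k-means
Dimensionality Reduction, PCA

Syllabus
Supervised Learning: Decision Trees and K-Nearest-Neighbors, Linear and Ridge Regression, Perceptron, Support Vector Machines (SVM), Kernels and nonlinear SVMs, Probabilistic ML Models (Linear Regression and Logistic Regression), Model Selection (AIC/BIC/Cross-validation, etc.), Feature Selection, Learning Theory; Unsupervised Learning: Hierarchical and Flat Clustering, Gaussian Mixture Models (via Expectation Maximization), Linear Dimensionality Reduction and Matrix Factorization, Nonlinear Dimensionality Reduction and Manifold Learning; Assorted Topics: Boosting, Reductions, Structured Prediction, Ranking, Semi-supervised Learning, Active Learning, Reinforcement Learning, Bayesian Learning, Topic Models for Text.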

Index of Topics
-- Use your browser's "Find" to look for keywords below --

Aggregation
Overview of ensemble learning (boosting, blending, before and after the fact)

Bayesian Learning
Validity of the Bayesian approach (prior, posterior, unknown versus probabilistic)

Bias-Variance Tradeoff
Basic derivation (overfit and underfit, approximation-generalization tradeoff)
Example (sinusoidal target function)
Noisy case (Bias-variance-noise decomposition)

Bin Model
Hoeffding Inequality (law of large numbers, sample, PAC)
Relation to learning (from bin to hypothesis, training data)
Multiple bins (finite hypothesis set, learning: search for green sample)
Union Bound (uniform inequality, M factor)

Data Snooping
Definition and analysis (data contamination, model selection)

Error Measures
User-specified error function (pointwise error, CIA, supermarket)

Gradient Descent
Basic method (Batch GD) (first-order optimization)
Discussion (initialization, termination, local minima, second-order methods)
Stochastic Gradient Descent (the algorithm, SGD in action)
Initialization - Neural Networks (random weights, perfect symmetry)

Learning Curves
Definition and illustration (complex models versus simple models)
Linear Regression example (learning curves for noisy linear target)

Learning Diagram
Components of learning (target function, hypothesis set, learning algorithm)
Input probability distribution (unknown distribution, bin, Hoeffding)
Error measure (role in learning algorithm)
Noisy targets (target distribution)
Where the VC analysis fits (affected blocks in learning diagram)

Learning Paradigms
Types of learning (supervised, reinforcement, unsupervised, clustering)
Other paradigms (review, active learning, online learning)

Linear Classification
The Perceptron (linearly separable data, PLA)
Pocket algorithm (non-separable data, comparison with PLA)

Linear Regression
The algorithm (real-valued function, mean-squared error, pseudo-inverse)
Generalization behavior (learning curves for linear regression)

Logistic Regression
The model (soft threshold, sigmoid, probability estimation)
Cross entropy error (maximum likelihood)
The algorithm (gradient descent)

Netflix Competition
Movie rating (singular value decomposition, essence of machine learning)
Applying SGD (stochastic gradient descent, SVD factors)

Neural Networks
Biological inspiration (limits of inspiration)
Multilayer perceptrons (the model and its power and limitations)
Neural Network model (feedforward layers, soft threshold)
Backpropagation algorithm (SGD, delta rule)
Hidden layers (interpretation)
Regularization (weight decay, weight elimination, early stopping)

Nonlinear Transformation
Basic method (linearity in the parameters, Z space)
Illustration (non-separable data, quadratic transform)
Generalization behavior (VC dimension of a nonlinear transform)

Occam's Razor
Definition and analysis (definition of complexity, why simpler is better)

Overfitting
The phenomenon (fitting the noise)
A detailed experiment (Legendre polynomials, types of noise)
Deterministic noise (target complexity, stochastic noise)

Radial Basis Functions
Basic RBF model (exact interpolation, nearest neighbor)
K Centers (Lloyd's algorithm, unsupervised learning, pseudo-inverse)
RBF network (neural networks, local versus global, EM algorithm)
Relation to other techniques (SVM kernel, regularization)

Regularization
Introduction (putting the brakes, function approximation)
Formal derivation (Legendre polynomials, soft-order constraint, augmented error)
Weight decay (Tikhonov, smoothness, neural networks)
Augmented error (proxy for out-of-sample error, choosing a regularizer)
Regularization parameter (deterministic noise, stochastic noise)

Sampling Bias
Definition and analysis (Truman versus Dewey, matching the distributions)

Support Vector Machines
SVM basic model (hard margin, constrained optimization)
The solution (KKT conditions, Lagrange, dual problem, quadratic programming)
Soft margin (non-separable data, slack variables)
Nonlinear transform (Z space, support vector pre-images)
Kernel methods (generalized inner product, Mercer's condition, RBF kernel)

Validation
Introduction (validation versus regularization, optimistic bias)
Model selection (data contamination, validation set versus test set)
Cross Validation (leave-one-out, 10-fold cross validation)

VC Dimension
Growth function (dichotomies, Hoeffding Inequality)
Examples (growth function for simple hypothesis sets)
Break points (polynomial growth functions)
Bounding the growth function (mathematical induction, polynomial bound)
Definition of VC Dimension (shattering, distribution-free, Vapnik-Chervonenkis)
VC Dimension of Perceptrons (number of parameters, lower and upper bounds)
Interpreting the VC Dimension (degrees of freedom, Number of examples)

The Learning Problem - Introduction; supervised, unsupervised, and reinforcement learning. Components of the learning problem.
Is Learning Feasible? - Can we generalize from a limited sample to the entire space? Relationship between in-sample and out-of-sample.
The Linear Model I - Linear classification and linear regression. Extending linear models through nonlinear transforms.
Error and Noise - The principled choice of error measures. What happens when the target we want to learn is noisy.
Training versus Testing - The difference between training and testing in mathematical terms. What makes a learning model able to generalize?
Theory of Generalization - How an infinite model can learn from a finite sample. The most important theoretical result in machine learning.
The VC Dimension - A measure of what it takes a model to learn. Relationship to the number of parameters and degrees of freedom.
Bias-Variance Tradeoff - Breaking down the learning performance into competing quantities. The learning curves.
The Linear Model II - More about linear models. Logistic regression, maximum likelihood, and gradient descent.
Neural Networks - A biologically inspired model. The efficient backpropagation learning algorithm. Hidden layers.
Overfitting - Fitting the data too well; fitting the noise. Deterministic noise versus stochastic noise.
Regularization - Putting the brakes on fitting the noise. Hard and soft constraints. Augmented error and weight decay.
Validation - Taking a peek out of sample. Model selection and data contamination. Cross validation.
Support Vector Machines - One of the most successful learning algorithms; getting a complex model at the price of a simple one.
Kernel Methods - Extending SVM to infinite-dimensional spaces using the kernel trick, and to non-separable data using soft margins.
Radial Basis Functions - An important learning model that connects several machine learning models and techniques.
Three Learning Principles - Major pitfalls for machine learning practitioners; Occam's razor, sampling bias, and data snooping.
Epilogue - The map of machine learning. Brief views of Bayesian learning and aggregation methods.
Contents

Introduction to Machine Learning
General information and basic concepts. Overview to the problems tackled by machine learning techniques. Supervised learning (classification and regression), unsupervised learning (clustering and density estimation) and semi-supervised learning (reinforcement and transductive). Examples.

Supervised machine learning theory
The supervised Machine Learning problem setup. Classification and regression problems. Bias-variance tradeoff. Regularization. Overfitting and underfitting. Generalization bounds. Complexity of a model: Vapnik-Chervonenkis dimension and Rademacher complexity. Model selection and feature selection.

Linear methods for regression
Error functions for regression. Least squares: analytical and iterative methods. Regularized least squares. The Delta rule. Examples.

Linear methods for classification
Error functions for classification. The perceptron algorithm. Novikoff's theorem. Separations with maximum margin. Generative learning algorithms and Gaussian discriminant analysis. Naive Bayes. Logistic regression. Multinomial regression.

Artificial neural networks
Artificial neural networks: multilayer perceptron and radial basis functions network. Application to classification and to regression problems.

Kernel functions and support vector machines
Definition and properties of Kernel functions. Support vector machines for classification and regression problems.

Unsupervised machine learning
Unsupervised machine learning techniques. Clustering algorithms: EM algorithm and k-means algorithm. Kernel Density Estimation.

Reinforcement learning and control
Description of reinforcement learning. Markov decision processes and Bellman equations. Values and Temporal Difference methods. Q-learning and the Sarsa algorithm.

Survey of advanced topics
Bagging and boosting. Notes on deep learning, transductive learning and other hot topics. Challenging applications.

Schedule

Week 1 --- Introduction and overview
Week 2 --- Hidden Markov Models
Week 3 --- Data Analysis
Week 4 --- Applications of Hidden Markov Models
Week 5 --- Profile Hidden Markov Models
Week 6 --- Applications of Profile Hidden Markov Models
Week 7 --- Principal Component Analysis
Week 8 --- Applications of Principal Component Analysis
Week 9 --- Support Vector Machines
Week 10 --- Applications of Support Vector Machines
Week 11 --- Clustering
Week 12 --- Clustering Applications
Week 13 --- k-Nearest Neighbor, Neural Networks, Boosting/AdaBoost, Random Forests
Week 14 --- Linear Discriminant Analysis, Naive Bayes, Regression Analysis, Conditional Random Fields
Week 15 --- Project presentations

We plan to cover topics in generalization theory, online learning and optimization, and high-dimensional dat

Topics may include:
Concentration of measure
Statistical learning
Empirical process theory
Approximation theory
Boosting and margins
Convex surrogates and optimization
Online learning with experts
Online classification
Online convex optimization
Multi-armed bandits
Gaussian concentration
Best-fit subspaces
Spectral clustering

Schedule and notes:
Lecture 0
Risk vs. Risk: Some terminology differences between Stats and ML (ML people have not defined risk analogously, causing some confusion)
Lecture 1: 1/12/11
Introduction; Bias-Variance Tradeoff
Lecture 2: 1/19/11
Fixed Design Regression and Ridge Regression
Lecture 3: 1/24/11
Ridge Regression and PCA
Lecture 4: 1/26/11
The Central Limit Theorem; Large Deviations; and Rate Functions
Lecture 5: 1/30/11
The Moment Method; Convex Duality; and Large/Medium/Small Deviations
Lecture 6: 2/2/11
Hoeffding, Chernoff, Bennett, and Bernstein Bounds
Lecture 7: 2/7/11
Feature Selection, Empirical Risk Minimization, and The Orthogonal Case
Lecture 8: 2/9/11
Feature Selection and Chi^2 Tail bounds
Lecture 9: 2/14/11
Risk vs. Risk: Some terminology differences between Stats and ML (lecture 0 notes)
Empirical Processes
Lecture 10: 2/16/11
Bracketing Covering Numbers
Lecture 11: 2/21/11
Symmetrization and Rademacher Averages
Lecture 12: 2/23/11
Rademacher Composition and Linear Prediction
Lecture 13: 2/28/11
Review: Norms and Dual Norms
Lecture 14: 3/2/11
Bounded Differences, Rademacher Averages, and L1 Regularization
Lecture 15: 3/14/11
Rademacher Averages, Linear Prediction, and Convex Duality
further reading:
S. M. Kakade, S. Shalev-Shwartz, A. Tewari. Regularization Techniques for Learning with Matrices.
Lecture 16: 3/16/11
Uniform and Empirical Covering Numbers
Lecture 17: 3/21/11
Dudley's Theorem and Packing Numbers
Lecture 18: 3/28/11
Mistake Bound Model, Halving Algorithm, Linear Classifiers, & Perceptron
Lecture 19: 3/30/11
Perceptron Lower Bound & The Winnow Algorithm
Lecture 20: 4/4/11
The Perceptron for Generalized Linear Models and Single Index Models
Lecture 21: 4/6/11
Online Convex Programming and Gradient Descent
Lecture 22: 4/11/11
Exponentiated Gradient Descent
Lecture 23: 4/13/11
Online to Batch Conversions
Lecture 24: 4/18/11
Growth Functions and the VC dimension
Lecture 25: 4/20/11
Boosting

Probabilistic formulations of prediction problems
Plug-in estimators, empirical risk minimization
Linear threshold functions, perceptron algorithm
Risk bounds
Concentration inequalities
Uniform convergence
Rademacher averages; combinatorial dimensions
Convex surrogate losses for classification
Game-theoretic formulations of prediction problems
Minimax strategies for log loss, linear loss, and quadratic loss
Universal portfolios
Online convex optimization
Neural networks
Stochastic gradient methods
Combinatorial dimensions and Rademacher averages
Hardness results for learning
Efficient learning algorithms
Kernel methods
Reproducing kernel Hilbert spaces, Mercer's theorem
Convex optimization for kernel methods
Representer theorem
Ensemble methods
AdaBoost
AdaBoost as I-projection
Convergence and consistency of AdaBoost

Here is a tentative outline for the course:
Nonnegative Matrix Factorization [slides]

Qualitative Comparisons to SVD


New Algorithms via Separability
Applications to Topic Models

D. Lee and S. Seung. Learning the Parts of Objects by Nonnegative Matrix Factorization, Nature 1999.
S. Vavasis. On the Complexity of Nonnegative Matrix Factorization, SIOPT 2009.
S. Arora, R. Ge, R. Kannan and A. Moitra. Computing a Nonnegative Matrix Factorization -- Provably, STOC 2012.

S. Arora, R. Ge and A. Moitra. Learning Topic Models -- Going Beyond SVD, FOCS 2012.
S. Arora et al. A Practical Algorithm for Topic Modeling with Provable Guarantees, ICML 2013.

Discussion: When does well-posedness lead to better algorithms?

M. Balcan, A. Blum and A. Gupta. Clustering under Approximation Stability, JACM 2013.

Tensor Decompositions [slides]


Tensor Rank, Border Rank and the Rotation Problem
Jennrich's Algorithm and the Generalized Eigenvalue Problem
Learning HMMs
Mixed Membership Models and Community Detection
Cumulants and Independent Component Analysis

C. Hillar and L. Lim. Most Tensor Problems are NP-hard, JACM 2013.
E. Mossel and S. Roch. Learning Nonsingular Phylogenies and Hidden Markov Models, STOC 2005.

A. Anandkumar, D. Foster, D. Hsu, S. Kakade and Y. Liu. A Spectral Algorithm for Latent Dirichlet Allocation, NIPS 2012.

A. Anandkumar, R. Ge, D. Hsu and S. Kakade. A Tensor Spectral Approach to Learning Mixed Membership Community
N. Goyal, S. Vempala and Y. Xiao. Fourier PCA, STOC 2014.

Discussion: When do algorithms rely (too much) on a distributional model?

U. Feige and J. Kilian. Heuristics for Semirandom Graph Problems, JCSS 2001.

Sparse Coding
Sparse Recovery, Incoherence and Uncertainty Principles
Alternating Minimization via Approximate Gradient Descent [slides]

Sum-of-Squares and Noisy Tensor Decomposition

B. Olshausen and D. Field. Emergence of Simple-cell Receptive Field Properties by Learning a Sparse Code for Natural Images, Nature 1996.
D. Spielman, H. Wang and J. Wright. Exact Recovery of Sparsely-Used Dictionaries, COLT 2012.
S. Arora, R. Ge, T. Ma and A. Moitra. Simple, Efficient and Neural Algorithms for Sparse Coding, Manuscript 2014.
B. Barak, J. Kelner and D. Steurer. Dictionary Learning and Tensor Decomposition via the Sum-of-Squares Method, Manuscript 2014.

Discussion: When does belief propagation (provably) work?

S. Geman and D. Geman. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images, Trans. Pattern Analysis and Machine Intelligence 1984.

Learning Mixture Models


Expectation Maximization
Clustering in High-Dimensions
Method of Moments and Systems of Polynomial Equations [slides]

A. Dempster, N. Laird and D. Rubin. Maximum Likelihood from Incomplete Data via the EM Algorithm, J. Royal Statistical Society 1977.
S. Dasgupta. Learning Mixtures of Gaussians, FOCS 1999.
S. Arora and R. Kannan. Learning Mixtures of Separated Nonspherical Gaussians, Annals of Applied Probability 2005.
A. Kalai, A. Moitra and G. Valiant. Efficiently Learning Mixtures of Two Gaussians, STOC 2010.
A. Moitra and G. Valiant. Settling the Polynomial Learnability of Mixtures of Gaussians, FOCS 2010.
M. Belkin and K. Sinha. Polynomial Learning of Distribution Families, FOCS 2010.

Discussion: Is nature an adversary? And if not, how can we model and exploit that?

A. Bhaskara, M. Charikar, A. Moitra and A. Vijayaraghavan. Smoothed Analysis of Tensor Decompositions, STOC 2014.

Linear Inverse Problems


Nuclear Norm, Atomic Norm and Matrix Completion
Alternating Minimization via Principal Angles

Tensor Prediction and Random CSPs

E. Candes and B. Recht. Exact Matrix Completion via Convex Optimization, FOCM 2009.
V. Chandrasekaran, P. Parrilo, B. Recht and A. Willsky. The Convex Geometry of Linear Inverse Problems, FOCM 2012.
P. Jain, P. Netrapalli and S. Sanghavi. Low-rank Matrix Completion using Alternating Minimization, STOC 2013.
M. Hardt. Understanding Alternating Minimization for Matrix Completion, FOCS 2014.
B. Barak and A. Moitra. Tensor Prediction, Rademacher Complexity and Random 3-XOR, Manuscript 2015.

Discussion: Do we have enough average-case assumptions?

Q. Berthet and P. Rigollet. Computational Lower Bounds for Sparse PCA, COLT 2013.
V. Chandrasekaran and M. Jordan. Computational and Statistical Tradeoffs via Convex Relaxation, PNAS 2013.
1. Introduction
Overview of the learning problem: statistical and online learning frameworks.
PAC and agnostic PAC models, online learning protocol (adaptive/oblivious adversaries, various notions of regret), connections to

2. Minimax formulation for learning, distribution free and adversarial learning settings, uniform guarantees and no free lunch theorems

3. Statistical Learning Framework
  - Empirical risk minimization and regularized empirical risk minimization
  - Uniform convergence (iid data)
  - Finite classes, PAC Bayes theorem, compression bounds
  - VC dimension and growth function
  - Rademacher complexity, covering numbers, Dudley integral bounds, fat-shattering dimension
  - Supervised learnability: necessary and sufficient conditions via uniform convergence (iid)
  - Local Rademacher analysis and fast rates

4. Online Learning Framework (sequential prediction/decision making)
  - Learning with expert advice, perceptron, winnow
  - Sequential minimax analysis for online learning
  - Uniform convergence over martingales
  - Sequential Rademacher complexity, sequential covering numbers, sequential fat-shattering dimension
  - Supervised online learnability: necessary and sufficient conditions via martingale uniform convergence

5. Deriving algorithms through relaxation and minimax analysis for online learning

6. Additional topics
  - Connections between learning and approximation algorithms
  - Connections between learning and optimization
  - Algorithmic stability tools
  - Statistical estimation vs. statistical learning vs. stochastic optimization

Lectures :
Lecture 1 : Introduction, course details, what is learning theory, learning frameworks [slides] [notes]
Reference : [1] (ch 1 and 3)
Lecture 2 : Minimax Rates, comparing the different frameworks [lec2]
Lecture 3 : No Free Lunch, Statistical Learning [lec3]
Lecture 4 : Statistical Learning: ERM, Finite classes, MDL [lec4]
Lecture 5 : Statistical Learning: MDL continued, infinite classes [lec5]
Lecture 6 : Statistical Learning: Symmetrization, Rademacher Complexity, Growth function [lec6]
Lecture 7 : Statistical Learning: Growth function, VC Dimension, Massart's finite lemma [lec7]
Lecture 8 : Statistical Learning: VC, Sauer-Shelah Lemma continued [lec7]
Lecture 9 : Statistical Learning: Properties of Rademacher complexity, examples [lec9]
Lecture 10 : Statistical Learning: Examples continued, Covering numbers, Pollard bound [lec10]
Lecture 11 : Statistical Learning: Covering numbers, Pollard bound, Dudley Bound [lec11]
Lecture 12 : Statistical Learning: Covering numbers, fat-shattering dimension, learnability [lec12]
Lecture 13 : Online Learning: Halving, Exponential weights, minimax rate for bit prediction [lec13]
Lecture 14 : Online Convex Learning: Online Gradient Descent [lec14]
Lecture 15 : Online Mirror Descent [lec15]
Lecture 16 : Online Mirror Descent [lec16]
Lecture 17 : Online Mirror Descent, continued. [lec17]
Lecture 18 : Minimax Rates for Online Learning [lec18]
Lecture 19 : Minimax Rates for Online Learning, Sequential Rademacher Complexity [lec19]
Lecture 20 : Sequential Rademacher Complexity [lec20]
Lecture 21 : Sequential Complexity Measures [lec21]
Lecture 22 : Relaxations for Online Learning [lec22]
Lecture 23 : Relaxations for Online Learning [lec23]
Lecture 24 : Relaxations for Randomized Algorithms [lec24]
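
The exponential weights method named in Lecture 13 can be sketched as follows. This is a minimal illustration and not taken from the course notes: the function name, the loss sequence, and the learning rate choice eta = sqrt(8 ln N / T) are assumptions made for the example. With losses in [0, 1], that choice of eta gives the standard regret bound sqrt(T ln N / 2) against the best fixed expert.

```python
import math

def exponential_weights(loss_rounds, eta):
    """Play the exponential weights forecaster over a sequence of loss vectors.

    loss_rounds[t][i] is the loss in [0, 1] of expert i at round t.
    Returns the learner's cumulative expected loss and its regret
    against the best fixed expert in hindsight.
    """
    n = len(loss_rounds[0])
    weights = [1.0] * n
    learner_loss = 0.0
    for losses in loss_rounds:
        total = sum(weights)
        probs = [w / total for w in weights]
        # Expected loss when an expert is sampled according to probs.
        learner_loss += sum(p * l for p, l in zip(probs, losses))
        # Multiplicative update: experts with larger loss lose more weight.
        weights = [w * math.exp(-eta * l) for w, l in zip(weights, losses)]
    best = min(sum(r[i] for r in loss_rounds) for i in range(n))
    return learner_loss, learner_loss - best

# Hypothetical loss sequence: 3 experts, 100 rounds, expert 0 is best overall.
rounds = [[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0], [0.0, 1.0, 1.0]] * 25
T, N = len(rounds), 3
eta = math.sqrt(8 * math.log(N) / T)
loss, regret = exponential_weights(rounds, eta)
print(f"learner loss {loss:.2f}, regret {regret:.2f}, bound {math.sqrt(T * math.log(N) / 2):.2f}")
```

The learner never sees the future losses; the bound holds for any loss sequence in [0, 1], including adversarial ones.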


The schedule will be updated and revised as the course progresses. Each topic will come with links to reference materials; key references will be highlighted. To get a rough idea of the material, check out the schedules from past offerings: Fall 13, Fall 14.

Tue, Aug 25
Introduction, history, overview, and administrivia.
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Introduction to statistical learning theory, in Advanced Lectures in Machine Learning (O. Bousquet, U. von Luxburg, and G. Rätsch, editors), pp. 169-207, Springer, 2004
Theodoros Evgeniou, Massimiliano Pontil, and Tomaso Poggio, Statistical learning theory: a primer, International Journal of Computer Vision, vol. 38, no. 1, pp. 9-13, 2000
Ulrike von Luxburg and Bernhard Schölkopf, Statistical learning theory: models, concepts, and results (http://arxiv.org/abs/0810.4752), 2008
Tomaso Poggio and Steve Smale, The mathematics of learning: dealing with data, Notices of the American Mathematical Society, vol. 50, no. 5, pp. 537-544, 2003
Cosma Shalizi, Learning theory (formal, computational or statistical) (http://cscs.umich.edu/~crshalizi/notebooks/learning-theory.html), Jan 09, 2011 [A nice succinct summary, with lots of useful references]

Thu, Aug 27
Tue, Sep 1
[notes]
Concentration inequalities: Markov, Chebyshev, McDiarmid (bounded differences inequality), examples
Torben Hagerup and Christine Rüb, A guided tour of Chernoff bounds, Information Processing Letters, vol. 33, no. 6, pp. 305-308, 1990 [Short and sweet]
Gábor Lugosi, Concentration-of-measure inequalities, lecture notes, 2003-2009
Colin McDiarmid, Concentration, Probabilistic Methods for Algorithmic Discrete Mathematics, pp. 1-46, 1998
Terence Tao, Concentration of measure (http://terrytao.wordpress.com/2010/01/03/254a-notes-1-concentration-of-measure/), Jan 03, 2010

Thu, Sep 3
[notes]
Formulation of the learning problem: concept and function learning; realizable case; Probably Approximately Correct (PAC) learning.
Dana Angluin, Queries and concept learning, Machine Learning, vol. 2, no. 4, pp. 319-342, 1988
David Haussler, PAC learning model, and decision-theoretic generalizations, with applications to neural nets, in Mathematical Perspectives on Neural Networks, Lawrence Erlbaum Associates, 1995
Leslie Valiant, A theory of the learnable, Communications of the ACM, vol. 27, no. 11, pp. 1134-1142

Tue, Sep 8
[notes]
Formulation of the learning problem, continued: agnostic (model-free) learning; consistency; Empirical Risk Minimization
Dana Angluin, Queries and concept learning, Machine Learning, vol. 2, no. 4, pp. 319-342, 1988
David Haussler, PAC learning model, and decision-theoretic generalizations, with applications to neural nets, in Mathematical Perspectives on Neural Networks, Lawrence Erlbaum Associates, 1995
Leslie Valiant, A theory of the learnable, Communications of the ACM, vol. 27, no. 11, pp. 1134-1142

Thu, Sep 10
Tue, Sep 15
[notes]
Empirical Risk Minimization: abstract risk bounds and Rademacher averages -- stochastic inequalities for ERM; Rademacher averages (structural results, Finite Class Lemma); introduction to VC classes
Peter Bartlett and Shahar Mendelson, Rademacher and Gaussian complexities: risk bounds and structural results, Journal of Machine Learning Research, vol. 3, pp. 463-482, 2002
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification: a survey of recent advances, ESAIM Probability and Statistics, vol. 9, pp. 323-375, 2005 (Section 3 only)

Thu, Sep 17
[notes]
Vapnik-Chervonenkis classes: shatter coefficients; VC dimension; examples of VC classes; Sauer-Shelah lemma; implication for Rademacher averages
Anselm Blumer, Andrzej Ehrenfeucht, David Haussler, and Manfred Warmuth, Learnability and the Vapnik-Chervonenkis dimension, Journal of the ACM, vol. 36, no. 4, pp. 929-965, 1989
Gábor Lugosi, Pattern classification and learning theory, in Principles of Nonparametric Learning (L. Györfi, editor), pp. 1-56, Springer, 2002 (parts of Section 1.4)

Tue, Sep 22
Thu, Sep 24
[notes]
Binary classification: bounds for simple VC classes (linear and generalized linear discriminant rules); surrogate loss functions; margin-based bounds
Peter Bartlett, Michael Jordan, and Jon McAuliffe, Convexity, classification, and risk bounds, Journal of the American Statistical Association, vol. 101, no. 473, pp. 138-156, 2006
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification: a survey of recent advances, ESAIM Probability and Statistics, vol. 9, pp. 323-375, 2005

Tue, Sep 29
Thu, Oct 1
No class: Allerton conference

Tue, Oct 6
Thu, Oct 8
[notes]
Binary classification, continued: reproducing kernel Hilbert spaces and kernel machines; convex risk minimization
Peter Bartlett, Michael Jordan, and Jon McAuliffe, Convexity, classification, and risk bounds, Journal of the American Statistical Association, vol. 101, no. 473, pp. 138-156, 2006
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification: a survey of recent advances, ESAIM Probability and Statistics, vol. 9, pp. 323-375, 2005

Tue, Oct 13
Thu, Oct 15
[notes]
Regression with quadratic loss
Presentation loosely based on Chapter 8 of Cucker and Zhou.

Tue, Oct 20
Thu, Oct 22
Thu, Oct 29
Tue, Nov 3
Thu, Nov 5
[notes]
Stability of learning algorithms: learnability without uniform convergence; average and uniform stability of learning algorithms; the role of convexity and strong convexity; stability of Stochastic Gradient Descent; connection between differential privacy, stability, and generalization
Olivier Bousquet and André Elisseeff, Stability and generalization, Journal of Machine Learning Research, vol. 2, pp. 499-526, 2002
Alexander Rakhlin, Sayan Mukherjee, and Tomaso Poggio, Stability results in learning theory, Analysis and Applications, vol. 3, no. 4, pp. 397-417, 2005
Shai Shalev-Shwartz, Ohad Shamir, Nathan Srebro, and Karthik Sridharan, Learnability, stability, and uniform convergence, Journal of Machine Learning Research, vol. 11, pp. 2635-2670, 2010
Moritz Hardt, Ben Recht, and Yoram Singer, Train faster, generalize better: stability of stochastic gradient descent, preprint, 2015
Kobbi Nissim and Uri Stemmer, On the generalization properties of differential privacy, preprint, 2015

Tue, Nov 10
Thu, Nov 12
Online learning: basic model; regret; regret bounds for online convex and strongly convex programming via projected gradient descent; online-to-batch conversions; relation to Rademacher averages.
Martin Zinkevich, Online convex programming and generalized infinitesimal gradient ascent, ICML 2003
Elad Hazan, Amit Agarwal, and Satyen Kale, Logarithmic regret algorithms for online convex optimization, Machine Learning, vol. 69, no. 2-3, pp. 169-192, 2007
Nicolò Cesa-Bianchi, Alex Conconi, and Claudio Gentile, On the generalization ability of online learning algorithms, IEEE Transactions on Information Theory, vol. 50, no. 9, pp. 2050-2057, 2004
Jacob Abernethy, Alekh Agarwal, Peter Bartlett, and Alexander Rakhlin, A stochastic view of optimal regret through minimax duality, COLT 2009

Tue, Nov 17
Thu, Nov 19
Tue, Dec 1
[notes]
Minimax lower bounds: binary classification under a margin assumption; reduction to finite testing on a binary hypercube (Assouad's lemma); extra log factor for rich VC classes; information-theoretic methods (Fano's inequality)
Pascal Massart and Élodie Nédélec, Risk bounds for statistical learning, Annals of Statistics, vol. 34, no. 5, pp. 2326-2366, 2006.
Bin Yu, Assouad, Fano, and Le Cam, in Festschrift for Lucien Le Cam, edited by D. Pollard, E. Torgersen, and G. Yang, pp. 423-435, 1997, Springer-Verlag.



Linear regression (overfitting, cross-validation/bootstrap, model selection, regularization, [stochastic] gradient descent)
Linear classification: Logistic regression (feature selection, sparsity, multiclass)
Kernels and the kernel trick (Properties of kernels; applications to linear and logistic regression; kNN)
The statistical perspective (regularization as prior; loss as likelihood; learning as MAP inference)
Statistical decision theory (decision making based on statistical models and utility functions)
Discriminative vs. generative modeling (benefits and challenges in modeling joint vs. conditional distributions)
Bayes' classifiers (Naive Bayes, Gaussian Bayes; MLE)
Bayesian networks and exact inference (conditional independence; variable elimination; TANs)
Approximate inference (sum/max product; Gibbs sampling)
Latent variable models (Gaussian Mixture Models, EM Algorithm)
Temporal models (Bayesian filtering, Hidden Markov Models)
Sequential decision making (MDPs, value and policy iteration)
Reinforcement learning (model-based RL, Q-learning)
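
As a small illustration of the value iteration item in the topic list above, here is a minimal sketch. The two-state MDP, its rewards, and the function name are hypothetical and chosen only so the fixed point is easy to check by hand; nothing here comes from the course materials.

```python
def value_iteration(transitions, rewards, gamma=0.9, tol=1e-8):
    """Compute optimal state values of a finite MDP by repeated Bellman backups.

    transitions[s][a] is a list of (probability, next_state) pairs;
    rewards[s][a] is the immediate reward for taking action a in state s.
    """
    n_states = len(transitions)
    values = [0.0] * n_states
    while True:
        new_values = []
        for s in range(n_states):
            # Bellman optimality backup: best one-step lookahead value.
            q = [
                rewards[s][a] + gamma * sum(p * values[t] for p, t in transitions[s][a])
                for a in range(len(transitions[s]))
            ]
            new_values.append(max(q))
        delta = max(abs(x - y) for x, y in zip(new_values, values))
        values = new_values
        if delta < tol:
            return values

# Hypothetical two-state MDP: action 0 stays put, action 1 moves to the other state.
transitions = [
    [[(1.0, 0)], [(1.0, 1)]],
    [[(1.0, 1)], [(1.0, 0)]],
]
rewards = [
    [0.0, 0.0],  # state 0: no reward for either action
    [1.0, 0.0],  # state 1: reward 1 for staying
]
values = value_iteration(transitions, rewards, gamma=0.5)
print(values)
```

With gamma = 0.5 the fixed point satisfies V(1) = 1 + 0.5 V(1) and V(0) = 0.5 V(1), so the values converge to roughly 1 for state 0 and 2 for state 1.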
ECE236C - Optimization Methods for Large-Scale Systems
Prof. L. Vandenberghe, UCLA
Lecture notes (Spring 2019)
Introduction
Gradient method
Subgradients
Subgradient method
Proximal gradient method
Conjugate functions
The proximal mapping
Accelerated proximal gradient methods
Proximal point method
Dual decomposition
Dual proximal gradient method
Douglas-Rachford splitting and ADMM
Primal-dual proximal methods
Generalized distances and mirror descent
Generalized proximal gradient method
Conjugate gradient method
Newton's method
Quasi-Newton methods
Gauss-Newton method

Lectures from previous years

Conic optimization and interior-point methods
Conic optimization
Barrier functions
Path-following methods
Symmetric cones
Primal-dual interior-point methods

First-order methods
Fast proximal gradient methods (FISTA)
Smoothing
Proximal mapping via network optimization

Localization and cutting-plane methods
Cutting-plane methods
Analytic center cutting-plane method
Ellipsoid method
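
To make the "proximal gradient method" and "proximal mapping" entries above concrete, here is a minimal sketch for the lasso problem, minimize 0.5 * ||Ax - b||^2 + lam * ||x||_1. It is an illustration under stated assumptions, not code from the lecture notes: the function names are hypothetical, the step size is supplied by the caller (it should be at most 1/L, where L is the largest eigenvalue of A^T A), and plain Python lists stand in for a numerical library.

```python
def soft_threshold(v, t):
    """Proximal mapping of t * ||.||_1, applied elementwise."""
    return [max(abs(x) - t, 0.0) * (1.0 if x > 0 else -1.0) for x in v]

def proximal_gradient_lasso(A, b, lam, step, iters=200):
    """Proximal gradient iteration for 0.5 * ||Ax - b||^2 + lam * ||x||_1."""
    m, n = len(A), len(A[0])
    x = [0.0] * n
    for _ in range(iters):
        # Gradient of the smooth part: A^T (A x - b).
        residual = [sum(A[i][j] * x[j] for j in range(n)) - b[i] for i in range(m)]
        grad = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        # Gradient step on the smooth part, then prox step on the l1 term.
        x = soft_threshold([x[j] - step * grad[j] for j in range(n)], step * lam)
    return x

# Hypothetical example with A = I, where the solution is soft_threshold(b, lam).
x_hat = proximal_gradient_lasso([[1.0, 0.0], [0.0, 1.0]], [3.0, -0.5], lam=1.0, step=1.0)
print(x_hat)
```

With A equal to the identity the minimizer is available in closed form, which makes the example easy to check: the first coordinate shrinks from 3 to 2 and the second is set to zero.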
EE236A - Linear Programming (Fall Quarter 2013-14)
Prof. L. Vandenberghe, UCLA
Lecture notes
Introduction
Piecewise-linear optimization
Polyhedra
Convexity
Alternatives
Duality
Duality, part 2
Linear-fractional optimization
Structural optimization
Filter design
Control applications
Simplex method
The central path
Barrier method
Primal-dual interior-point method
Self-dual formulations
Network flow optimization
Integer linear programming

CSE 291: Topics in unsupervised learning
Time: TuTh 3.30-4.50 in University Center 413A room 1
Instructor: Sanjoy Dasgupta
Office hours Mon 3-5 in EBU3B 4138

Lectures and readings
Lecture 1: Clustering in metric spaces [4/1]
Gonzalez. Clustering to minimize the maximum intercluster distance.
Lecture 2: The k-means problem [4/3, 4/8]
Arthur and Vassilvitskii. k-means++: the advantages of careful seeding.
Kanungo, Mount, Netanyahu, Piatko, Silverman, and Wu. A local search approximation algorithm for k-means clustering.
Lecture 3: The k-medoid problem [4/10]
Lin and Vitter. Approximation algorithms for geometric median problems.
Lecture 4: Hierarchical clustering [4/15]
Hartigan. Clustering algorithms.
Dasgupta and Long. Performance guarantees for hierarchical clustering.
Lecture 5: Finding meaningful clusters in data [4/17, 4/22]
Jardine and Sibson. Mathematical taxonomy.
Kleinberg. An impossibility theorem for clustering.
Balcan, Blum, and Vempala. A discriminative framework for clustering via similarity functions.
Review: Concentration of measure [4/24, 4/29]
Lecture 6: Clustering in an online/streaming setting [5/1, 5/6, 5/8]
Charikar, Chekuri, Feder, and Motwani. Incremental clustering and dynamic information retrieval.
Beygelzimer, Kakade, and Langford. Cover trees for nearest neighbor search.
Guha, Meyerson, Mishra, Motwani, and O'Callaghan. Clustering data streams: theory and practice.
Indyk. Sublinear time algorithms for metric space problems.
Lecture 7: Random projection [5/13]
Dasgupta and Gupta. An elementary proof of the Johnson-Lindenstrauss lemma.
Lecture 8: Spectral methods [5/15, 5/20, 5/22]
Strang. Linear algebra and its applications: appendix A on SVD.
Drineas, Kannan, Frieze, Vempala, and Vinay. Clustering large graphs via the singular value decomposition.
Lecture 9: Multidimensional scaling; Frechet and Bourgain embeddings [5/27, 5/29]
Schoenberg. Metric spaces and positive definite functions.
Kruskal and Wish. Multidimensional scaling.
Matousek. Lectures on discrete geometry, chapter 15: Embedding finite metric spaces into normed spaces.
Lecture 10: Embedding spaces of low intrinsic dimension [6/3, 6/5]
Tenenbaum, de Silva, and Langford. A global geometric framework for nonlinear dimensionality reduction.
Roweis and Saul. Nonlinear dimensionality reduction by locally linear embedding.
Clarkson. Tighter bounds for random projections of manifolds.
Indyk and Naor. Nearest neighbor preserving embeddings.
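
For the k-center objective behind the Gonzalez reading in Lecture 1 (clustering to minimize the maximum intercluster distance), a minimal sketch of farthest-first traversal is given below. The point set and function name are hypothetical, and Euclidean distance is assumed for simplicity; the greedy rule itself works in any metric space and is known to give a 2-approximation to the optimal k-center radius.

```python
import math

def farthest_first(points, k, first=0):
    """Greedy k-center: repeatedly add the point farthest from the chosen centers.

    Returns the indices of the chosen centers and the resulting covering radius
    (the largest distance from any point to its nearest center).
    """
    centers = [first]
    # dist[i] = distance from point i to its nearest chosen center so far.
    dist = [math.dist(p, points[first]) for p in points]
    while len(centers) < k:
        nxt = max(range(len(points)), key=lambda i: dist[i])
        centers.append(nxt)
        dist = [min(d, math.dist(p, points[nxt])) for d, p in zip(dist, points)]
    return centers, max(dist)

# Hypothetical data: two tight pairs on a line and one isolated point.
points = [(0, 0), (1, 0), (10, 0), (11, 0), (5, 8)]
centers, radius = farthest_first(points, k=3)
print(centers, radius)
```

Starting from point 0, the traversal picks the far end of the line, then the isolated point, leaving every point within distance 1 of a center.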
Part I: Supervised learning

Lecture 1. The Laplace functional [9/21, 9/26, 9/28]
Probability review, concentration of Lipschitz functions of bounded independent random variables
Homework 1, due 10/3

Lecture 2. The PAC model of learning [10/3, 10/5, 10/10]
Proper and agnostic learning, canonical function classes, Occam-style bounds
Homework 2, due 10/17

Lecture 3. Uniform convergence [10/10, 10/12, 10/17, 10/19]
VC dimension, canonical examples, Rademacher averages
Homework 3, due 11/14

Lecture 4. Mistake-bounded learning [10/24, 10/26]
Halving, perceptron, winnow, conversions to PAC

Lecture 5. Strong and weak learning [10/31, 11/2, 11/7]
Boosting algorithms, margin-based bounds

Lecture 6. Fourier methods [11/14, 11/16, 11/21]
Linear-algebra approach to learning, decision trees, circuits

Lecture 7. Models of noise [11/28, 11/30]
Random misclassification, statistical queries, malicious noise

Part II: Unsupervised, semisupervised, and active learning

Lecture 8. Gaussian concentration
Johnson-Lindenstrauss lemma

Lecture 9. Embedding finite metrics into L2
Notions of distortion, Bourgain's embedding, probabilistic embeddings

Lecture 10. Algorithms for nearest neighbor search
Locality-sensitive hashing, cover trees

Lecture 11. Spectral projection
Low-rank approximation, application to mixture models

Lecture 12. Approximation algorithms for clustering
k-center, k-median, k-means, correlation clustering

Lecture 13. Semisupervised learning

Lecture 14. Active learning
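
The halving algorithm listed under Lecture 4 (mistake-bounded learning) can be sketched in a few lines. The hypothesis class of integer thresholds, the example stream, and the function name are assumptions made for illustration. When the target lies in a finite class H, each mistake removes at least half of the remaining hypotheses, so the number of mistakes is at most log2 |H|.

```python
import math

def halving(hypotheses, stream):
    """Run the halving algorithm; stream is a list of (x, true_label) pairs.

    Predicts by majority vote over hypotheses consistent with all past examples,
    then discards every hypothesis that disagrees with the revealed label.
    """
    version_space = list(hypotheses)
    mistakes = 0
    for x, y in stream:
        votes = sum(h(x) for h in version_space)
        prediction = 1 if 2 * votes >= len(version_space) else 0
        if prediction != y:
            mistakes += 1
        version_space = [h for h in version_space if h(x) == y]
    return mistakes, version_space

# Hypothetical class: thresholds h_t(x) = 1 if x >= t, for t = 0, ..., 8.
hypotheses = [lambda x, t=t: int(x >= t) for t in range(9)]
target = hypotheses[5]
stream = [(x, target(x)) for x in [4, 0, 7, 5, 2, 6, 3, 1]]
mistakes, version_space = halving(hypotheses, stream)
print(mistakes, len(version_space))
```

On this stream the learner makes 2 mistakes, below the bound log2(9), and ends with a single consistent hypothesis.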
CSE 291: Bayesian methods

Time: TuTh 11-12.30 in CSE 2154
Instructor: Sanjoy Dasgupta
Office hours Tue 2-4 in CSE 4138

Administrative details
Course requirements: There will be periodic homework assignments as well as a final project.
The following textbooks contain a lot of the material we'll be covering:
Gelman, Carlin, Stern, Rubin. Bayesian Data Analysis.
Murphy. Machine Learning: A Probabilistic Perspective.
Barber. Bayesian Reasoning and Machine Learning.

Lecture schedule, homework assignments, and optional accompanying readings

Course outline (Jan 8)
Efron. A 250-year argument: belief, behavior, and the bootstrap.

Entropy, exponential families, and maximum likelihood (Jan 10,15,17,22)
A nice introduction to maximum entropy modeling is:
Berger, Della Pietra, Della Pietra. A maximum entropy approach to natural language processing.
For entropy and asymptotic equipartition, consult Chapters 2 and 3 of the following fantastic text:
Cover and Thomas. Elements of Information Theory.
The axiomatic formulation of entropy that I presented is one of many, but my personal favorite. It is from:
Aczel, Forte, Ng. Why Shannon and Hartley entropies are "natural". (Find it in JSTOR, or email me)
Here's the paper for the species distribution problem we discussed:
Phillips, Dudik, Schapire. A maximum entropy approach to species distribution modeling.
Homework 1, due 1/31.

Bayesian inference for exponential families (Jan 24,29,31)
Here's the paper on modeling amino acid distributions using Dirichlet mixtures:
Sjolander, Karplus, Brown, Hughey, Krogh, Mian, Haussler. Dirichlet mixtures: a method for improving detection of weak but significant protein sequence homology.
Homework 2, due 2/12.

Gaussian models: conditioning, linear regression, kernel trick, Bayesian model selection, Gaussian processes (Feb 5,7,19)
A good basic reference on Gaussian processes is the following book, available online:
Rasmussen and Williams. Gaussian processes for Machine Learning.
For the mathematically inclined:
Adler and Taylor. Random Fields and Geometry.

Markov random fields: the Hammersley-Clifford theorem, Gibbs sampling, and MAP inference (Feb 21,26,28)
Geman and Geman. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images.
Geyer. Practical Markov chain Monte Carlo.

Mixture models and Dirichlet processes (Mar 5,7)
Neal. Markov chain sampling methods for Dirichlet process mixture models.
Sethuraman. A constructive definition of Dirichlet priors.

Topic models (Mar 12,14)
Griffiths, Steyvers. Finding scientific topics.
Blei, Griffiths, Jordan. The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies.
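
As a small worked instance of the "Bayesian inference for exponential families" unit above, the sketch below shows the conjugate Beta-Bernoulli update. The function names, the uniform Beta(1, 1) prior, and the data are hypothetical choices for illustration and are not drawn from the course handouts.

```python
def beta_bernoulli_update(alpha, beta, observations):
    """Posterior Beta parameters after observing a list of 0/1 outcomes.

    With a Beta(alpha, beta) prior on the success probability of a Bernoulli,
    the posterior is Beta(alpha + successes, beta + failures).
    """
    successes = sum(observations)
    failures = len(observations) - successes
    return alpha + successes, beta + failures

def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return alpha / (alpha + beta)

# Hypothetical data: three successes and one failure under a uniform prior.
a, b = beta_bernoulli_update(1.0, 1.0, [1, 1, 0, 1])
posterior_mean = beta_mean(a, b)
print(a, b, posterior_mean)
```

The posterior here is Beta(4, 2), whose mean 2/3 sits between the prior mean 1/2 and the empirical frequency 3/4, which is the usual shrinkage behavior of a conjugate prior.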
Lectures:
Lecture 1 A Probabilistic Approach to Pattern Recognition
Lecture 2 Introduction to Classification and Regression
Lecture 3 Introduction to Complexity Regularization
Lecture 4 Denoising in Smooth Function Spaces
Lecture 5 Plug-in Rules and Histogram Classifiers
Lecture 6 Probably Approximately Correct (PAC) Learning
Lecture 7 Chernoff's Bound and Hoeffding's Inequality
Lecture 8 Classification Error Bounds
Lecture 9 Error Bounds in Countably Infinite Model Spaces
Lecture 10 Complexity Regularization
Lecture 11 Decision Trees
Lecture 12 Complexity Regularization for Squared Error Loss
Lecture 13 Maximum Likelihood Estimation
Lecture 14 Maximum Likelihood and Complexity Regularization
Lecture 15 Denoising II: Adapting to Unknown Smoothness
Lecture 16 Wavelet Approximation Theory
Lecture 17 Denoising III: Spatial Adaptivity
Lecture 18 Introduction to VC Theory
Lecture 19 The VC Inequality
Lecture 20 Applications of VC Theory

Statistical Learning Theory
Lectures: 306 Soda. Tuesday/Thursday 9:30-11:00am.
Instructor: Peter Bartlett.
Office Hours: Wed 9:00-10:00, 399 Evans; Thu 11:00-12:00; 723 SDH.
GSI: Alan Malek. Office Hours: Wed 5:00-6:00 and Thu 4:00-5:00; 283H Soda.
This course will provide an introduction to the theoretical analysis of prediction methods, focusing on statistical and computational aspects. It will cover approaches such as neural networks, kernel methods and boosting algorithms, and probabilistic and game theoretic formulations of prediction problems, and it will focus on tools for the theoretical analysis of the performance of learning algorithms and the inherent difficulty of learning problems.

http://people.csail.mit.edu/dsontag/courses/ml13/
Introduction To Machine Learning

Fall 2013

Overview

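Machine learning is an exciting and fast-moving field of computer science with many recent consumer applications (e.g., Microsoft Kinect, Google Translate, iPhone's Siri, digital camera face detection, Netflix recommendations, Google news) and applications within the sciences and medicine (e.g., predicting protein-protein interactions, species modeling, detecting tumors, personalized medicine). In this undergraduate-level class, students will learn about the theoretical foundations of machine learning and how to apply machine learning to solve new problems.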
General information

Lectures: Tuesday and Thursday, 11am-12:15pm


Room: Warren Weaver Hall 312

Instructor:
Prof. David Sontag
dsontag {@ | at} cs.nyu.edu
Grader:
Chen-Chien Wang

ccw352 {@ | at} nyu.edu


Office hours: Tuesday 5-6pm. Location: 715 Broadway, 12th floor, Room 1204

Grading: problem sets (50%) + midterm exam (25%) + project (20%) + participation (5%). Problem Set policy
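Pre-requisites: Basic Algorithms (CS 310) is required, but can be taken concurrently. Students should be very comfortable with basic mathematical skills in addition to good programming skills. Some knowledge of linear algebra and multivariable calculus will be helpful.

Books: No textbook is required (readings will come from freely available online material). If an additional reference is desired, the following books are good options. Bishop's book is easier to read, whereas Murphy's book has more depth and coverage (and is up to date).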

Machine Learning: a Probabilistic Perspective, by Kevin Murphy (2012).


Pattern Recognition and Machine Learning, by Chris Bishop (2006).
Mailing list: To subscribe to the class list, follow instructions here.
Project information

Schedule

Note: the Bishop and Murphy readings are optional

Lecture Date Topic Required reading Assignments


1 Sept 3 (Tues)
Overview [Slides]
Chapter 1 of Murphy's book

Bishop, Chapter 1 (optional)


2 Sept 5 (Th)
Introduction to learning [Slides]

Loss functions
Barber 17.1-2 (stop before 17.2.1) on least-squares regression, 29.1.1-4 (review of vector algebra)

Notes on perceptron mistake bound (just section 1)


ps1 (data), due Sept 17 at 11am
3
Sept 10 (Tues)
Linear classifiers [Slides]

Introduction to Support vector machines


Notes on support vector machines (sections 1-4)

Bishop, Section 4.1.1 (pg. 181-182) and Chapter 7 (pg. 325-328)

Murphy, Section 14.5.2 (pg. 498-501)

4
Sept 12 (Th)
Support vector machines [Slides]
See above. Also:

Bishop, Sections 7.1.1 and 7.1.3

5
Sept 17 (Tues)
Support vector machines (continued) [Slides]

Derivation of SVM dual, introduction to kernels


Notes on SVM dual and kernel methods (sec. 3-8)

If you would like a second reference, see these notes (sections 5-8)

Bishop, Section 6.2, Section 7.1 (except for 7.1.4), and Appendix E

Murphy, Chapter 14 (except 14.4 and 14.7)


ps2, due Sept 24 at 11am
[Solutions]
6
Sept 19 (Th)
Kernel methods [Slides]
See above.
Optional: For more on SVMs, see Hastie, Sections 12.1-12.3 (pg. 435). For more on cross-validation see Hastie, Section 7.10 (pg. 250).

Optional: For more advanced kernel methods, see chapter 3 of this book (free online from NYU libraries)

7
Sept 24 (Tues)
Kernel methods & optimization

Mercer's theorem, convexity


Lecture notes

8
Sept 26 (Th)
Learning theory [Slides]

Generalization of finite hypothesis spaces


Lecture notes

These have only high-level overviews:


- Murphy, Section 6.5.4 (pg. 209)
- Bishop, Section 7.1.5 (pg. 344)
ps3 (data), due Oct. 8 at 11am
9
Oct 1 (Tues)
Learning theory (continued) [Slides]

VC-dimension
Notes on learning theory

10
Oct 3 (Th)
Learning theory (continued) [Slides]

Also margin-based generalization


Notes on gap-tolerant classifiers (section 7.1, pg. 29-31)

11
Oct 8 (Tues)
Nearest neighbor methods [Slides]

Hastie et al., Sections 13.3-13.5 (on nearest neighbor methods)

Bishop, Section 14.4 (pg. 663)

Murphy, Section 16.2


ps4, due Oct. 17 at 11am
[Solutions]
12
Oct 10 (Th)

No class on Oct 15 (Fall recess)


Decision trees [Slides]
Mitchell Ch. 3

Rudin's lecture notes


13
Oct 17 (Th)
Ensemble methods [Slides]

Random forests
Hastie et al., Section 8.7 (bagging)

Optional: Hastie et al. Chapter 15 (on random forests)

Oct 22 (Tues)
Midterm exam

A Few Useful Things to Know About Machine Learning

14
Oct 24 (Th)
Clustering [Slides]

K-means
Hastie et al., Sections 14.3.6, 14.3.8, 14.3.9, 14.3.12

Murphy, Sections 25.1, 25.5-25.5.3

Bishop, Section 9.1 (pg. 424)


Project proposal, due Oct. 31 at 5pm by e-mail
15
Oct 29 (Tues)
Clustering (continued) [Slides]

Hierarchical clustering
See above.

16
Oct 31 (Th)
Clustering (continued) [Slides]
Spectral clustering
Hastie et al., Section 14.5.3

Optional: Tutorial on spectral clustering

Murphy, Section 25.4

17
Nov 5 (Tues)
Introduction to Bayesian methods [Slides]

Probability
Murphy, Sections 3-3.3

Bishop, Sections 2-2.3.4

18
Nov 7 (Th)
Naive Bayes [Slides]

Murphy, Sections 3.4, 3.5 (naive Bayes), 5.7 (decision theory)

Bishop, Section 1.5 (decision theory)

19
Nov 12 (Tues)
Logistic regression [Slides]

Notes on naive Bayes and logistic regression

Murphy, 8-8.3 (logistic reg.), 8.6 (generative vs. discriminative)

Bishop, 4.2-4.3.4 (logistic reg.)


ps5, due Nov 21 at 11am [Solutions]
20
Nov 14 (Th)
Mixture models, EM algorithm [Slides]

Notes on mixture models


Notes on Expectation Maximization

Murphy, 11-11.4.2.5, Section 11.4.7


Bishop, Sections 9.2, 9.3, 9.4

21
Nov 19 (Tues)
EM algorithm (continued) [Slides]

22
Nov 21 (Th)
Hidden Markov models [Slides]

Notes on HMMs
Tutorial on HMMs

Murphy, Chapter 17
Bishop, Sections 8.4.1, 13.1-2

23
Nov 26 (Tues)
Dimensionality reduction
Notes on PCA
More notes on PCA

Bishop, Sections 12.1 (PCA), 12.4.1 (ICA)

Optional: Barber, Chapter 15


ps6 (data), due Dec. 5 at 11am
24
Dec 3 (Tues)
No class on Nov 28 (Thanksgiving)
Bayesian networks [Slides]

Latent Dirichlet allocation


Review article on topic modeling
Introduction to Bayesian networks

25
Dec 5 (Th)
Collaborative filtering
[Slides]

Overview of matrix factorization

26
Dec 10 (Tues)
Applications in computational biology [Slides]
An introduction to graphical models

Dec 12 (Th)
Project presentations (group 1)

Dec 17 (Tues)
10-11:50am
Project presentations (everyone else)

During final exam slot. Note the special time! Same location.

Acknowledgements: Many thanks to the University of Washington, Carnegie Mellon University, UT Dallas, Stanford, UC Irvine, Princeton, and MIT for sharing material used in slides and homeworks.

Reference materials
Machine learning books
Trevor Hastie, Rob Tibshirani, and Jerry Friedman, Elements of Statistical Learning, Second Edition, Springer, 2009. (Can be downloaded as PDF file.)
David Barber, Bayesian Reasoning and Machine Learning, Cambridge University Press, 2012. (Can be downloaded as PDF file.)
Probability
Chapter 2 of either Murphy or Bishop (see also Bishop Appendix B)
Review notes from Stanford's machine learning class
Sam Roweis's probability review
Linear algebra
Bishop Appendix C
Online class from MIT
Review notes from Stanford's machine learning class
Sam Roweis's linear algebra review
Calculus
Bishop Appendix D and E (Lagrange multipliers)
Notes from MIT on Lagrange multipliers
Dan Klein's Lagrange Multipliers without Permanent Scarring
Optimization
Convex Optimization by Stephen Boyd and Lieven Vandenberghe. (Can be downloaded as PDF file.)
https://studyabroad.shiksha.com/top-10-master-of-data-science-colleges-in-canada-articlepage-1883

List of universities offering Data Science programs in Canada:

The University of British Columbia < 12 Month + < 15$

University of Toronto > 16 - 24 Month

McGill University ranked

Queen's University

University of Western Ontario

HEC Montreal

Simon Fraser University

Ryerson University

Carleton University

University of Waterloo
https://skoolville.com/blog/canada-universities-with-masters-in-data-science/

1. The University of British Columbia – Master of Data Science
2. University of Waterloo – Master of Mathematics in Computer Science – Data Science
3. Carleton University – Master of Computer Science – Data Science
4. Ryerson University – Master of Science in Data Science and Analytics
5. Saint Mary’s University – Master of Science in Computing and Data Analytics
6. Trent University – Master of Science in Big Data Analytics
7. Langara College – Post Degree Diploma in Data Analytics
8. Simon Fraser University – Professional Master’s program in Computer Science – Big data Specialization
9. University of Toronto – Master of Science in Applied Computing – Data Science
10. University of Alberta – Master of Science in computing Science – Statistical Machine Learning
11. Western University, Ontario – Master of Data Analytics
12. University of Calgary – Diploma in Data Science and Analytics
13. HEC Montreal – Master of Science – Business Analytics
14. McGill University – Masters in Data Science
15. University of New Brunswick – MS in Computer Science – Data Analytics
16. Acadia University – MSc in Computer Science – Data Analytics
17. Bow Valley College – Postbaccalaureate certificate – Data Management and Analytics
18. Georgian College – Ontario College Graduate Certificate – Big Data Analytics
19. Lambton College – Ontario College Graduate Certificate – Big Data – Science & Informatics
20. St. Clair College – Graduate Certificate – Data Analytics for Business
21. Sheridan College – Data Science Sheridan Certificate

https://www.mbacrystalball.com/masters-degree/data-science-analytics/canada-data-science-analytics
Applied Data Science and Big Data weCloud Data
https://weclouddata.com/courses/data-science-diploma
springboard
BrainStation
Which is a better deal: a Masters in Data Science or Data Science Bootcamp?
https://www.coursecompare.ca/best-data-analytics-certification/

brainstrom
York University School of Continuing studies
The G. Raymond Chang School of Continuing Education, Ryerson University
University of Toronto School of Continuing Studies
https://lanterninstitute.ca/

13 weeks
faster way ? To get job >> master
https://www.switchup.org/bootcamps/weclouddata

NYC Data Science Academy


12-Week Data Science Bootcamp

The University of Toronto School of Continuing Studies Boot Camps offer a 12-week, full-time

york
mcgill
waterloo
toronto
https://www.coursecompare.ca/subject/data-science-courses/
https://www.ryerson.ca/graduate/datascience/admission/faq/
OSAP student financial assistance
https://www.ryerson.ca/sfa/
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/student_group/
https://www.ontario.ca/page/how-apply-osap
Important Dates
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/important-dates/
National Student Loans Service Centre (NSLSC)
https://www.csnpe-nslsc.canada.ca/en/funding-options
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/
https://mscac.utoronto.ca/concentrations/data-science
https://web.cs.toronto.edu/graduate/admissions
OSAP and Tuition for University of Toronto (in 4 minutes) Youtube Channel
OSAP Aid Estimator
https://osap.gov.on.ca/AidEstimator1920Web/enterapp/enter.xhtml
