Yousef AI Follow-Up Sheet
Huawei
Nokia
Bell
AT&T
Rogers
TELUS
Ericsson is the world’s leading provider of communications technology and services. Our offerings include services, consulting,
Using innovation to empower people, business and society, Ericsson is working towards the Networked Society: a world co
We are truly a global company, operating across borders in over 180 countries, offering a diverse, performance-driven cult
Exciting Opportunity:
It will be practically impossible for human brains to understand how to run and optimize next generation of wireless netwo
Machine Intelligence, the combination of Machine Learning and other Artificial Intelligence technologies is what Ericsson u
Ericsson is now looking for Principal Data Scientists to significantly expand its global team for AI acceleration for our group
Do you want to apply and extend those skills to solve real complex problems with high societal impact; going beyond ML/A
Then, you do want to join Ericsson’s global team of Engineers/Scientists pushing the technology frontiers to automate, sim
Role Summary:
As a Principal Data Scientist, you shall build and deploy AI models into production with focus on scaling, monitoring and pe
Your knowledge and experience in Data Science methodologies will be applied to solve challenging real-world problems as
Key Qualifications:
Bachelors/Masters/Ph.D. in Computer Science, Data Science, Artificial Intelligence, Machine Learning, Electrical Engineerin
Applied experience: 8+ years of ML and/or AI production level experience; and an overall industry experience of about 15+
Proven skills of implementing a variety of Machine Learning techniques
Strong Programming skills (R/Python) with proficiency in at least one
Strong grounding in mathematics, probability, statistics needed for data analysis and experiments
Proven ability of leading AI/ML projects end-to-end with complete ownership
Proven skills in building AI/ML based solutions using a variety of frameworks such as Python, R, H2O, Keras, TensorFlow, Sp
Experience in implementing new algorithms and methodologies from leading open source initiatives and research papers
Extensive experience in model development and life-cycle-management in one or more industry/application domain
Experience in building models using semi-structured and unstructured data
Hands-on experience in designing and building AI models using Deep Neural Networks for applicable scenarios
Experience in using ensembles and stacking techniques to solve complex ML problems
Able to build and deploy AI models into production with focus on scaling, monitoring and performance
Knowledge of building explainable models (XAI) and prescriptive analytics
Experience working with Big Data technologies such as Hadoop, Cassandra, etc.
Able to define/design data storage and retrieval strategies for various kinds of data sources such as NoSQL DBs
Knowledge of designing data pipelines and flow strategies
Familiarity with data pipelining frameworks such as Airflow, AWS SageMaker, etc. would be a plus
Able to design APIs for AI/ML models with focus on business, modularity and versioning
Experience in writing and presenting white papers, journal articles and technical blogs on the results
Soft Skills:
As a Senior Data Scientist, you will need to have strong programming skills and deep understanding of data science and M
Key Responsibilities:
Lead functional and technical analysis within Ericsson businesses and for strategic customers to understand MI-driven bus
Define the model validation strategy and business success criteria in data science terms
Identify the right architecture and flow for the data and DS model
Design the implementation and deployment strategy for the model into production
Contribute to rapid and iterative development of validated minimum viable solutions addressing these needs. This include
Lead studies and creative usage of new and/or existing data sources. Work with Data Architects to leverage existing data m
Collaborate with product development teams and partners in Ericsson Businesses to industrialize machine learning models
Work with unstructured data including text and images in AI/ML models
Work with new technologies and be the ambassador for them in MI Communities within Ericsson, nurturing the communiti
Provide MI Competence build-up in Ericsson Businesses and Customer Serving Units
Develop new and apply/extend existing, concepts, methodologies, techniques for cross functional initiatives
Engage with external ecosystem (academia, technology leaders, open source etc.) to develop the skills and technology por
Present and be prominent in MI related forums and conferences, e.g., publishing patents, presenting papers, organizing se
Key Qualifications:
Bachelors/Masters/Ph.D. in Computer Science, Data Science, Artificial Intelligence, Machine Learning, Electrical Engineerin
Applied experience: 5+ years of ML and/or AI production level experience; and an overall industry experience of around 10
Proven skills of implementing a variety of Machine Learning techniques
Experience in Security, Internet of Things is a plus
Strong skills in the use of current machine learning frameworks such as H2O, Keras, TensorFlow, Spark ML etc.
Demonstrated ability to implement new algorithms and methodologies from leading open source initiatives and research p
Experience with Big Data technologies such as Hadoop, Cassandra etc.
Good with effective big data storage and retrieval strategies including indexing, partitioning, etc.
Hands-on experience working with data pipelines and flows
Hands-on experience with API design/development for AI/ML models
Strong grounding in math and statistics.
Proven ability of leading projects end-to-end.
Proven experience writing production-grade software
Extensive experience in model development and AI model life-cycle-management in one or more industry/application dom
Strong Programming skills in various languages (C++, Scala, Java, R) with proficiency in Python and/or C++
Good communication skills in written and spoken English
Creativity and ability to formulate problems and solve them independently
Ability to build and nurture internal and external communities
Experience in writing and presenting white papers, journal articles and technical blogs on the results
Additional Requirements:
Data Scientist
Santa Clara, California
Research & Development
In this role you will:
Create and maintain optimal data and model dataOps pipeline architecture
Assemble large, complex data sets that meet functional / non-functional business requirements.
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, r
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data so
Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issue
Keep data separated and secure across national boundaries through multiple data centers and strategic customers/partne
Create tool-chains for analytics and data scientist team members that assist them in building and optimizing our product in
Work with data and machine learning experts to strive for greater functionality in our data and model life cycle manageme
Support dataOps competence build-up in Ericsson Businesses and Customer Serving Units
BS, MS or PhD degree in Computer Science, Informatics, Information Systems or another related field.
3-4 years’ experience using the following software/tools: Hadoop, Spark, Kafka, etc.
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
Experience with Data and Model pipeline and workflow management tools: Azkaban, Luigi, Airflow, Dataiku, etc.
Experience with stream-processing systems: Storm, Spark-Streaming, etc.
Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
You have advanced SQL knowledge and experience working with relational databases, query authoring (SQL) as well as wo
You have experience building and optimizing ‘big data’ data pipelines, architectures and data sets.
Experience performing root cause analysis on internal and external data and processes to answer specific business questio
You have strong analytic skills related to working with unstructured datasets.
You have built processes supporting data transformation, data structures, metadata, dependency and workload managem
Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
Strong project management and interpersonal skills.
Experience supporting and working with cross-functional teams in a dynamic environment
Data Scientist
Lisbon, Portugal
Network Operation and Integration
Ericsson is one of the leading ICT providers, with about 40% of the world’s mobile traffic carried through our networks. We
Key Qualifications:
Student or Recent Graduate in Technical Telecom, Electronics or Computer Science Engineering major
Previous experience in the field (professional or university project)
Good knowledge of statistical analysis, theory of probabilities, design of experiments and machine learning
Understanding of data preparation, data mining and pre-processing
Good knowledge of SQL, PL/SQL, SQL Server or SPSS
Good command of a few programming languages and software environments for statistical analysis, graphics representation a
Knowledge of Business Objects, Tableau, Cognos
Knowledge of Hadoop, Spark, Data Lake would be appreciated
Key Responsibilities:
Gather and process data at scale; scripts, files, APIs, database queries, etc.
Work closely with analytics, and engineering team members
Interpret data, analyze results using statistical techniques and provide ongoing reports
Acquire data from primary or secondary data sources and maintain databases/data systems
Identify, analyze, and interpret trends or patterns in complex data sets
What we offer:
Job Description
Date: Apr 17, 2020
Responsibilities:
Create Scalable Machine Learning systems that are highly performant.
Identify patterns in data streams and generate actionable insights.
Customize Machine Learning algorithms in image recognition and computer vision.
Participate in related proofs-of-concept with customers to apply machine learning algorithms in different use cases.
Investigate new machine learning technologies & identify feasible ones for specific cases.
Collaborate with others both internally and externally in machine learning area.
Prepare and provide machine learning trainings to other colleagues.
Key Qualifications:
An interest in exploring how Machine Learning can be leveraged for the use cases in telecom domain
4+ years of software development experience post graduate school
2+ years of experience working in applying Machine Learning to solve complex problems
MS in Computer Science, Mathematics, Physics, Artificial Intelligence, or related
Additional Requirements:
Data Scientist
Stockholm, Sweden
Performance and Transformation
Job Description
Date: Apr 14, 2020
Are you interested in leading our company through the future challenges of the 5G industry? Do you have a passion for de
We are currently looking for Data Scientists in Stockholm, Sweden, to join our Digital Center.
Activities of a Data Scientist include defining, processing, and analyzing data to identify actionable solutions, and applying
University degree / MBA in Computer Science, Machine Learning, Computer Engineering, Mathematics, Physics, or related
Significant experience from applying data science methodologies to solve challenging real-world business problems (predi
Strong foundation in mathematics and statistics
Fluency in at least one scripting language (e.g. Python, R)
Strong analytical skills and ability to acquire new knowledge and apply it in the job
Acumen for business flow understanding and expertise in data preparation, data mining, and pre-processing
Strong communication, presentation, and collaboration skills
Application Process:
The selection and interview process is ongoing. Therefore, please send in your application in English as soon as possible. Fo
We have a programming test as a qualifier and part of screening process for this position. Please note that you may be req
Software Developer
Gurgaon, India
Product Development
Job Summary:
Automation System Tester function is to ensure the execution of system test activities for the developed products. This fun
Responsibilities:
• Assigned test activities are realized within approved cost, time and quality
• Keep description of the test environment (both HW & SW) up-to-date. Track all software and hardware licenses and inven
• Establish integration & verification scope, design detail test cases according to SUT (system under test) implementation fe
• Execute functional / non-functional test cases according to detail test cases’ description, elementary bugs’ analysis and b
Key Qualifications:
Competence / Skills
Technical Competence
1
Shell Scripting
Important
Core Java
Important
UNIX/LINUX
Important
Database concepts
Important
Important
Virtualization
Important
Important
Important
10
Important
11
Important
12
Important
Test Competence
Important
2
Good knowledge of Test tools and Test environment
Important
Important
Important
Important
Important
Domain Competence
Telecom
Important
2
Domain -Mediation, Billing
Good to have
Others
Good to have
Must Have
Must have
Individual Capacities
(General abilities)
1
Communication Ability
Important
Important
Important
Result Orientation
Important
Customer Orientation
Important
6
Enthusiasm/ Drive
Important
Important
Discipline/Punctuality
Important
Adaptability
Important
10
Professionalism
Important
Job Description
Date: Apr 30, 2020
Have you ever heard of Mobile positioning? We are developing the Ericsson Mobile Positioning System that is being used b
From early studies and proofs-of-concept to deployment, we are responsible for the complete life cycle of the product. Ind
If you are eager to learn, have a can-do personality, would like to work in a project that has a truly global footprint, join us
What we offer:
Data Scientist
Company Name: Huawei; Company Location: Kuala Lumpur, Malaysia
Huawei's Southern Pacific Regional Office Big Data Team is looking for a self-driven Data Scientist to join our team. Ideally,
Job Responsibilities:
Exploratory research to understand user behavior, selecting features, building and optimizing classifiers and building mach
Address the most important analytical questions with a view on driving product impact, and build products metrics.
Data mining using state-of-the-art methods
Extending company’s data with third party sources of information when needed
Enhancing data collection procedures to include information that is relevant for building analytic systems
Processing, cleansing, and verifying the integrity of data used for analysis
Doing ad-hoc analysis and presenting results in a clear manner
Desired skills and background:
Education: Master’s Degree in applied statistics, data mining, machine learning, physics or a related quantitative discipline
Working experience: >5 years delivering world-class data science outcomes, you solve complex analytical problems using
Customer orientation with excellent understanding of operator’s business/technical requirements, you have a keen desire
Achievement orientation, energetic, strong influence skills, self-initiative, teamwork spirit, persistency, logical thinking abil
Great communication skills
Excellent understanding of machine learning techniques and algorithms, such as Deep Learning, Naive Bayes, SVM, Decisio
You are as experienced with relational databases as you are with Hadoop-based data mining frameworks. You are familiar with SQL, Py
Experience in the use of statistical analysis environments such as R, NumPy/Pandas, SPSS or SAS.
Good applied statistics skills, such as distributions, statistical testing, regression, etc.
Good scripting and programming skills
Data-oriented personality
How to apply:
Leave your application through LinkedIn. Please remember to attach your CV to your application to yong.sze.miin3@huaw
Are you enthusiastic about applying big data to resolve real world problems?
Does it sound exciting to you to improve billions of people’s daily life by creating great mobile devices? If these describe yo
Apply CNN, DNN, RNN/LSTM and other latest deep learning technologies to high-impact applications, such as anomaly det
JOB SPECIFICATIONS
Education/Knowledge:
Major/Discipline: Computer Science/ Math/ Statistics or related
Minimum Requirements:
Knowledge of data analysis techniques (probability, statistics, machine learning, etc) and experience with applications.
Hands-on experience with big data analytics tools, such as Python, R, SQL, Hadoop, Spark, Java, Tableau, Vertica, Pig, etc.
Strong analytical skills and detail oriented. Excellent analytical thought process – ability to understand the question, and d
Preferred:
Versed in the process of applying effective algorithms to large scale data modeling.
Huawei is a leading global information and communications technology (ICT) solutions provider. Driven by a commitment
The size of our cloud platform is gaining momentum and it is already planet scale. Huawei Cloud is one of the largest and f
Huawei's Munich Research Center is responsible for advanced technology research, architectural development, design and
We are seeking a highly motivated Data Scientist (m/f/d) to join the Intelligent Cloud Operations team in Huawei Muni
Responsibilities
Propose new innovative approaches to operate planet-scale cloud platforms using AI (e.g., AIOps).
Rapid prototyping of innovative features using Big Data platforms (e.g., Spark, Kafka, HDFS).
Explore new approaches to develop cloud-native distributed systems (e.g., Netflix’s Hystrix).
Implement emerging observability paradigms (e.g., Google Dapper, OpenTracing, and OpenCensus).
Integrate modern AI algorithms (e.g., Deep Learning and Facebook PyTorch) into production systems.
Requirements
Excellent PhD in Computer Science, or related field.
First experience as a data scientist, data engineer, computational biologist, or bioinformatician.
Experience with statistical software (e.g., Pandas, Scikit-learn).
Expertise with data analysis such as forecasting, multivariate analysis, stochastic models.
Experience with applying machine learning on large-scale datasets.
Demonstrated ability to solve challenging engineering problems is required.
Fluent written and spoken English.
What you can expect
Meaningful work: Our products and solutions connect people in over 170 countries, serving more than one third of the wo
Enormous investments in research, development and innovation: Huawei invests over 10% of its revenues in research and
Team spirit: At Huawei, we are proud of a strong social integration. Doors are open, and people collaborate with each othe
International work environment: Our business language is English and our team comprises unique experts from around 50
Robust growth across all business segments, thanks to balanced global presence and strategic focus.
If you are enthusiastic to shape the German Research Center in Munich together with us, being part of a multicultural team
Responsibilities
Use deep learning and machine learning to create scalable solutions for business problems.
Develop new tools using cutting edge technology focusing on efficiency and automation.
Work closely with the AIOps team to jointly develop innovative tools driven by AI.
Build ML systems in production settings.
Collaborate with colleagues from science, engineering, and business backgrounds.
Requirements
Experience with statistical software (e.g., Pandas, R) and programming languages (e.g., Python, Java).
Hands-on expertise with ML libraries (e.g., Scikit-Learn, TensorFlow, Keras).
Experience with applying machine learning on large-scale datasets.
Experience with fast prototyping.
Demonstrated ability to solve challenging engineering problems is required.
Fluent written and spoken English.
PhD in Computer Science or related field
What you can expect
Meaningful work: Our products and solutions connect people in over 170 countries, serving more than one third of the wo
Enormous investments in research, development and innovation: Huawei invests over 10% of its revenues in research and
Team spirit: At Huawei, we are proud of a strong social integration. Doors are open, and people collaborate with each othe
International work environment: Our business language is English and our team comprises unique experts from around 50
Robust growth across all business segments, thanks to balanced global presence and strategic focus.
Data Reply
What sets us apart is our imagination and the ability to inspire our customers for pioneering technologies and to introduce
We are always on the lookout for enthusiasts who question the existing, try out new ideas and want to achieve exciting go
TASKS:
As part of our team, you support our customers in the successful development and implementation of complex solutions f
With the help of powerful open source and cloud technologies, you offer our customers scalable, cost-effective and flexibl
You are responsible for the implementation, improvement and evaluation of ML applications and their implementation
You communicate regularly with our customers and stakeholders in an efficient and professional manner
QUALIFICATIONS:
Location:
Berlin, Copenhagen, Frankfurt, Helsinki, London, Munich, Oslo, Paris, Sao Paulo, Stockholm, Warsaw
Geography:
Central & South America, Europe & The Middle East
Capabilities:
Big data & advanced analytics, innovation & product development, technology & digital
Industries:
Automotive & Mobility, Biopharmaceuticals, Consumer products, Education, Energy & environment, Engineered products
About Us
Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and captu
To succeed, organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and fun
BCG GAMMA combines innovative skills in computer science, artificial intelligence, statistics, and machine learning with de
Role profiles
The Gamma Engineering team is building the next generation of analytics tools. Clients need to easily interact with our ana
POSITION SUMMARY:
BCG Gamma is seeking a Machine Learning Engineer to join our engineering team. The ideal candidate will have industry e
As a strong software expert in building complex systems, you will be responsible for inventing how we use technology, ma
RESPONSIBILITIES:
We are looking for a Machine Learning Engineer who can bring bleeding edge machine learning models into production to
Your qualifications
REQUIREMENTS:
• Knowledge of or experience in building production quality and large scale deployment of applications related to natural
TECHNOLOGIES:
• Strong programming skills in at least one object-oriented programming language (Java, Scala, C++, Python, etc.)
• Strong skills in parallel processing technologies and languages: Hadoop, Spark, Scala etc.
• Experience with Python applied to machine learning (Pandas, Scikit-learn, Scipy, Numpy etc.)
• Strong knowledge of machine learning techniques required (KNN, random forest, Bayesian statistics etc.)
• Strong knowledge of machine learning techniques preferred (TensorFlow, Keras, PyTorch, Caffe, MxNet)
WORK ENVIRONMENT:
• Position is located in Gamma European hubs (Paris, London, Germany, Nordics etc.)
Date Posted:
4-Feb-19
We are looking for an expert who can lead us to success in the big data and database field. The position is a senior technic
If you have extensive cutting-edge technology research and system design experience, and extensive practical application
The mission of the Irish Research Center is to position Huawei as a recognized technology leader in global information and
To become an innovator and leader in cloud data centers, Huawei's IT product line focuses on IT infrastructure and promo
JOB RESPONSIBILITIES:
As the lead of the Data Lake Platform Development team, you will help to create the next generation Big Data Analytics pla
Lead in analyzing the software requirements and software elements for Big Data Platform design.
Self-motivated AI engineer who takes ownership of the design and implementation of SW components
Take charge of the design and code writing for a specific cross-sub-system or codes of key algorithms.
Work with cross-functional teams to integrate AI-based solutions into production SW Stack
Text and Language processing, classification, summarization, topic extraction, and de-identification using state-of-the-art m
Research and design new ideas and develop them in the real world
Lead development of the Big Data Platform and its integration with existing services
Work closely with the other teams to ensure architectural integrity.
Participate in different open source and standard meetings to present solutions
Travel as work needs, including visiting our HQ in China 2-4 times per year.
Reporting to the Chief Big Data Architect for the IRC Big Data Team
JOB QUALIFICATIONS:
Professional Knowledge:
PhD or Masters in Computer Science, Computer/Electrical Engineering. Bachelors degree with related industrial experienc
10+ years of work experience in related fields for principal level
Professional Skills:
Strong programming skills with Python, C, C++, Java
Hands-on experience in one of the AI frameworks such as TensorFlow, Caffe, etc
Demonstrated strong background/experience in Natural Language Processing
Strong background in deep learning models such as those with RNNs, LSTMs, and encoder-decoder.
Experience in deploying AI models on real industrial platforms/hardware is required
Experience developing and using virtualization, container-based and cloud platforms such as Kubernetes, OpenStack, Swa
Location
This is a full-time position at our Ireland Research Center based at Townsend Street in Dublin 2, Ireland.
Huawei Ireland
Senior Data Scientist
CT Operations Labs mission is to be recognized leaders of world class intelligent and autonomous operations by driving tec
As a Senior Data Scientist, your knowledge and experience in Data Science methodologies will be applied to solve challeng
Benefits
This is a permanent position at our R&D center based in Dublin, Ireland. Business trips may be requested as necessary to s
Job Title: The Dock Senior Data Scientist (ML & DL)
The Dock Dublin 2
The Dock is a diverse team of creative problem-solvers within Accenture where design, business and technology meet und
We believe future commercial success will come from businesses that are conscious of the intended and unintended conse
We have an opportunity for a Senior Data Scientist in our Analytics & Artificial Intelligence (A&AI) Team. Operating across
We are looking for an experienced hands-on Data Scientist with proven Machine Learning and Deep Learning solution dev
The Analytics & AI team within the Dock applies leading Analytics & Artificial Intelligence (AI) technologies and techniques
• Take a leading role in teams delivering innovative Analytics & AI projects; ranging from research, proofs-of-concept to th
• Be a deep hands-on expert practitioner, owning and shaping development and experimentation in the application of Ma
• Use AWS, Azure and/or GCP tools and services to develop prototype and scalable Analytics & AI solutions, demonstrating
• Work closely with software engineering colleagues to develop robust Analytics & AI applications encapsulated in softwar
• Drive the utilization of best practice in Analytics & AI technical delivery, methods and approaches on project teams.
• Actively contribute to bringing Analytics & AI perspectives to experimentation, design, workshops, sprints and prototypin
• Embrace working in multi-disciplinary teams and collaborate with designers, software developers, business experts and A
• Lead and mentor junior team members in projects and in their technical and professional development.
• Understand the key technology trends in the Analytics & AI domain, the business implications, and be able to match the
• Bring your passion for improving the world through technology innovation to our team and demonstrate the ability to th
• Applying your deep technical expertise to solve problems and develop new solution offerings for our business and our cl
• Bringing deep expertise in developing data-driven solutions in Machine Learning, Deep Learning and Advanced Analytics
• Applying your expertise and hands-on coding experience with AWS, Azure and/or GCP tools and services to develop Ana
• Keeping up to date and learning the capabilities of leading AI platform solution providers and the AI ecosystem; including
• Applying data science, data engineering and solution architecture knowledge to design and develop robust end-to-end A
• Demonstrating deep understanding of the potential utility, limitations and challenges of a wide range of Advanced Analy
• Being responsible for shaping, owning and delivering complex Advanced Analytics & AI applications and experiments and
• Being responsible for estimating, planning and managing your own work and the work of analytics teams.
• Leading teams and nurturing new skills and capabilities in these teams. Contributing to the performance management an
• Interacting, collaborating and sharing experiences with colleagues from different disciplines and backgrounds.
• Presenting to, interacting with and managing client and business stakeholders, clearly articulating and communicating An
• Producing quality technical components, documentation, write-ups, papers or articles that capture the value of the work
• Contributing to the growth and development of the Analytics & AI capability and community in the Dock, outside immed
• Contributing to steering and shaping the development of the Innovation pipeline in Analytics & AI at the Dock, both in te
Requirements
• Masters Degree or higher in Computer Science, Mathematics, Engineering, Artificial Intelligence or a closely related disci
• Deep hands-on experience in a number of Advanced Analytics, Machine Learning, Deep Learning, NLP, Knowledge Graph
• 5 years experience in designing and implementing Advanced Analytics systems for R&D or commercial applications. Indu
• 5 years of project or product development methodology experience, for example agile development and CRISP-DM.
• Proven proficiency in several Analytics & AI related tools, programming languages & frameworks.
• Proven experience in designing and developing AI based systems and applications architected for scale.
• Experience in standards, methods and best practice for Advanced Analytics development, solution quality, accuracy and
• Hands-on experience and expertise in a number of statistical and application programming languages (e.g. Python, R, Jav
• Proven technical experience in solution development with AWS, Azure and/or GCP preferred.
• Experience in leading technical teams and in managing and mentoring team members.
• Experience in dealing with multiple business and technical stakeholders.
Accenture is a global management consulting, technology services and outsourcing company, with more than 490,000 peo
Accenture is an equal opportunities employer and welcomes applications from all sections of society and does not discrim
Sage acquired AutoEntry in September 2019 - demonstrating our commitment to innovation and adding value to Sage Bus
Key Responsibilities Key accountabilities and decision ownership:
Must have:
• Strong theoretical foundations in linear algebra, probability theory, optimization.
• Strong programming skills in Python.
• Experience in working with numpy, scipy, scikit-learn, pandas.
• Experience shipping production machine learning models.
• Experience communicating projects to both technical and non-technical audiences.
• Experience reporting machine learning accuracy in industry.
• You are familiar with (in no particular order): logistic regression, gradient descent, regularization, cross-validation, overfi
Desirable:
• PhD in Computer Science, Electrical Engineering, Statistics, Physics, or similar quantitative fields.
• Publications in top conferences.
• Experience writing complex SQL queries.
• You have deep experience with these things: logistic regression, gradient descent, regularization, cross-validation, overfi
Huawei Ireland
3.5
Data Scientist - Knowledge Graph & ML Researcher (Contract)
Dublin
Huawei Ireland Research Centre is starting a new research team that will focus on research and developing AI algorithms f
The Ireland-based research team will work closely with the Huawei Consumer Business Group (CBG) in the Headquarters, C
Key Responsibilities
Responsible for building NLP-related model capabilities to address service requirements and issues.
Responsible for conducting scientific comparative analysis of operation data and find the proper AI techniques to apply to
Responsible for encapsulating the AI modules into workable research proof-of-concepts.
Responsible for handling huge amounts of data and, hence, building scalable machine learning models.
Responsible for technology transfer of the developed AI models and working closely with Huawei business units.
Skills & Qualifications
PhD in a relevant field is preferred (ML, Knowledge Graph, Computer Science…) and 2-5 years of experience applying adva
Experience with Natural Language Processing (ML, Knowledge Graph) techniques is a must.
Experience with Deep Learning (NNs, Recurrent NNs, Convolutional NNs, Encoder-Decoder with Attentions, Bayesian Deep
Experience working with Deep Learning Programming frameworks such as Keras, PyTorch, TensorFlow
Proficient in Supervised Learning, Un-Supervised Learning, and Semi-Supervised Learning algorithms.
Experience with Model-based Reinforcement Learning, MDPs and optimization is a plus.
Publication track records in machine learning and computer science conferences and journals, including, but not limited to
Experience in Cloud or Data Centres optimizations using AI is a great plus.
Recruitment Privacy Notice: http://career.huawei.com/reccampportal/portal/hrd/...
Woebot
4.1
Senior Machine Learning Engineer
Dublin
As a Machine Learning Engineer, you will work closely with our Data Science, Product and Engineering teams to develop &
Ramping Up
In your first 2 weeks, you'll learn about the Woebot content architecture and how ML and NLP are used to guide conversa
In your first 3 weeks, you'll list improvements that could be made to our existing set of classifiers.
Own Our Machine Learning Models, Systems & Processes
During your first 45 days you will develop infrastructure for the full cycle of our machine learning efforts, this includes, mo
To accomplish this you'll collaborate with engineers to integrate algorithms efficiently with backend production services w
You will also build machine learning models that enable Woebot to more naturally understand users' natural language inp
Improve Woebot's Sentiment Analysis.
Work with our Product team to define solutions and integrate them into a 1-year roadmap.
Help scale our services using GPUs and modern distributed processing tools in the cloud (AWS).
Dig into data with ad hoc analysis as necessary for technical, clinical, and user needs.
Help Woebot Have More Natural Flowing Conversations
Within your first 60 days you will be responsible for gathering data and building data labeling systems so that Woebot con
You will have unique datasets to work with, such as: 1M+ user conversations, support tickets, and other natural/unstructu
Turn Millions of Data Points Into Valuable Insights
By day 90 you will create user profiles and build models that define how Woebot interacts with each user. We consider pe
You will improve our machine learning models that enable Woebot to more naturally converse with and understand users
After a few months you're conducting deeper analysis to improve models that enable Woebot to derive insights about ind
To accomplish this you'll work closely with our Product team to deliver these insights at the right time and in the right man
This Might Be Your Next Career Move IF:
You care about helping make quality mental health care realistically accessible to millions of people nationwide.
You passionately follow the latest trends in NLP and are excited by the challenge of applying the latest research to real-wo
You want to get closer to the data and realize that advances in algorithms often come second to high-quality data.
You love tackling big meaningful issues with data, even though they are often hard to measure.
Core Competencies
You've deployed natural language processing and/or deep learning models using spaCy and/or NLTK
Experience deploying systems into production, at scale, using Docker, Kubernetes, or ECS
Knowledge of one or more modern ML/NLP frameworks, such as PyTorch or Tensorflow
Bachelors, Masters or PhD in Computer Science, Mathematics, Data Analysis or Machine Learning or related technical field
Ability to productize the latest research and establish a clear vision for how ML/NLP can be used in mental health
Agile go-to-market product mindset: we do research and we innovate, but we also ship often
Strong written and verbal communication skills
Our Core Values
Empathic: Place a high value on user-experience. Motivated to help others be successful.
Proactive & flexible: Hit the ground running. Even with ambiguity, you can get the job done.
Self-awareness and growth-mindset: Wants to learn and grow in the role.
High standards: Take pride in your work and apply high standards toward everything.
Strong work ethic: Work hard to get the job done.
Benefits
Competitive Salary
Health, Dental & Vision
Employee Volunteer Program
Bank of Ireland
3.1
Senior Data Scientist
Dublin
Division Description
Led by the recently appointed Chief Marketing Officer, we are creating Group Marketing function which will develop a stro
The Group Marketing function will be underpinned by the following strategic imperatives designed to transform the Bank
The Group Customer Analytics team sits within the Group Marketing function and supports our customers, colleagues and
In a changing market environment, where customer expectations continue to grow and there is an increasing demand for
The purpose of this role is to play a lead technical role in a team responsible for the development of analytical solutions to
Key Accountabilities
Deliver advanced data science projects that transform complex problems into compelling marketing and customer insights
Using the vast amounts of data available develop analytical models that support our customers and deliver on commercial
Lead the development of advanced analytical projects, such as propensity modelling, next best action modelling and text a
Act as technical expert on data mining, machine learning, statistical analysis, and modelling.
Create processes to measure the value of modelling to the organisation with clear targets, KPIs and measurement in place
Identify and integrate new internal and external data sources including internal structured data, semi-structured web data
Use data visualization tools to explain your results simply and succinctly to senior audiences.
Provide technical leadership to team of high-skilled data scientists, while mentoring and developing junior members of the
This is an exciting opportunity to take a senior position on a talented data science team to solve complex problems focused
Essential Qualifications
Third Level qualification, in Maths, Computer Science, Statistics, Economics, Machine Learning, Analytics or an equivalent q
Masters or PhD in Maths, Statistics, Computer Science, Economics, Engineering or Machine Learning.
Web-scraping skills are an advantage
Knowledge of other tools like Cloudera Data Science Workbench is a plus
Experience with Big Data technologies or with Digital/Web analytics a distinct advantage.
Knowledge of Cloudera Data Science Workbench, and associated analytical tools (i.e. PySpark)
People management and coaching experience.
Key Competencies
PDU Radio Products is a part of Development Unit Networks (DNEW) and has an overall responsibility for development of
Radio SW is one of the sectors in PDU Radio, where Kista (Stockholm) is our base but the world is our arena. Radio SW pro
We are now looking for a talented experienced developer to work with one of our teams.
Position Summary:
As a Developer at Radio SW you will be responsible for build and delivery of our Radio SW products following Ericsson com
Position Qualifications:
Core Competences:
Very good experience of SW component build and deployment tools, such as GNU AutoTools and The Yocto Project includ
Experience in automation of build and delivery environments using Jenkins or similar scheduling utility for implementing a
Experience in Product Life Cycle Management and about the Software Development Life cycle including version control to
Experience in using Unix variants, with the ability to program or script if needed in Bash, tcsh, Python, Perl, Ruby, Java, C++.
Experience of modular software, software reuse and SW development in general, ability to compile and link libraries, unde
Knowledge of Agile and Lean methodologies such as Scrum and Kanban.
Good knowledge about Radio system
Radio SW/HW product development knowledge
Knowledge about the DURA functional framework including the CI-machinery
You have a minimum Bachelor’s degree in Computer Science or Electrical Engineering or equivalent and:
a minimum of 1 year of working experience in automation of large build and delivery systems
a minimum of 3 years of working with SW development
a minimum of 1 year of working experience with Yocto or similar build and deployment tools.
good English skills in spoken word and writing
Applications:
For any enquiries and for application details, please contact the responsible Recruiter, Suma Haregoppa Venkatagiri at sum
Please note that you need to submit the application in English, and we do not accept any applications via email. The last d
Job Description
Date: Apr 22, 2020
At Ericsson, you can be a game changer! Because working here isn’t just a deal. It’s a big deal. This means that you get to le
You are responsible for ensuring that the delivered software components provide vital functionality and perform in accord
You Will,
Develop & Maintain different projects in the Telco sector.
Improve your technical knowledge every single day and share your knowledge with your team
Work with the Quality Assurance Team to make things smooth and workable to offer highly qualified products and service
You are responsible for ensuring that the delivered software components provide vital functionality and perform in accord
You Will,
Develop & maintain Java backend services and frontends using ReactJS / Angular
Your developed full-stack applications should ensure outstanding quality, performance, security and documentation stand
You will be located in scrum teams each dedicated to a different product line
Improve your technical knowledge every single day and share your knowledge with your team
Work with the Quality Assurance Team to make things smooth and workable to offer highly qualified products and service
To Be Successful In The Role You Are
Min. 2 years of experience in developing Java-based full-stack systems
Experience with Core Java, Spring, Spring Boot, Hibernate
Good level of understanding of Java technologies
Good software engineering academic background
Experience with React JS and/or Angular JS
Experience with HTML, CSS, JavaScript
Good knowledge of relational database technologies, especially Oracle
Familiarity with dev-ops Technologies such as Jenkins, Docker
Solution-oriented, responsible, and able to take the initiative
BS or MS in Computer Science or related field preferred
No military obligations for male candidates, or postponed for 1 year
Proficiency speaking & writing in Turkish
Job Description
Date: Apr 20, 2020
Software Developer
We are looking for a skilled software developer to help us build the applications in Ericsson’s 5G Core network, connecting
In addition to having solid software development skills, we also hope you would be interested in taking on the Scrum Mast
Job Description
Date: Apr 20, 2020
At Ericsson, you can be a game changer! Because working here isn’t just a deal. It’s a big deal. This means that you get to le
You Will,
Design, implement and unit test features that meet the specifications and requirements
Interact with product management, system and technology team, software/hardware development teams, and stakeholde
Perform troubleshooting and support customer needs
Address complex technical challenges that warrant innovative and future proof solutions
Drive continuous improvements of products and processes
To Be Successful In The Role You Are
Deep experience in C and C++
Knowledge of scripting languages (PERL, bash, Python)
Solid understanding of Linux OS
Networking and IP protocol experience, knowledge of system software
Have good basic knowledge of Lean and Agile principles
Ability to work in agile team (Scrum, Kanban)
5+ years’ experience in similar role
BSc or MSc degree in Electronics Engineering / Computer Science
As the tech firm that enabled the mobile internet connectivity around the world, at Ericsson we’ve made it our business to
We are on a quest: we've promised to never stand still, relentlessly innovating to make technology easy to adopt, easy to
With us you’ll be part of the next step developing our products. We enable millions of simultaneous connected mobile use
For this you’ll learn to master groundbreaking technology in IP networks, distributed real time embedded systems executin
The Ericsson office is located at Lindholmen in central Gothenburg, with beautiful scenery and modern facilities. We’re nex
You’ll be enjoying all the benefits of a good collective agreement, as well as a personal health account, gyms in the house,
The position
We are currently looking for software developers that have knowledge in cloud technologies and tools, container/microse
We stand at the forefront of Agile software development, using methods and principles like Lean and Scrum. Early custom
The team will design, implement and test the feature, product and system from requirements to production and commerc
About you
You are passionate about what you do which is obvious from your actions
You have a talent for software development and computer systems
You are comfortable learning from team members and sharing with team members
You are continuously developing your knowledge through experience, as well as reading and experimentation
You love solving problems and digging into complex problems
You have focus and ambition to understand customer needs on developed features
You take pride in understanding the whole product and its environment
Software is more than just a job for you
As we are looking for developers to cover several different areas within our Packet Core Controller development, the used
Qualifications
BI Developer
Kraków, Poland
Network Operation and Integration
Job Description
Date: Apr 9, 2020
Job Summary:
Currently, for Application Development and Maintenance unit that is responsible for cooperation preparing applications an
As a new joiner, you will be part of a team that recognizes the importance of delivering the highest quali
Key Responsibilities:
Designing and developing BI solutions based on Microsoft Business Intelligence stack focused on SSAS and SSRS with a bit
Providing high level estimations based on a given requirement,
Improving and maintaining the software delivery environment (continuous integration, continuous delivery),
Hands-on development and end-to-end responsibility for features delivery (initial requirement, architecture, story refinem
Cooperating with team to maintain the architectural/platform road maps, practices and standards,
Following best software development practices to ensure high quality product and adherence to design objectives,
Sharing your knowledge and mentoring less experienced team members.
Key Qualifications:
Experience in Microsoft BI Stack (in particular: SQL Server, SSAS Multi-Dimensional model, SSRS),
Experience in working with DWH database platforms and understanding of Data Warehouse and ETL standards,
Basic understanding of Agile methodology (Scrum),
Average level of writing and verbal communication skills in English,
Team oriented – collaborative style willing to support others and ask for help when needed.
What We Offer:
Stable employment on the basis of an employment contract;
Work based on developing the latest solutions in the area of mobile technology;
Clearly defined career paths, trainings;
Rich benefit package (private medical care for the employee and their family, life insurance, Ok System MultiSport);
Work in an international environment based on cooperation;
Flexible working hours, laptop and mobile phone;
Work – life balance.
As an Engineer you will analyze, prepare, implement and verify the configuration and integration of a node, network and/o
Responsibilities
Integration Engineer
Gurgaon, India
Network Operation and Integration
Job Description
Date: Apr 7, 2020
Ericsson is one of the leading providers of Information and Communication Technology (ICT) to service providers. We enab
Job Summary:
We are now looking for an Integration Engineer to analyze, prepare, implement and verify the configuration and integratio
Responsibilities:
You will support pre-sales activities, including pre-studies
Good experience in operations support with ITIL processes.
Very Good hands-on experience in Configuring ClickSoftware Service Optimization Suite ( Click Schedule, Click Mobile, Click
Good knowledge of integrating Click Service Optimization Suite with other products/systems
Must have worked in large scale deployment programs which involves multiple systems and integration
Good hands-on experience on Oracle database, PL/SQL scripting
Must have knowledge in Web deployment projects (.NET web services)
Good understanding of the SDLC (Software Development Lifecycle). Must have been involved in complete project lifecycle execution
Plan the implementation of the product configuration / integration work
Execute product configuration
Execute integration and migration work
Prepare system tests, module tests and acceptance tests
You will work to identify and drive improvements
Post project activities
E2e technical understanding
Execute test
Scripting & coding
Knowledge sharing and collaboration skills
Key Qualifications:
Education: Academic degree, minimum at bachelor level, in engineering (IT, Telecom) or
Good hands-on experience in Configuring ClickSoftware Service Optimization Suite ( Click Schedule, Click Mobile, Click Plan
3-5 years’ experience deploying system tests and leading a testing team.
Min years of experience: (Recruiter to supply)
Domain experience: (Recruiter to supply area of expertise – e.g.: Cloud, BSS, OSS etc.)
Creating & innovating
Applying expertise & technology
Analytical learning and researching skills
Delivering results & meeting customer expectations
You will need excellent planning and organizing skills
Additional Requirements:
If you have ISEB/ISTQB software testing qualifications that would be an advantage
C++ Developer
Guangzhou (Canton), China
Product Development
Job Description
Date: Mar 24, 2020
Job Summary:
We are now looking for a Developer to maintain products (units, nodes, networks, systems and solutions). Your role will in
PDU Radio Products is a part of Development Unit Networks (DNEW) and has an overall responsibility for development of
Radio SW is one of the sectors in PDU Radio, where Kista (Stockholm) is our base but the world is our arena. Radio SW pro
We are now looking for a talented experienced developer to work with one of our teams.
Position Summary:
As a Developer at Radio SW you will be responsible for build and delivery of our Radio SW products following Ericsson com
Develop and maintain the automated build and delivery environment including frameworks and tools.
Track build and delivery failure issues until finished.
Work with Track Management in maintaining SW track and branch strategy.
Take part in early project phases to secure the smooth adaptations needed in the automated build and delivery environmen
Develop and communicate the SW CM strategy for Radio SW.
Interact with both internal and external Radio SW parties to align requirements and share information about our build and
Provide SW CM training to developers and document all processes and procedures
Position Qualifications:
Core Competences:
Very good experience of SW component build and deployment tools, such as GNU AutoTools and The Yocto Project includ
Experience in automation of build and delivery environments using Jenkins or similar scheduling utility for implementing a
Experience in Product Life Cycle Management and about the Software Development Life cycle including version control to
Experience in using Unix variants, with the ability to program or script if needed in Bash, tcsh, Python, Perl, Ruby, Java, C++.
Experience of modular software, software reuse and SW development in general, ability to compile and link libraries, unde
Knowledge of Agile and Lean methodologies such as Scrum and Kanban.
Good knowledge about Radio system
Radio SW/HW product development knowledge
Knowledge about the DURA functional framework including the CI-machinery
You have a minimum Bachelor’s degree in Computer Science or Electrical Engineering or equivalent and:
a minimum of 1 year of working experience in automation of large build and delivery systems
a minimum of 3 years of working with SW development
a minimum of 1 year of working experience with Yocto or similar build and deployment tools.
good English skills in spoken word and writing
Applications:
For any enquiries and for application details, please contact the responsible Recruiter, Suma Haregoppa Venkatagiri at sum
Please note that you need to submit the application in English, and we do not accept any applications via email. The last d
Add your career to the resources of Ericsson and amazing things can happen. We are a world leader in the rapidly changin
We are now looking for a Systems Architect in VoLTE area at MTAS to join our development organization in Budapest.
At our Development Center MTAS within the Product Development Unit (PDU) Converged Core we develop and support th
MTAS comprises several application servers providing a rich feature set enabling communication services for multiple type
You will love working in an organization with agile end-to-end (E2E) development teams and close collaboratio
The MTAS development organization is distributed across Sweden, Hungary and India. This position is located in Budapest
Define the scope and shape the requirements in collaboration with product management and technical leaders
Feature break down into user stories and epics
Solution and system design in the areas of VoLTE/MTAS features.
Networking and collaboration with Market Areas, Customers and Strategic Product Management to understand customer
Developing in line with MTAS practices of lean, agile, continuous integration and continuous delivery
Contributing to MTAS architecture evolution and governance as part of the MTAS architecture community
Contributing to the continuous evolution of MTAS best practices in ways of working
Be a key person in knowledge sharing and learning
Have the opportunity to continuously develop your technical competence
What we offer
As a first step within our selection process, you will be asked to fill out our technical tests based on your experience and pr
The candidates who best match the criteria for the position, will move on to the next step in the process – the interview.
Position Description
•Work with leading edge technology for Ericsson’s media storage and delivery solution
•Perform software design and verification: from requirement analysis, system design, implementation, verification till deli
•Co-operate with colleagues in Europe, U.S.A and Israel to ensure a large-scale e2e solution with good quality
•Perform trouble-shooting on released software globally and travel across different countries if necessary
Qualifications
•Solid experience in Java (or C, C++/Lua) programming, at least 3 years
•Solid experience with Linux, including but not limited to configuration for different purposes, performance tuning and debug
•Familiar with the network architecture and protocol, e.g. TCP/IP, UDP, HTTP, as well as traffic analysis
3. Others
•Accountable, and can work independently, easy to work with and strong teamwork spirit
Job Description
Date: Feb 12, 2020
Job Summary:
We are developing a world-class load testing tool (named Dallas) supporting the continued success of Ericsson’s Packet Core
Dallas is an essential part of the R&D process that maintains the SUT (System Under Test) at a premium quality standard
You will be immersed in ICT buzzwords, e.g. 3GPP, 5G, Virtualization, Cloud, Distributed Systems, Kubernetes, ...
Bachelor's degree or above in Computer Science, Telecommunication, Information Technology, or Electronics.
Good knowledge in C/C++ programming
Good knowledge in software development methods and tools (Object Oriented design, Design pattern, UML, Rose)
Good knowledge in data structure, e.g. array, list, set, queue, tree, etc.
Good knowledge in Linux
Good knowledge in Python and Erlang is preferred
Good knowledge in Unix (Solaris) and VxWorks is preferred
Good English communication skills (both in written and verbal), able to express own thinking clearly
Ability to learn new technology quickly and apply to work tasks
Innovative and solution thinking abilities
Open minded and willing to accept challenges
Responsibilities:
. Work as a developer to design and test high-quality software products.
. Work directly with product owner, understand customer requirements and come up with solutions.
. Work in cross-functional team, which is self-organizing, international and highly independent with Lean/Agile best practi
. Drive or contribute to the continuous improvement of defined concepts, such as architecture and refactoring, unit/componen
. Keep learning and trying new things in software craftsmanship that suit and benefit the team.
Key Qualifications:
. Bachelor's degree or above, majoring in Computer Science, Telecommunication, Software Engineering or equivale
. Solid knowledge of C/C++, Python, Perl programming.
. Familiar with the network architecture, protocol and traffic analysis with Tcpdump, Wireshark and other packet analysis
. Linux development experience, working with high-performance Linux applications.
. Good English communication skills (both in written and verbal), able to express own thinking clearly
. Good troubleshooting and debugging skills, including analyzing core dumps, memory leaks and performance issues.
. Excellent teamwork and communication skills
With over 90,000 employees across 180+ countries, we have a culture that respects and supports your ambitions, in alignm
Next Steps:
What happens next once you apply? Read about the next steps here
For your prep and reference, here is our overall Brand video and some insights about our innovations in 5G
Ericsson provides equal employment opportunities (EEO) to all employees and applicants for employment without regard
Ericsson complies with applicable country, state and all local laws governing nondiscrimination in employment in every loc
This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, terminati
Ericsson expressly prohibits any form of workplace harassment based on race, color, religion, sex, sexual orientation, marit
CCES (Cloud Core Exposure Server) is a commercial realization of the 3GPP Network Function NEF.
It builds on the programmability of 5GC (SBA/MSA/CNA), exposing network capabilities and structured data from 5G Core doma
CCES is fully developed based on Cloud Native principles, with a SW architecture based on micro service technology, with c
We sincerely invite you to join us, witness the coming success at the same time to develop your personal competence and
Work with product owner to understand customer requirements and come up with solutions
Work in Agile software development process to develop new features in an efficient way
Handle customer inquiry or perform product maintenance work with strong trouble-shooting skills
Design documentation like component description and modeling during system design and implementation
Co-operate with colleagues in Europe to ensure the software delivery on time with good quality
General Qualification
Technical Qualification
Good programming skills in Java, familiar with Java Core (Java 8 or later)
Good knowledge of multi-threaded programming and garbage collection.
Good knowledge of OO concepts, design principles, patterns, and so on.
Good knowledge of SQL/NoSQL databases (PostgreSQL, Cassandra); performance tuning experience is preferred.
Familiar with Microservice, Container, and Kubernetes technology
Familiar with Helm - Package Management Tool
Familiarity with MSA (Microservice Architecture) is a plus
Web application development experience is a plus
Familiar with Python, Shell, and Golang, Netty, Spring-Cloud
Familiar with Git
Familiar with Maven, Gradle build tool
Familiarity with automation test frameworks and skill in writing test cases with JUnit in Java is preferred
A technical blog or open source / GitHub project is a plus
Familiar with TCP, HTTP protocols is preferred
Design the radio HW solution from a system view. Lead the technical team to implement the design and responsible the de
You will work as a Radio system designer, covering the areas below:
Responsibilities:
Radio unit HW solution design: define the HW solution from system level.
Radio functions solution design: define the radio function solution, especially related with HW part.
Radio link budget: radio performance allocation, define the requirement on sub-level
Trouble-shooting
Write related design documents
Process control of product development
Pre-study tasks for new technologies or new products
Technical coordination within team
ude services, consulting, software and infrastructure within Information and Communications Technology.
orked Society: a world connected in real time that will open up opportunities to create freedom, transform society and drive solutions
performance-driven culture and an innovative and engaging environment. As an Ericsson employee, you will have freedom to think bi
eration of wireless networks, i.e., 5G network with distributed edge compute, that will drive economic and social transformation for al
ologies is what Ericsson uses to drive thought leadership to automate and transform Ericsson offerings and operations. MI is also a key
ontiers to automate, simplify and add new value through large and complex data.
caling, monitoring and performance. You shall build effective AI models using stacking/ensemble techniques; and provide prediction ex
g real-world problems as part of a highly dynamic and global team. You will work in a highly collaborative environment where you com
reation for Ericsson in AI/ML
QL Databases. Design data pipelines and flow strategies.
ard/canonical data models by combining multiple data sources.
nderstand MI-driven business needs and opportunities
hese needs. This includes working with petabytes of 4G/5G-networks, IoT and exogenous data, and proposing/selecting/testing predic
o leverage existing data models and build new ones as needed.
machine learning models and solutions as part of Ericsson offerings including providing source code, workflows and documents
nurturing the communities and mentoring junior data scientists.
ning, Electrical Engineering or related disciplines from any of the reputed institutes. First Class, preferably with Distinction.
experience of about 15+ years.
ble scenarios
as NOSQL DBs
older business units, global customers, technology and other ecosystem partners in a multi-culture, global matrix organization with se
ng of data science and Machine Learning tools. Your knowledge and experience in Data Science methodologies will be applied to solve
hese needs. This includes working with petabytes of 4G/5G-networks, IoT and exogenous data, and proposing/selecting/testing predic
o leverage existing data models and build new ones as needed.
machine learning models and solutions as part of Ericsson offerings including providing source code, workflows and documents
ning, Electrical Engineering or related disciplines from any of the reputed institutes. First Class, preferably with Distinction.
experience of around 10+ years.
park ML etc.
initiatives and research papers addressing their functionalities, scalability and overall industrialization viability
industry/application domain
older business units, global customers, technology and other ecosystem partners in a multi-culture, global matrix organization with se
w, Dataiku, etc.
gn, development, integration and documentation using groundbreaking engineering and delivery practices
ud-based systems and solution in consultation with external and internal partners
hrough our networks. We enable the full value of connectivity by creating game-changing technology and services that are easy to use
graphics representation and reporting i.e. R, Python
ou have a passion for developing and deploying digital solutions in an iterative way together with cross-functional teams? Join our dig
e solutions, and applying and industrializing advanced analytics / AI / ML applications. The ideal candidate is able to show a strong trac
matics, Physics, or related field (e.g. applied mathematics/statistics)
business problems (predictive modeling, customer segmentation / clustering, network analysis, etc.)
processing
sh as soon as possible. For any questions and clarifications reach out to the recruiter, Hema Powar, hema.powar@ericsson.com. As a p
note that you may be requested to complete the same, when you apply for this position.
eloped products. This function ensures that a stable and correct test environment is available, and to perform system related tests for
rdware licenses and inventory.
cycle of the product. Indoor and outdoor positioning 5G, IoT, Cloud development, modern SW delivery pipelines are all technologies
to join our team. Ideally, you are passionate about technology, a problem solver, and enjoy working in new areas and taking on new c
o yong.sze.miin3@huawei.com, applications without a CV will not be considered. Please make sure to have your attachments in Englis
ons, such as anomaly detection, user behavior prediction, and many more in large scale.
ce with applications.
tand the question, and devise an analytical approach to reach actionable answers.
Driven by a commitment to sound operations, ongoing innovation, and open collaboration, we have established a competitive ICT portf
s one of the largest and fastest-growing platforms in the world. It has strong presence with over 40 availability zones located across 4 c
ns team in Huawei Munich Research Center (MRC), to participate in the rapid prototyping of new features, innovation, and proof of co
than one third of the world’s population.
evenues in research and development, which enhances our competitiveness while driving industry and technology.
ollaborate with each other in a constructive and solution oriented manner.
e experts from around 50 different countries.
art of a multicultural team and growing environment, feel free to contact us. Driving future technologies with focus on customer satisf
nd Apache Beam
Lib or Storm / Samza are desirable
nt, Engineered products & infrastructure, Financial institutions, Health care payers & providers, Insurance, Media & entertainment, Me
ant challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today
ing deep industry and functional expertise and a range of perspectives to spark change. BCG delivers solutions through leading-edge m
machine learning with deep industry expertise. The BCG GAMMA team is comprised of world-class data scientists and business consult
asily interact with our analytics applications to measure the success of their new analytics enabled organization or quickly make decisio
date will have industry experience working on a range of different machine learning disciplines, eg anomaly detection, payment fraud
w we use technology, machine learning, and data to enable the productivity of our clients. You will help envision, build, deploy and dev
odels into production together with a highly multi-disciplinary team of scientists, engineers, partners, product managers and subject do
ive practical application knowledge in the field of big data and database, furthermore, you are keen to explore and innovate, and build
n global information and communication technology (ICT) solution providers. To achieve this goal, we are building an industry-recogni
nfrastructure and promotes simplified enterprise IT systems for service flexibility. Huawei is constantly pursuing innovation, providing
tion Big Data Analytics platform while collaborating with experts and working with cutting edge technologies in the Big Data space:
operations by driving technical innovations, influencing market and product development in order to deliver significant business value
applied to solve challenging real-world problems. Your contribution will also help to create new offerings in the areas of ML driven pla
hese needs. This includes proposing/selecting/testing predictive models, recommendation engines, anomaly detection systems, statisti
rage existing data models and build new ones as needed.
learning models and solutions including providing source code, workflows and documents
ncluding, but not limited to, ICML, NIPS, AISTATS, UAI, AAAI.
ed and unintended consequences of their work. That's why we're passionate that true innovation must deliver value for Accenture, our
Team. Operating across all stages of the innovation spectrum, with a remit to build the future in real-time. The working environment i
ep Learning solution development skills. This role will focus on technical solution delivery across the Docks project portfolio; experime
nologies and techniques to address significant real-world business and societal challenges. Collaborating with our clients, designers, so
solutions, demonstrating your experience in the capabilities and limitations of these tools and platforms.
es on project teams.
ps, sprints and prototyping to help identify breakthrough ideas, concepts and solutions to business problems.
nd be able to match the future opportunities of those to the current and emerging challenges of our clients
ons and experiments and being responsible for their business value and measures of success.
backgrounds.
ng and communicating Analytics & AI concepts, solutions and value to non-technical audiences.
AI at the Dock, both in terms of new concepts, technologies, approaches and methods and in terms of business applications.
g, NLP, Knowledge Graph or other Artificial Intelligence domains and proven experience in the application of these technologies to add
ety and does not discriminate on grounds of race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domes
adding value to Sage Business Cloud Accounting and Financial Management solutions. AutoEntry is one of the fastest growing automa
learning problems.
g senior management.
n, cross-validation, overfitting, bias, variance, convex optimization, eigenvectors, relational databases, SQL, latency, computational com
titative field.
eveloping AI algorithms for solving such complex research challenges leveraging the recent advancements in AI. This position is expecti
G) in the Headquarters, CBG covers smartphones, PC and tablets, wearables, mobile broadband device, family device and device cloud
business units.
luding, but not limited to, ICML, NIPS, AISTATS, UAI, AAAI.
ering teams to develop & productionize machine learning algorithms that are the core of Woebot's intelligence. You will work with NLP
efforts, this includes, model training, feature extraction, deploying produced models, data processing, and rigorously A/B testing.
nd production services while helping define our data team's processes and tooling.
ers' natural language input, and generate appropriate responses. Initially you will prioritize the following:
ems so that Woebot continually learns from ongoing conversations to better recognize the sentiment and intent behind users' natural
other natural/unstructured data sources.
ach user. We consider personalization to be key for developing a relationship over time, and delivering precision intervention - that is,
th and understand users. Combining intent classifiers, chit chat, and task-oriented models that help users achieve their goals of feeling
derive insights about individual users, thus allowing it to give personalized feedback to users, such as "Did you realize that you're happ
time and in the right manner to help users gain new insights about themselves.
le nationwide.
atest research to real-world problems.
high-quality data.
d to transform the Bank of Ireland brand: Customer planning across business under one group brand strategy and agreed brand pillar p
n increasing demand for more personalised customer experiences, Customer Analytics is leading the charge to enable and deliver this
of analytical solutions to deliver enhanced customer experience and actionable data insights based on data analysis. The types of proj
nd measurement in place.
emi-structured web data, unstructured text data to improve model accuracy.
bility for development of Radio products for all Radio Segments, like AAS, Macro, Indoor and High Frequency. The PDU has operations i
our arena. Radio SW provides possibility to work with the latest technology within telecommunication development and offers a rich n
ts following Ericsson common processes for configuration management, continuous integration and continuous deliveries. You will als
and delivery environment according to project requirements.
The Yocto Project including Poky and BitBake for embedded software on Linux distributions.
tility for implementing and integrating continuous delivery pipelines.
luding version control tools like GIT, Gerrit and ClearCase.
Ruby, Java, C++.
le and link libraries, understand advantages of Shared Libraries.
o work outside your own discipline
build and delivery environment.
tions via email. The last day to apply is before 18th May 2020.
s means that you get to leverage our 140+ years of experience and the expertise of more than 95,000 diverse colleagues worldwide. A
utions. As a developer, you will be involved in the development and maintenance of business-critical applications.
ty and perform in accordance with the overall requirements as well as to the customer’s expectations.
utions. As a Java Full Stack Developer, you will be involved in the development and maintenance of both backend services & web front
ty and perform in accordance with the overall requirements as well as to the customer’s expectations.
nd documentation standards
ore network, connecting mobile devices around the world to the internet. You’ll be joining a small cross-functional development team
aking on the Scrum Master role in the development team, or taking the lead in system analysis, breaking down requirements and desi
s means that you get to leverage our 140+ years of experience and the expertise of more than 95,000 diverse colleagues worldwide. A
that offers services for both fixed and mobile network infrastructures. It offers services such as IP/MPLS edge routing and Evolved Pack
nt teams, and stakeholders to understand customer and product requirements
e made it our business to make a mark. Ericsson Packet Core has never had a greater opportunity to lead change; setting the bar for te
y easy to adopt, easy to use and easy to scale. This demands from all our people the creativity to discover, the accountability to delive
us connected mobile users while handling traffic from IoT to 4K video in Gbit speeds. All of this in a virtual deployment using container
dern facilities. We’re next to Chalmers and the university, with other cutting-edge tech companies as our closest neighbors in a buzzin
and Scrum. Early customer and partner feedback is a key element in our development process which is based on Continuous Integratio
production and commercial deployment. We use CI/CD (Continuous Integration/Continuous Delivery) flow and philosophy, with pipelines des
erimentation
r development, the used tech might vary. Here is a non-exclusive list of tech we use:
preparing applications and providing services to external customers we are looking for Software Developers to work in our office in Kr
deliver the highest quality product. Our team is full of passionate people building every day a great ecosystem of different kinds of app
SSAS and SSRS with a bit of integration from various applications and systems,
design objectives,
ETL standards,
stem MultiSport);
rks (RAN). In this role, you will be part of our customer-facing team(s) assembled to maintain and implement our Solutions at our Custo
of a node, network and/or system. Your scope of work could include the scenarios of introduction, upgrade expansion, functionality an
rvice providers. We enable the full value of connectivity by creating game-changing technology and services that are easy to use, adop
nfiguration and integration of a node, network and/or system. Your scope of work could include the scenarios of introduction, upgrade
hedule, Click Mobile, Click Plan, Click Forecast and Click Analyze)
olutions). Your role will include all development activities such as: requirement analysis, system design, architecture design, hardware d
er in the rapidly changing environment of communications technology – by providing hardware, software, and services to enable the f
nization in Budapest.
e develop and support the Multimedia Telephony Application Server (MTAS) which is a key component of current and future commun
services for multiple types of customers and deployments. Our customer base is global and stretches from tier 1 mobile operators to fi
ms and close collaboration with product management and other technical leaders.
on is located in Budapest/Hungary.
hnical leaders
good quality
emium quality standard wrt. Stability/Robustness/Capacity &Characteristics, in addition, also a key differentiator during Sales activitie
ubernetes, ……
d Electronics.
with Lean/Agile best practices. The team will design, implement and test the feature, product and system from requirements to produc
actoring, unit/component test, automated testing and continuous delivery.
your ambitions, in alignment with our values of Respect, Professionalism and Perseverance. Ericsson is extremely focused on learning
loyment without regard to race, color, religion, sex, sexual orientation, marital status, pregnancy, parental status, national origin, ethn
employment in every location across the world in which the company has facilities. In addition, Ericsson supports the UN Guiding Princ
ent, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, training and development.
sexual orientation, marital status, pregnancy, parental status, national origin, ethnic background, age, disability, political opinion, soci
data from 5G Core domain, to enable automated control loops for different use cases, thereby creating new revenue streams for the o
ervice technology, with common development framework across portfolio of products, deployable on containers on bare metal or VM
Job Description
Date: Apr 7, 2020
Location: Mississauga Ontario Canada
Are you a highly detailed person who’s not afraid of digging into the data and p
We are the Products and Services Readiness team within Market Unit North Am
society and drive solutions to some of our planet’s grea
We are currently looking for Network Engineer who will perform the role of a F
l have freedom to think big and the support to turn id
As a Feature Test Engineer, you will work in our radio labs, as well as in the cust
You may also be asked to work on automation projects that aim to improve and
vironment where you communicate and plan tasks and ideas. You will be working on high impact initiatives with other DS in Machine
To be successful in the role, you must have:
Bachelor’s or Master’s Degree in Electrical Engineering, Computer Science, Com
4+ years of relevant experience in Telecom Industry, ideally in Radio Domain
Knowledge of Radio access technologies (LTE, NR)
Background in communications theory and digital signal processing
Linux knowledge
Data analysis knowledge
Good command of MS Office, especially MS Excel and PowerPoint
Hands on experience with common tools (TEMS, MapInfo, ACTi, Nemo, ITK, AM
Coding and scripting skills. Languages include: Java, C, C++
Good to have: Python and/or R scripting languages
Good to have: Database access scripting (e.g. SQL)
ng/selecting/testing predictive models, recommendatio
Good to have: Machine Learning
ws and documents
You might also have:
Our Opportunity:
There is nothing average about what we do on the Ericsson team and we aren’t
We are now looking for several developers to maintain products (units, nodes,
In this role you would:
ws and documents
Test Environment Scrum Master
Ottawa, Canada
Product Development
Job Description
Date: May 1, 2020
Location: Ottawa
h Distinction.
You Will:
ensure the team lives agile values and principles and follows the established pr
plan, execute and deliver code (test cases and common modules), as per agr
work in close collaboration with system architect, product owner and the syste
implement ways of working and communication flows to and from the develop
develop software code in Java for automated execution of radio performance t
plan resources for test system support to verification teams
provide formal documentation of test setup and code modules
help to maintain and improve the development lab environment
contribute to the overall success of the verification team goals
be responsible for competence development and overall team success
You Have:
B.Sc. in Computer or Electrical Engineering (or equivalent)
minimum 5 years SW development experience in object-oriented programming
experience using Java, Eclipse IDE, JCAT, Git or ClearCase
good knowledge of Linux/Unix environment
good understanding of 2G, 3G, 4G and 5G wireless communications technologi
hands on experience in the lab using spectrum analyzers and signal generators
Ericsson CENX fundamentally changes the way service providers view their netw
The architecture team works with product management, developers, and the b
The Machine Learning Architect will be primarily responsible for guiding CENX’s
Skills in software design, applied statistics and technical leadership combine to
The CENX Machine Learning Architect will contribute to the full stack of machin
You will:
Design and architect new application components, tools, or core software com
Define, document and communicate architectural decisions
Participate in the entire software development cycle by analyzing, specifying re
Design solutions to complex problems with an emphasis on efficiency, quality, a
Manage risk identification and risk mitigation strategies associated with archite
Constantly research and innovate in bringing the new tools, technologies, soluti
Drive strategy in evolving existing data model to meet demands of new applica
Work with developers daily to provide mentorship, resolve technical issues and
DevOps DBA
Montreal, Canada
Product Development
Job Description
Date: Mar 4, 2020
Job title: DevOps / Database administrator
Location: Montreal, QC
(English to follow)
Our stimulating proposition:
You will be posted in Canada, in Montreal, and you will be part of an internatio
Your tasks:
Would be a plus:
Job summary:
We are looking for a network engineer who will be responsible for optimizing th
The role focuses on designing, auditing and optimizing services, for which you w
Main responsibilities:
Job Summary:
We are now looking for a Network Engineer that will be responsible for perform
The role focuses on executing design, audit and optimization of services, where
Key Responsibilities:
Ericsson complies with applicable country, state and all local laws governing no
This policy applies to all terms and conditions of employment, including recruiti
Context
Within the Mobile Networks Business Group, in the 5G product architecture de
This internship fits into exploratory projects conducted in our department, and
The considered applications domain is the allocation and the optimization of ra
Role
In this context, the main objective of the internship is to contribute to the intro
This requires fundamental skills in machine learning from model design to impl
The validation of the proposed solutions would consist in assessing the benefit
1. Getting familiar with the procedures for radio resources allocation in 5G netw
2. Selecting one (or more) specific use cases and identifying the necessary optim
At the end of this internship, this experience will allow you to reinforce and put
- Integrated into an AGILE team, you will participate in real-time software deve
R & D Engineer
🔍Bangalore, Karnataka, India
📁Applied R&D 💼NSW Nokia Software 20000000RM
Key responsibilities/Job description:
--> Development activities with flavours of UT, NT & functional tests of Nokia R
--> Feature delivery along with handling of legacy topics (issues, customer topi
--> Responsible for the product quality & deliverables in new features, custom
--> Support during customer escalations for Nokia Registers & Legacy HLR/HSS
--> Quick solutions to team for supporting team
Skills required:
--> Hands-on development experience with C++ is mandatory
--> Domain & Protocol knowledge in the areas of UDM, AUSF, HSS, HLR, DIAME
--> Knowledge in Analysis, Design, Development/Testing.
--> Strong analytical and debugging skills.
--> Team Player, Self-motivated and able to work with little supervision
--> Usage of ROBOT, Rammbock or IPSL is mandatory. (for testing)
--> Excellent Interpersonal and Communication skills
--> Good knowledge in Python (mandatory)
--> Knowledge in GO language, Java technologies, python is an added advantag
--> Platforms: Linux
Innovative, full of ideas and eager to invent and apply new things
Self-guided and eager to take responsibility
Motivated to learn and explore new ideas and technologies
Be an out of the box thinker and problem solver
Team player and believer in the power of "we" rather than "me"
Looking forward to meeting you!
L1 SW C++ Developer
🔍Saint Petersburg, Russian Federation
📁Applied R&D 💼MN Mobile Networks 1900000KTH
Nokia is a global leader in the technologies that connect people and things. Wit
Serving customers in over 100 countries, our research scientists and engineers
Position for 5G L1 SW development.
Requirements:
Bachelor’s, Master’s, or Doctorate’s degree in Computer Science, Information T
Project experience with at least 3 of the following programming languages: Jav
Project experience with at least one version control tool, preferably Git
Ability to learn new programming languages and tools quickly
Excellent English communication skills in both writing and speaking
Ability to solve complex problems logically
Strong time management and self-discipline skills
Ability to work well in a team-oriented environment with international colleagu
Ability to cope with tight deadlines
Demonstrates enthusiasm, curiosity, and motivation to learn and try new things
Dedicated team player with innovative spirit
As intern, you will belong to the team in charge of specifying and developing a
For this internship the following skills are required:
Engineer , 5G L1 integration
General purpose:
Now we are looking for several highly motivated, enthusiastic and talented futur
You will have state of the art tools and methods available for your job with high
Depending on role you will do integration and testing or prepare your thesis wo
Responsibilities:
elines are all technologies we are using in our daily development practices.
Mandatory Requirements:
Self-drive and drive to lear
Team player with good communication (verbal and written) skills
Decent English skills
Good to have:
Python or C or C++ or Java or bash competence
GIT competence
Jenkins competence
Data Engineer 2
Job Description:
We are looking for Interested candidates for Data Engineer to not only help us
Develops and maintains scalable data pipelines and builds out new API integrati
Working experience with Tableau, QlikView, Mode, Matplotlib, Jupyter, or simi
Extensive experience analyzing data using SQL
Required Minimum Qualifications: (Education, Technical Skills/Knowledge)
You will be part of the Cloud Mobility Manager (CMM) within the ION Division o
The CMM delivers a converged packet core solution which addresses both evol
Key Job Responsibilities:
chnical skills.
and unstructured data.
Development & prototyping in Virtualization/Cloud Computing including techno
Openstack and other well-known NFV-I platforms, including AWS and Containe
Data analytics/Artificial Intelligence/Machine Learning including a strong knowl
Coding in C/C++
You will need to be:
Qualifications:
Skills:
Essential:
Desired:
The L1 Downlink Physical Layer team in Ulm is looking for an ambitious embedd
The work assignment will be the Nokia base station and as a member of a L1 sc
Expectations:
General Purpose:
We are looking for highly skilled, advanced technical experts to be part of a tea
You will join Applied R&D SRAN (Single Radio Access Network) team within Nok
Your mission
Be part of the testing team for delivering the newest technology and features e
Your main responsibilities will be
To analyse system concepts and complex product and system features;
Test environment building, integration and maintenance;
According to project assignment, to create automated test suites and scripts fo
To execute manual or automated test cases against real equipment and to repo
To write defect reports and verify corrections;
To collaborate with the stakeholders (system specification team & SW develope
To act with independence and discretion in routine matters.
Training
Role description
You will join Research and Development 5G Software team, responsible for dev
40% development
100% FUN!
Team Description
You will be part of a Development Unit (around 1300+) among 11 R&D Tribes in
Latest achievements
Since the beginning of the race towards first position in 5G market, our team ha
Training
uing innovation, providing customers with big data, da
A customized training plan will be proposed at your arrival. During the first mon
Qualifications
in the Big Data space:
Key must-haves:
If you want to take part in this adventure that will shape the future, join us. Ap
The work area combines RF and system design tasks as well as RF measuremen
r significant business value. To achieve this we are building an industry-recognized multi-discipline lab of experts with focus on medium
the areas of ML driven platform intelligent monitoring
The tasks require capability to communicate efficiently with suppliers and proje
detection systems, statistical model, deep learning, reinforcement learning and other machine learning systems
Evaluation of new design tools and supporting their wide adoption within R
RF design tasks and HW development
Documentation
Participation in RF measurements to close the loop from verification to simulations
Qualifications
h our clients, designers, software engineers and busines
At Bell, we do more than build world-class networks, develop innovative servic
If you’re ready to bring game-changing ideas to life and join a community that v
Bell’s Business Intelligence team is responsible for the management and optimi
Responsibilities
Lead the development of machine learning products and models from inceptio
Explore new data sources to uncover new business opportunities at all levels of
Identify areas for ML/AI opportunities and demonstrate to internal clients how
Build and implement strategies for ML-driven projects
Work with partners within Customer Operations and across Bell to make data-d
Work with and present to all management levels
Maintain and expand your knowledge of ML/AI and current technology through
Core Skills
Algorithms
Advanced knowledge of ML models: deep learning, reinforcement learning, NLP
Hands-on experience and expertise with different AI/ML frameworks such as Ke
Stay abreast of new technology and techniques in the ML/AI space
Coding
Advanced Python development skills
Experience in other programming languages Scala, C, C++, Java, Shell
Excellent code design (OOP, Algorithms, and Data Structures)
h and industry.
Experience with CI/CD pipelines
Data
Understanding RDBMS, Distributed, and NoSQL databases
Proficiency in SQL
Understanding of Spark and MapReduce
Quick learner with ability to think out of the box
Additional Information:
Please apply directly online to be considered for this role. Applications through
ess applications.
At Bell, we don’t just accept difference - we celebrate it. We’re committed to fo
these technologies to address real-world business pr
Accommodations are available on request for candidates taking part in all aspe
Java Developer
The role
Java Developer required to perform strong coding throughout development of
Responsibilities:
Industry
Information Technology & Services Computer & Network Security Information
AI. This position is expecting to have a strong AI background with deep AI algorithmic knowledge such as anomaly detection, reinforce
ily device and device cloud service, and is the second largest smartphone manufacturer in the world. Huawei Consumer BG is dedicate
ce. You will work with NLP to create a best-in-class conversational engine, manage data labeling teams to ensure high quality training
sion intervention - that is, methods that are tailored to each user.
hieve their goals of feeling happier while also feeling natural and conversational.
ou realize that you're happiest on Sundays, and least happy on Tuesdays?"
e Bank in the markets in which we operate. This will enable us to shape and deliver on our purpose to enable customers, colleagues an
y and agreed brand pillar priorities, Deep brand & customer insight, Consistent & compelling customer communication across all touch
to enable and deliver this personalised experience through data analytics and the application of machine learning and artificial intellig
analysis. The types of projects involved in achieving this vision include leading the development of recommendation engines, building
. The PDU has operations in Kista (KI), Gothenburg (LN), Beijing (BJ), Nanjing (NJ), Lund (LD), Ottawa (OT) and Chengdu (CH). PDU Radio
opment and offers a rich number of opportunities in an everyday learning, creative and challenging atmosphere. With an agile way of
ous deliveries. You will also be responsible for developing and maintaining our Radio SW build and delivery environment including nee
e colleagues worldwide. As part of our team, you will help solve some of society´s most complicated challenges, enabling you to be ‘th
e colleagues worldwide. As part of our team, you will help solve some of society´s most complicated challenges, enabling you to be ‘th
ctional development team working with feature development, where the tasks include requirement analysis, system design, developm
wn requirements and designing feature behavior. Experience from either of these roles is a plus, but not necessary.
e colleagues worldwide. As part of our team, you will help solve some of society´s most complicated challenges, enabling you to be ‘th
e routing and Evolved Packet Gateway functionalities. SSR enables complete network convergence so subscribers can access services f
ange; setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable and connected world.
he accountability to deliver and the courage to remove complexity wherever it presents itself. Your commitment to these qualities will
osest neighbors in a buzzing area. There are plenty of opportunities for learning and networking at Lindholmen Science Park, and there
loud transformation to meet the 5G journey.
d on Continuous Integration SW practices. We rely on team collective responsibility to finish the tasks according to a prioritized backlo
osophy, with pipelines designed for as-a-Service and to enable DevOps, with fast feedback loops. We believe in test driven developme
to work in our office in Kraków. Our employees in ADM department work for global and local projects building relationships with busi
expansion, functionality and capacity. You will work in a diverse multi-national team, and working experience from a telecom operator
that are easy to use, adopt, and scale, making our customers successful in a fully connected world. Headquartered in Stockholm, Swed
s of introduction, upgrade expansion, functionality and capacity. Your work will in part form our customer legacy. Good hands-on expe
tecture design, hardware design, software design, integration, verification, simulations, tools design, Product Lifecycle Management su
. The PDU has operations in Kista (KI), Gothenburg (LN), Beijing (BJ), Nanjing (NJ), Lund (LD), Ottawa (OT) and Chengdu (CH). PDU Radio
opment and offers a rich number of opportunities in an everyday learning, creative and challenging atmosphere. With an agile way of
ous deliveries. You will also be responsible for developing and maintaining our Radio SW build and delivery environment including nee
nd services to enable the full value of connectivity.
urrent and future communication services solutions across the globe. Our portfolio has strong market traction and we are getting into n
er 1 mobile operators to fixed replacements and cable operators to also include the enterprise market. Our goal is to provide a system
tiator during Sales activities (e.g. Evaluation & Benchmark testing).
et Core and supports multi-access, GSM, WCDMA, LTE, 5G and interworks with Wi-Fi and CDMA with seamless transitions between th
emely focused on learning and development, supports mobility and flexible working hours. We are also committed to diversity and inc
tatus, national origin, ethnic background, age, disability, political opinion, social status, veteran status, union membership or genetics.
ports the UN Guiding Principles for Business and Human Rights and the United Nations Global Compact.
ility, political opinion, social status, veteran status, union membership or genetic information.
iners on bare metal or VMs and fully stateless with separation of business logic and data storage.
digging into the data and problem solving? Do you love the border between development and product? Ericsson is this place, and we
thin Market Unit North America. We lead testing and introduction of new Radio technology, such as IoT and 5G, for North American te
labs, as well as in the customer live networks. You will gain familiarity with the Ericsson radio products (SW and HW) through product
with other DS in Machine Intelligence to drive growth and economic profitability for Ericsson and its customers by accelerating curren
g, Computer Science, Computer Engineering or related field
ideally in Radio Domain
nal processing
d PowerPoint
pInfo, ACTi, Nemo, ITK, AMOS, QXDM, etc.)
tten in English
icsson team and we aren’t looking for average people; if you are exceptional, then you will fit right in. To succeed you must appreciate
f Software Development, you will be responsible for the design and implementation of scalable, high-performance, fault tolerant capa
in products (units, nodes, networks, systems and solutions). Your role will include all development activities such as: requirement anal
ng development team
m member to work on state-of-the-art multi-standard radio equipment for global deployment by leading mobile operators. Your role w
ect-oriented programming
ommunications technologies
zers and signal generators is an asset
olving skills
e providers view their networks. As leading provider of network and service operations software solutions, our product ingests all of a
ent, developers, and the broader Ericsson R&D community to shape and guide our overall development strategy and focus. We bridge
ponsible for guiding CENX’s machine learning strategy just as our customers are beginning to adopt Big Data and Machine Learning tech
cal leadership combine to form a superpower that can deliver revolutionary solutions to a key global industry. Ericsson works with mo
to the full stack of machine learning software, from platform design and specification to data exploration and algorithm tuning. The s
ools, or core software components and provide technical direction to the team.
by analyzing, specifying requirements, designing, and developing new tools, product features or platform core features.
asis on efficiency, quality, and simplicity. Consistently deliver performant and high-quality software in an agile fashion
ies associated with architecture
w tools, technologies, solutions, ideas and frameworks to the forefront.
t demands of new application domains
esolve technical issues and hurdles, and implement features.
kaging, solution optimization, automated development, test automation, monitoring and analysis, HA, cluster) and integration into a l
e, monitoring, troubleshooting (in depth, up to the level of Java code)
and technique)
to development / to open source communities.
/ JVM applications
ms (Artifactory, Nexus), Jenkins
esponsible for optimizing the design and auditing of a network to meet customer requirements. The position relates to the design and
g services, for which you will be responsible for part of the solution and the service process. Therefore, you will need to comply with E
ctivities (large metropolitan area or state-wide geographic area), and responsibilities must include technical organization and work sup
meter changes, activation of functionalities on data from tests, in order to improve system performance.
MS Investigation, Actix, Agilent,
ge of the difference between artificial and real traffic loading
formance statistics, and solving complex problems
l quality of service, for both PS and CS traffic
be responsible for performing design optimization and audit of a network to meet customer requirements. The position is applicable f
mization of services, where you will be accountable for part of the solution and of the service process. Hereby, you should be able to ke
radio network design, implementation and tuning / optimization on 3G & LTE systems.
oLTE projects.
ties in a large scale (major metropolitan area or state wide geographic area) and responsibilities must include technical organization a
faces Design
G, Planet EV, Atoll
to create a new project in a planning tool
es, feature activations on drive test data to improve the system performance.
TEMS investigation, Actix, Agilent,
nd the difference between artificial and real traffic loading
EEO) to all employees and applicants for employment without regard to race, color, religion, sex, sexual orientation, marital status, pre
all local laws governing nondiscrimination in employment in every location across the world in which the company has facilities. In add
loyment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, trai
arassment based on race, color, religion, sex, sexual orientation, marital status, pregnancy, parental status, national origin, ethnic back
l || ProdMgt
ed in our department, and aiming at the introduction and the implementation of machine learning techniques in 5G networks.
and the optimization of radio resources for Multi Users massive MIMO systems in 5G. Beamforming, scheduling, traffic load balancing
s to contribute in the introduction of machine learning algorithms in radio resource management procedures for 5G networks.
rom model design to implementation and a good know-how of alternative ML approaches (such as reinforcement, supervised, unsupe
he implementation constraints in the products, such as inference latency, availability of data, in order to select the learning method ta
st in assessing the benefit of the machine learning based approaches compared to the conventional methods. This validation will be pe
tifying the necessary optimizations taking into account the RAN constraints
pecially 5G networks,
umes of data as well as for validation of machine learning algorithms.
UL-PHY LOKI)
he heart of our connected world. With the research and innovation capabilities of Nokia Bell Labs, we provide communications service
t / Apprenticeship
avaScript / Matlab,
ysis and synthesis, management of priorities, teamwork, curiosity, enthusiasm, open to others, communication.
th little supervision
y. (for testing)
ch as machine learning, computer vision and data science is desired. If you have an MSc and a lot of experience in the topics we are in
perience (academic work and R&D project experience within your PhD duration is usually considered)
are a great person. You probably have a blend of these personal attributes, which we value:
y new things
r than "me"
ect people and things. With state-of-the-art software, hardware and services for any type of network, Nokia is uniquely positioned to h
h scientists and engineers continue to invent and accelerate new technologies that will increasingly transform the way people and thin
already be a specialist in some of the above-mentioned topics. And finally, we are looking for an ambition to further develop as 5G L1 So
will work in Nokia Software Technical Communication. Nokia Software Technical Communication is a global team of technical commun
documentation from XML sources
nt team's workflow:
uter Science, Information Technology or equivalent disciplines, with at least 1 year student status from the time of hire.
ogramming languages: Java, XSLT, Javascript, Python, SQL
ool, preferably Git
g and speaking
itted to diversity and inclusion. At Nokia, employment decisions are made regardless of race, color, national or ethnic origin, religion, g
ecifying and developing a web application supporting the 5G Node B information model. Interacting with stakeholders, you will propos
rt? Are you interested in gNB integration, test script development, test automation or SW development?
able for your job with highly motivated and experienced team.
g or prepare your thesis work.
s, 5G base station and CI (continuous integration automation environment). As a result, our customers will get fully functional systems
-driven cars.
written) skills
gineer to not only help us build data pipelines to efficiently and reliably move data across systems but also to build the next generation
builds out new API integrations to support continuing increases in data volume and complexity.
Matplotlib, Jupyter, or similar data visualization tools
ical Skills/Knowledge)
ect people and things. With state-of-the-art software, hardware and services for any type of network, Nokia is uniquely positioned to h
h scientists and engineers continue to invent and accelerate new technologies that will increasingly transform the way people and thin
which addresses both evolution scenarios: 5G (AMF), 4G LTE overlay today with 2G/3G consolidation, or 2G/3G renewal today with fut
omputing including technologies like:
helors/masters degree in Computer Science, Computer Engineering, Electrical Engineering or Information Technology (with focus Artifi
earning Algorithms.
nowledge (IPv6/IPv4), familiar with Linux, Ansible, KSH, HTTP2.0, XML, PHP, YAML, JSON, Web Design (JavaScript, Ajax, JQuery), SQL (m
of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disabilit
scriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orienta
of race, color, national or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disabilit
g for an ambitious embedded software development expert who will work in engineering role. You will be part of a scrum team respon
ed for Multiple
experts to be part of a team that is responsible for embedded software development for Cellular Radio Modules. You will design and i
n, develop and test software for the new radio base stations. Participation in analysis and solving complex engineering problems. Parti
Design methodologies
may require making laboratory measurements and/or interacting closely with the hardware design team in order to observe the actua
Network) team within Nokia’s biggest business group Mobile Networks, responsible for testing software & hardware functionalities for
d system features;
ation team & SW developers) and support other testers and developers;
rrival. During the first months at Nokia you will have support from your manager, team and dedicated buddy and the opportunity to g
nd others. The position will involve taking these skills and applying them to some of the most exciting and massive data and analytics p
y degree in Electronics and Telecommunications or Computer Science;
team, responsible for developing software functionalities for the Nokia 5G base stations, ready to be delivered to customers all over t
unctionalities in our base station product and be involved in the complete development cycle (specifications, coding, testing).
+) among 11 R&D Tribes in different countries & continents. Within your Tribe in Timisoara, you will belong to one of the Squads, each
in 5G market, our team has full responsibility for 5G Cloud Control Unit in Timisoara (Requirements, Architecture and Design) and suc
rrival. During the first months at Nokia you will have support from your manager, team and dedicated buddy and the opportunity to g
as well as RF measurements with evaluation of new tools and methodologies targeting to design and verification flow optimization. Th
ences on RF engineering but also requires willingness and capability to learn new things.
erts with focus on medium-term to long-term issues. The Lab will work closely with an open innovative ecosystem with Huawei Europe
ly with suppliers and project teams which you may support in taking new tools and methods into use after evaluation.
verification to simulations
s in a multicultural environment
develop innovative services and create original multiplatform media content – we’re revolutionizing how Canadians communicate.
nd join a community that values bold ideas, professional growth and employee wellness, we want you on the Bell team.
creating the ultimate service experience for our residential, wireless and small business consumers. We lead strategic development an
e management and optimization of BI systems used to analyze customer behavior, automate business insight processes target marketi
e it. We’re committed to fostering an inclusive, equitable, and accessible workplace where every team member feels valued, respected
ates taking part in all aspects of the selection process. For a confidential inquiry, simply email your recruiter directly or recruitment@b
roughout development of our products and collaborate with multiple teams across the organization. This person will greatly influence
in point of data entry for accountants, bookkeepers and businesses, so they can spend time on the things that really matter to their bu
i Consumer BG is dedicated to delivering the latest technologies to consumers and sharing the happiness of technological advances w
nsure high quality training data and pipelines, enable Woebot to deliver meaningful insights to users at scale, personalize Woebot's con
e customers, colleagues and communities to thrive and to build the National Champion Bank in Ireland.
munication across all touchpoints, Data driven marketing, Customer experience insight & innovation design, Integrated Technology tra
arning and artificial intelligence. To deliver on this vision the Customer Analytics team leverage internal and external data and apply da
endation engines, building and deploying predictive models and the commercial activation of business insights. This role will include lea
Chengdu (CH). PDU Radio Products consists of approximately 3000 R&D professionals and development partners, developing, suppor
here. With an agile way of working we develop 5G, LTE, WCDMA, GSM Network solutions to operators all over the world.
environment including needed SW CM tools. The position also requires that you work pro-actively with aligning our build and delivery e
ges, enabling you to be ‘the person that did that.’ We’ve never had a greater opportunity to inspire change; setting the bar for technol
ges, enabling you to be ‘the person that did that.’ We’ve never had a greater opportunity to inspire change; setting the bar for technol
, system design, development, and verification. As a team, you take responsibility for your feature from analysis to delivery and beyon
ges, enabling you to be ‘the person that did that.’ We’ve never had a greater opportunity to inspire change; setting the bar for technol
ibers can access services from any devices or locations. Ericsson Hungary is a main contributor in the development of the SSR product
nd connected world.
ment to these qualities will always be encouraged, and never go unnoticed. As a team, we are helping to tackle some of society´s most
en Science Park, and there is a wide range of restaurants for your convenience.
ding to a prioritized backlog. Our teams are multi-functional, self-organizing and highly independent. You’ll be working directly with yo
e in test-driven development (TDD), and therefore competence in test case development and execution is valuable.
ng relationships with business and co-workers within international environment.
e from a telecom operator, telecom vendor, BSS/OSS vendor, consultancy or other professional services team is a plus.
artered in Stockholm, Sweden, Ericsson is proud of its global presence across 100+ countries and market areas. With a strong focus on
egacy. Good hands-on experience in configuring ClickSoftware Service Optimization Suite (Click Schedule, Click Mobile, Click Plan, Click
t Lifecycle Management support and product documentation. Our focus is on Lean and Agile ways of working. We prioritize in multi-fu
Chengdu (CH). PDU Radio Products consists of approximately 3000 R&D professionals and development partners, developing, suppor
here. With an agile way of working we develop 5G, LTE, WCDMA, GSM Network solutions to operators all over the world.
environment including needed SW CM tools. The position also requires that you work pro-actively with aligning our build and delivery e
n and we are getting into new markets, expanding existing deployments, enhancing the feature set and transforming our portfolio from
goal is to provide a system that is in the first line of the communication service evolution at the same time as it can be integrated and i
ess transitions between the access types. We believe it offers the best performance for our customers on the market today. We are loo
mitted to diversity and inclusion and to being a responsible and relevant driver of positive change. We also offer some awesome benefits,
membership or genetics.
csson is this place, and we have an exciting opportunity for you in a fast-paced, highly collaborative technical environment.
and HW) through product testing, as well as with the Ericsson product acceptance process for our large customers in North America. Y
ers by accelerating current Ericsson offerings. Your contribution will also help to create new offerings in the areas of MI driven 4G and
cceed you must appreciate that everything matters; every feature, every team member, and every user. We rely on innovative thinking
mance, fault tolerant capabilities. You will apply analytics principles to massive data sets, to unearth crucial insights into the impacts o
such as: requirement analysis, system design, architecture design, hardware design, software design, integration, verification, simulati
nologies in compliance with industry best practices
obile operators. Your role would primarily involve leading a software development team and code development to support the radio per
our product ingests all of an operator’s network data, across multiple domains and physical and virtual infrastructure. Harnessing the p
tegy and focus. We bridge the gap between business requirements and technology in both directions, constructing solutions to meet
and Machine Learning techniques into the operation of their networks. To this day much network operation is a manual affair, execut
y. Ericsson works with mobile vendors across the world, large and small. Our data scientists have access to rich data sets with years o
nd algorithm tuning. The successful candidate will connect to business stakeholders, local development teams and the broader Ericsso
ent skills, capable of identifying and solving complex problems requiring in-depth analysis by creation, adaptation or use of appropriate
will need to comply with Ericsson's requirements for time, performance and quality, as set out in customer contracts.
organization and work supervision, without forgetting to demonstrate an excellent level of communication with the team, the site ma
The position is applicable for design & optimization of RAN, BBA, Transmission, Core, OM and Services Networks. You will be engaged i
y, you should be able to keep time, performance and quality according to Ericsson requirements and customer contracts.
e technical organization and oversight of work and demonstrated excellence in communications with team, project manager, and cust
ntation, marital status, pregnancy, parental status, national origin, ethnic background, age, disability, political opinion, social status, ve
mpany has facilities. In addition, Ericsson supports the UN Guiding Principles for Business and Human Rights and the United Nations Gl
national origin, ethnic background, age, disability, political opinion, social status, veteran status, union membership or genetic informati
hes, in order to optimize the performance of mobile access networks.
es in 5G networks.
ling, traffic load balancing, QoS / QoE management and contextualized data mining are some practical examples of optimization.
s for 5G networks.
ement, supervised, unsupervised or hybrid). Knowledge of optimization techniques such as Transfer Learning / Federated Learning an
s. This validation will be performed on a software platform embedding the usual libraries in machine learning (e.g. Keras, PyTorch) and
e communications service providers, governments, large businesses and end users with the most comprehensive portfolio of products
etwork compression.
anomaly detection)
eam of technical communicators who create customer documentation solutions for Nokia Software products.
or ethnic origin, religion, gender, sexual orientation, gender identity or expression, age, marital status, disability, protected veteran st
keholders, you will propose solutions to lead the implementations of new features.
et fully functional systems in time. 5G Test libraries created during the process by the team are
o build the next generation of data tools to enable us to take full advantage of this data. In this role, you will learn and work with the c
developing APIs, developing machine learning models, creating advanced data visualizations.
is uniquely positioned to help communication service providers, governments, and large enterprises deliver on the promise of 5G, the
m the way people and things communicate and connect.
3G renewal today with future evolution to 4G LTE and 5G technology on the same platform. The Cloud Mobility Manager (CMM) produ
chnology (with focus Artificial Intelligence/Machine Learning),
cript, Ajax, JQuery), SQL (mySQL) - or other database exposure. Experience with projects in virtualization/VM/public cloud and containe
ge, marital status, disability, protected veteran status or other characteristics protected by law.
in, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected
ge, marital status, disability, protected veteran status or other characteristics protected by law.
art of a scrum team responsible for implementation of 4G and 5G functionalities in next generation base station.
ules. You will design and implement features and functionality for the products focusing on all technologies including 5G NR / mMIMO
ngineering problems. Participation in functional specifications reviews to identify issues. Close cooperation with other project teams a
order to observe the actual versus expected behavior of the software running on the radio.
ardware functionalities for the Nokia 4G/5G products, ready to be delivered to customers all over the world.
y and the opportunity to get in touch with 5G technology and our development environment.
assive data and analytics problems across multiple industries. We are looking for machine learning engineers to join the ML effort for o
, coding, testing).
o one of the Squads, each composed of 5-9 people. Your team has objectives to deliver in a sprint (1 month). In an environment where
cture and Design) and successfully implemented a development environment fully deployed in Cloud.
y and the opportunity to get in touch with 5G technology and our development environment.
ation flow optimization. This involves product and RF component level simulation tools and methodologies optimization. RF system sim
ystem with Huawei European customers to address real-world issues. The Lab will also engage with key European universities to build
nadians communicate.
e Bell team.
d strategic development and execution of day-to-day operations, develop tools and processes to drive service enhancements, manage
t processes target marketing & contact strategy opportunities and provide insight to drive optimal business decisions.
ber feels valued, respected, and supported, and has the opportunity to reach their full potential. We welcome and encourage applicati
directly or recruitment@bell.ca to make arrangements. If you have questions regarding accessible employment at Bell please email ou
rson will greatly influence the quality levels of our products and have an important role within the operation.
at really matter to their business.
g and developing scalable AI algorithms, the team will have access to large real datasets collected from Huawei devices across Europ
technological advances with more people around the world. Walk the walk and make dreams come true.
, personalize Woebot's content and relationship to each user, and execute experiments that allow Woebot to deliver the right method
Integrated Technology transformation, Robust Marketing spend effectiveness & governance.
external data and apply data analytics, machine learning and visualisation techniques to maximise the value of this data and deliver po
ts. This role will include leading data science projects from a technical point of view, as well as providing technical leadership to data sc
rtners, developing, supporting, and maintaining Ericsson products.
er the world.
ng our build and delivery environment and tools with the DURA functional framework requirements for our common CI-machinery. Si
setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable, and connected world.
setting the bar for technology to be inclusive and accessible; empowering an intelligent, sustainable, and connected world.
pment of the SSR product in direct partnership with other Ericsson sites. We are looking for a Senior Developer for our Evolved Packet Gate
kle some of society´s most complicated challenges, enabling you to be ‘the person that did that’.
e working directly with your Product Owner and partners in a collaborative manner. This requires maturity and a ‘team first’ approach
m is a plus.
as. With a strong focus on innovation, we possess 49 thousand registered patents and a global strength of over 95 thousand competen
ng our build and delivery environment and tools with the DURA functional framework requirements for our common CI-machinery. Si
sforming our portfolio from virtualized products to cloud native products.
s it can be integrated and interoperable with legacy technologies and the different regulatory requirements of the global market.
e market today. We are looking for intelligent, creative and self-motivated software developers who are passionate about design code o
r some awesome benefits, amazing career development and training programs to provide an empowered career in a connected world
l environment.
omers in North America. You will be exposed to the latest radio products that will be rolled out in our biggest customers networks. Th
areas of MI driven 4G and 5G network, distributed cloud, IoT and other emerging businesses.
rely on innovative thinking to come from every member of the team – we know brilliant ideas can come from anyone. If you can envis
nsights into the impacts of alarms to services and reduce the cost to fix alarms for our customers.
ation, verification, simulations, tools design, Product Lifecycle Management support and product documentation. Our focus is on Lean
nt to support the radio performance verification of new radio products. Testing is executed in a high-tech lab environment using both a
tructure. Harnessing the power of big data analytics, CENX visualizes network and service topology, inventory, fault, and performance
tructing solutions to meet concrete needs, and proposing opportunities made possible by new technologies and techniques.
n is a manual affair, executed by network engineers using heuristics and accepted practices. However, the deployment of 5G networks
rich data sets with years of history that document a deep variety of facts about network operation across the full stack, from radio phy
ms and the broader Ericsson Machine Learning community to innovate and execute concrete solutions, cross pollinate knowledge, and
ation or use of appropriate procedures, techniques and methods. The candidate must be motivated, have an excellent capacity for lea
be involved in the whole process, from the pre-sale of services and networks to the delivery and acceptance of services.
mer contracts.
g / Federated Learning and of ways to implement them considering the level of complexity they introduce, is required as well.
lity Manager (CMM) product combines the MME (4G LTE) and SGSN (2G/3G) functions, paving the way for AMF (5G) functionality.
/public cloud and containers.
al, state or local protected class.
including 5G NR / mMIMO / 4G LTE.
with other project teams and stakeholders from other foreign locations around the world. Due to the close interaction between the e
s to join the ML effort for our teams, building ML-based systems, tools, and services that serve as infrastructure for our internal and ex
. In an environment where trust and autonomy are encouraged, each team member selects the tasks to work on and exchanges daily
ptimization. RF system simulations and component/sub-module models development are part of job role. All these help RF projects to
opean universities to build a basic research capability to support Huawei technical projects.
e enhancements, manage customer loyalty and retention, and leverage big data and artificial intelligence to create intellectual propert
e and encourage applications from people with disabilities.
ent at Bell please email our Diversity & Inclusion Team at inclusion@bell.ca.
uawei devices across Europe and will have access to Big Data processing infrastructure as needed (e.g., GPUs, Spark Clusters).
o deliver the right method to the right person at the right time.
of this data and deliver positive outcomes for our customers. The complex challenges the team faces means there is considerable oppo
ation. Our focus is on Lean and Agile ways of working. We organize in cross functional development teams in which continuous improv
environment using both automated and manual methods.
y, fault, and performance in a single pane, in real time. We enable the world's largest and most innovative service providers to scale th
eployment of 5G networks will explode the scale and complexity of wireless networks well past the ability of this paradigm to cope.
e full stack, from radio physics to video streaming quality. In the CENX group we particularly look for opportunities that cross those bo
pollinate knowledge, and educate colleagues about our ML efforts.
n excellent capacity for learning, be independent and have the desire to continuously improve.
nd acceptance.
5G, from the Internet of Things, to emerging applications in the fields of virtual reality and digital health, we are shaping the future of
interaction between the embedded software and the radio hardware, the successful candidate must be able to understand specificati
ure for our internal and external clients.
rk on and exchanges daily with his/her team mates on the progress and difficulties.
l these help RF projects to develop higher quality products in faster manner and evaluating RF performance even before actual HW is
create intellectual property.
there is considerable opportunity for team members to grow and develop both their technical and non-technical skills to achieve thei
xternal Radio SW parties trying to align many different requirements on our build and delivery environment.
y of 40 percent of the world’s mobile traffic, thereby connecting more than 2.5 billion subscribers and counting. We are a world leader
g environment where you can grow together with other members in an inspired cross-functional team.
orrect as many faults as possible, prior to rolling out the product network wide. As part of your work, you will be interacting with vario
n we want to talk to you.
which continuous improvement, innovation and knowledge sharing is part of the daily work.
ervice providers to scale their operations as the network scales.
K-Means Clustering
Elbow
Cluster
Cluster Analysis + Mapping
Cluster Visualization
KPIs Supervised Learning (NQI Targets)
(ML (Supervised + DQN)) SDCCH Establishment >> NQI Parameter tuning >> DQN
depth study relationship between them
sources
Coursera Udacity Udemy Medium TowardsDataScience AnalyticsVidhya
machinelearningmastery Chicago University very important Books University Lecture
Element Toronto >> UoT
British Columbia
Waterloo
Cornell
CMU
Packt O'Reilly
Groups:
Group1: Basics
Math + Prob + Stat + Convex + Discrete + Linear Algebra
PGM ??
Group2: ML + DL + RL
ML
Deep Learning
NLP
Reinforcement Learning
Experience building end-to-end pipeline and deploying machine learning models
Time Series
TensorFlow
Keras
PyTorch
Experience with ML collaborative platforms/pipelines (MLflow (an open source platform for the machine learning lifecycle), Neptune, Kubeflow (the Machine Learning Toolkit for Kubernetes))
Notebook experience (Jupyter, Zeppelin, Databricks, etc.)
Group3:
Software Developer: DS + Design pattern + refactoring + DDD + Application Performance and Memory Management
Fluent Python >> High High Level programming
OOP+Solid+Pattern + DS + Algorithms
BlockChain
Java > Udacity >> Coursera OOP+Solid+Pattern
Android + iOS
C++ >> Udacity
APIs
REST APIs
Ideal Candidates Will Also Have
Proficiency in Python's asyncio and aio
TDD, DDD and refactoring skills
Experience in developing and supporting frameworks
Hands-on experience with Docker
Web full stack experience
Familiarity with Jenkins
https://www.coursera.org/specializations/advanced-app-android
Imperial College London
Group4:
Cloud Computing : GCP + AWS + Azure
Linux command line and shell scripting.
Group5:
data mining language (e.g., R, SAS, SPSS),
Group6:
Big Data:
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, ElasticSearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
data automation and ETL tools ie: WhereScape, SSIS, Informatica
analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, Couch
Group7:
Agile Scrum and/or kanban method
Agile software development and tools ie: JIRA, Azure DevOps,
Group8:
visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI
Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)
Group10:
Software Engineering : Hammad
Group11:
BlockChain
3G+4G+5G Planning & Optimization
Group12:
Business Finance
HackerRank Question (interview)
Group13:
Optimization
PGM
Bayesian
Gaussian
Group14:data-science-interview-questions-and-answers/
https://data-flair.training/blogs/data-science-interview-questions-and-answers/
Bayesian Russia + Duke Bayesian Statistics very very important + University of California, Santa Cruz (mcmc-bayesian-statistic
Month-4
Time-ser udemy + Coursera 15-April
Johns Hopkins
Washington
Time series udemy + practical-time-series-analysis Feature Engineer + Feature Selection + Projects
ensemble project Packt
applied AI
https://courses.analyticsvidhya.com/courses/applied-machine-learning-beginner-to-professional/?utm_source=blog&utm_medium=comprehensive-guide-k-means-clustering
Projects >>Hisham >> Udacity + Udemy
Coursera
https://dzone.com/articles/5-best-reinforcement-learning-courses
Practical Reinforcement Learning (Russia Coursera)
Reinforcement Learning in Finance + advance (NYU ) Coursera
https://www.cs.cmu.edu/~epxing/Class/10708-20/lectures.html
https://www.cs.cmu.edu/~epxing/Class/10708-19/lectures/
https://www.youtube.com/playlist?list=PLoZgVqqHOumTqxIhcdcpOAJOOimrRCGZn
Apart from the MOOC by Daphne Koller as mentioned by Shimaa, you can look at the following courses on PGMs:
1. Machine Learning and Probabilistic Graphical Models by Sargur Srihari from University at Buffalo. You can find the video lectures and slides at this link: http://www.cedar.buffalo.edu/~sr...
2. Probabilistic Graphical Models by Andreas Krause from Caltech. You can find the slides at this link: http://courses.cms.caltech.edu/c...
3. Probabilistic Graphical Models by Eric Xing from CMU. Slides at this link: http://www.cs.cmu.edu/~epxing/Cl...
4. Probabilistic Graphical Models by David Sontag from NYU. Slides at this link: http://cs.nyu.edu/~dsontag/cours...
Projects
?utm_source=blog&utm_medium=comprehensive-guide-k-means-clustering
o. You can find the video lectures and slides at this link: http://www.cedar.buffalo.edu/~sr...
nk: http://courses.cms.caltech.edu/c...
Groups:
Group1: Basics
Math + Prob + Stat + Convex + Discrete + Linear Algebra
PGM ??
Group2: ML + DL + RL
ML
Deep Learning
NLP
Reinforcement Learning
Experience building end-to-end pipeline and deploying machine learning models
Time Series
TensorFlow
Keras
PyTorch
Experience with ML collaborative platforms/pipelines (MLflow, Neptune, Kubeflow, etc.)
Notebook experience (Jupyter, Zeppelin, Databricks, etc.)
Group3:
Software Developer: DS + Design pattern + refactoring + DDD + Application Performance and Memory Management
Fluent Python >> High High Level programming
OOP+Solid+Pattern + DS + Algorithms
BlockChain
Java > Udacity >> Coursera OOP+Solid+Pattern
Android + iOS
C++ >> Udacity
APIs
REST APIs
Ideal Candidates Will Also Have
Proficiency in Python's asyncio and aio
TDD, DDD and refactoring skills
Experience in developing and supporting frameworks
Hands-on experience with Docker
Web full stack experience
Familiarity with Jenkins
https://www.coursera.org/specializations/advanced-app-android
Imperial College London
Group4:
Cloud Computing : GCP + AWS + Azure
Linux command line and shell scripting.
deployment
Group5:
data mining language (e.g., R, SAS, SPSS),
Group6:
Big Data:
10+11+12
Garage>>Data Eng (Udacity >> Udemy >> Coursera
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, ElasticSearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
data automation and ETL tools ie: WhereScape, SSIS, Informatica
analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, Couch
Group7:
Agile Scrum and/or kanban method
Agile software development and tools ie: JIRA, Azure DevOps,
Group8:
visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI
Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)
Group10:
Software Engineering : Hammad
Group11:
BlockChain
3G+4G+5G Planning & Optimization
Group12:
Business Finance
HackerRank Question (interview)
LeetCode
System Design
Group14:data-science-interview-questions-and-answers/
https://data-flair.training/blogs/data-science-interview-questions-and-answers/
Group15: IELTS
Group16:application
Optimization
Trading AI
udacity ML
Coursera Bayesian
Coursera NLP + Slideshare
[Master ML >> ML >> ML 3 best Course]
appliedAI Courses
datacamp
udemy A-Z >> R
Time Series
DS Coursera >> DS Udacity + Discrete Math
Udacity > intro Tensorflow > intro to pytorch
Udacity >> Reinforcement Learning in depth
>>python fluent >> OOP >> Design Pattern
>>Deep Learning
Coursera+Hisham >> Tensorflow in practice
Torrent:
University of Michigan
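Rogers DAMT (Data Analytics and Marketing Tech) is looking for a Lead Data Scientist role to deliver Machine Learning Acceleration Project as part of Marketing Transformation initiative.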
Data Scientist will be expected to lead, define, and execute challenging projects tha
Data Scientists will be primarily responsible interpret and solve deep business prob
Manage customer churn
Acquire and retain customers
Build Likelihood To Recommend (LTR/NPS) clusters or models,
Increase revenue through cross sell and up sell products/services
Increase campaign effectiveness
Skills/Qualifications
Bachelor or Master's degree in a technology/analytical field such as Computer Scie
A minimum of five years of relevant professional experience in an Agile environme
Deep expertise in developing and maintaining the predictive models using advance
Excellent programming knowledge in any of the languages (SAS, Python, R, Spark/Scala)
Expertise in data extraction, manipulation, feature engineering and validations
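Experience in building Deep neural networks (MLP, CNN, RNN) and use of AI/Deep Learning frameworks like MXNet, Caffe 2, Tensorflow, Theano, CNTK, and Keras is an added advantage
Experience in working with any of industry standard analytical/ML modern ML platforms like (SAS Viya, Cloudera, Sage Maker Azure ML)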
Experience in managing and leading complex projects and/or multiple projects sim
Strong organizational skills and the ability to manage multiple tasks simultaneously
Exceptional interpersonal, persuasion, and communication skills in order to commu
Exposure to Reinforcement learning (DynaQ/Q+, SARSA, TD, Monte Carlo) is preferred
As a Full-Stack Data Scientist, you will analyze, design, and implement AIOps solutio
What will you do?
Work on challenging and research-based initiatives using advanced machine learning methods focusing on tangible outcomes
Provide analytics support to all TI pillars. This involves collaborating proactively wit
Prepare and integrate large and various types of data (structured/non-structured)
Implement machine learning models, data mining methods, and statistical analysis
Leverage visualization tools/packages to create powerful representations of results
Produce data-driven insights to help in informed decisions and actions by telling a convincing story
Effectively communicate findings to business partners and executives
Collaborate with the development team to deploy production-scale solutions
Quickly learn new tools and technologies and use them in the daily analytics exercises
What do you need to succeed?
Must-have
Bachelor, Masters or PhD. in Computer Science, Statistics, or relevant fields.
Expert in Python programming to write production-ready codes
Strong data profiling, cleaning, mining and technical documentation skills
2+ years of experience in building machine learning models (Supervised/Unsupervised)
2+ year(s) experience with NLP and text analytics methods and packages
Experience building end-to-end pipeline and deploying machine learning models
Experience with big data technologies - parallel processing techniques and Apache
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with custom Web interfaces, API calls, and systems integration
Familiar with Linux environment, shell scripting, and Git
Experience working in an agile environment
Nice-to-have
Software engineering background with a focus on statistics and/or analytics
Experience in deep learning methods for NLP in Tensorflow or Pytorch
Familiar with Technology Infrastructure
With a strong history of innovation for over 100+ years, we are Canada’s largest div
Being part of the RACE21™ team, you’ll be leading the charge of a company-wide r
With the financial backing and commitment from our leadership, we envision a full
Renew , Automate , and Connect material, processes, equipment flows, and data s
Empower our employees by investing in digital skills and capabilities to enhance cre
We are world-class leaders in sustainability and safety – and we are building a bett
Reporting to the Manager of Technology, the Data Scientist, RACE21 brings deep u
Responsibilities
Be a courageous safety leader, adhere to and sponsor safety and environmental ru
Designs, develops, and implements end-to-end cloud based machine le
Ensures that data pipelines are scalable, repeatable, and secure, and can serve mu
Enables big data and batch/real-time analytical solutions that leverage emerging te
Collects, parses, manages, analyzes and visualizes large sets of data using multiple
Translates complex functional and technical requirements into detailed architectur
Codes, tests, and documents new or modified data systems to create robust and sc
Implements security and recovery tools and techniques as required
Works with fellow Data Scientists and Engineers to ensure that all data solutions ar
Ensures all automated processes preserve data by managing the alignment of data
Develops standards and processes for integration projects and initiatives
Qualifications
Master’s or PhD Degree in Information Technology, Computer Science, or a related
Minimum five years of experience in data science
Understanding of high performance algorithms and R statistical software
Experience in industry data science (e.g., machine learning, predictive maintenance
Capability to architect highly scalable distributed systems, using different tools
Demonstrated experience with object oriented design, coding and testing patterns
Expert knowledge of data modeling and understanding of different data structures
Strong understanding of Agile methodologies
Experience as a Data Scientist on an agile team or other rapid development metho
Excellent problem solving, critical thinking, and communication skills
Experience in developing presentations and communications to be shared with inte
Brings a high energy and passionate outlook to the job and can influence those aro
Able to build a sense of trust and rapport that creates a comfortable & effective wo
Passion for innovation and “can do” attitude
Ability to travel to our sites in British Columbia 20-40% of the time
Responsibilities
In this role, you would report to the VP Engineering. Responsibilities will include:
Conceiving and developing data driven solutions for clients
Creating and presenting results and findings to clients
Collaborating with developer resources to build data pipelines and operationalize s
Data cleaning & reshaping
Qualifications
Masters or Ph.D in Econometrics, Financial Economics, Mathematics, Statistics, Phy
2-5 years of experience in a Data Science role
Experience in one or more programming languages (Preferably Python)
Experience with one or more data science frameworks (e.g. Numpy, Scikit-learn, Sc
Strong understanding of data and business
Ability to work autonomously and in a team
Working in a fast-paced environment
Ability to solve complex problems
Strong communication skills
Nice To Haves
Past experience with one or more business intelligence/analytics platfo
Analytical experience related to the food and beverage industry
Past experience working with time series data
Skills
Jupyter
Forecasting
Python
Statistical Modeling
Statistics
Interpreting Data
Problem Solving
Science
Highly Desirable
Experience in the cybersecurity industry
Experience big data analytics databases is a plus
Knowledge of networking concepts, technologies and devices (Firewalls, Routers, S
Knowledge of network and related web protocols (such as TCP/IP, UDP, IPSEC, HTT
Knowledge of SQL
Linux System knowledge as user and administrator
Background in statistics
Qualifications
Advanced degree in Statistics, Computer science, Behavioural Science or Mathema
4+ years of experience analyzing product and/or SaaS data, building ML and/or NLP
Demonstrated experience in leveraging data for actionable insights using different
Demonstrated experience working with both structured and unstructured data and
Have software engineering skills (scripting is important for quick prototyping, but k
Experience working with Google Cloud Platform
Experience building recommender systems and preferably NLP applications
Have deployed models to production with an engineering team
Excellent communication – written, conversational, presentation, and data-visualiz
Experience with both software engineering (ideally in an agile environment and wit
Experience in an HR space (e.g. People Analytics) or knowledge of Organizational P
Data Scientist
Job Description
As a Data Scientist, you are an experienced machine learning practitioner and pyth
Signal processing
Geo-spatial data processing
Natural Language processing
Social network analysis
Recommender system
Adaptive experimentation techniques
You will be part of our data science team and work in a dynamic and multi-cultural
Depending on your level of experience, you will lead your own projects and guide o
Job Description
You will help the team to improve upon current methods and models. With a practi
Your software development experience will allow you to closely collaborate with o
You will work in our headquarters in Antwerp (Belgium) or in Toronto (Canada)
Requirements
Job Description:
Working closely together with our machine learning experts, you will design, imple
Your analytical skills and your profound interest in machine learning will push you t
Your software development knowledge will allow you to work closely with the engi
Requirements
Desired Skills and Expertise:
You have experience in Python and its data analysis and machine learning stack
You have professional or academic experience in machine learning and data model
You can work independently and take matters into your own hands.
The ability to quickly learn new technologies and successfully implement them is es
Signal processing
Recommender system
Statistical analysis
Causal inference
Data Engineer
Job description
We are looking for the most ambitious and curious engineers in the field. You have
Requirements
You have an academic degree (BSc or MSc) in computer science or related field.
Experience programming in Java and Python.
Practical understanding of software engineering best practices.
Know your way around the Linux operating system.
Work experience with Docker containers.
Work experience on distributed computation frameworks (Kafka, Spark, …) and No
You are fluent in English.
You can work independently and take matters into your own hands.
The ability to quickly learn new technologies and successfully implement them is es
Bonus points
You have mastered the SciPy stack (Python, pandas, numpy, scikit-learn).
You know your way around source control and reproducibility tooling (git, conda, p
Experience with Deep Learning frameworks (TensorFlow, PyTorch), an
You will work with our team of engineers to design, implement and deploy end-to-
Your qualifications
You are skilled in Software Engineering with Python and have a strong grasp on sou
You know your way around web development frameworks (e.g. flask, d
You have a good overview of platforms and services that support production system
You master software design patterns and architecture design.
Experience with parallel computing libraries (dask, numba) and the SciPy stack (pan
Qualifications
Your Role Is To
Crack our business problems and come up with deployable machine learning mode
Interface with business to make sure they are asking the right questions
Sift through our data and find us some gems
Be a world-class hands-on deploy master
Charm Us With
An earned stripes in coding, in data handling, in statistics and in machine learning (
A good balance between theory and practice and a strong desire to learn and keep
Hands-on experiences and good understanding of various machine learning algorit
A good knowledge of what’s “Under the hood” of statistical methods
Coding, coding, coding (Python, Java, C++, Scala, R, …)
Some extra points on: SQL, NLP, image processing, recommendation system, busin
you should possess the creativity to invent and customize when necessary.
Analyze results and trends in order to assist in making appropriate recommendatio
Responsible for the development and insights of weekly, monthly, quarterly & ann
Collect performance data from internal and external data sources
Manage timeline for all digital reporting
Provide planning teams with key research and insights that help steer quality digita
Actively engage with industry partners to build relationships and grow understandi
Help to build key presentations to Client, Agency and Industry
Work with Mindshare Analytics team to evolve digital measurement process for bo
Qualifications
University degree in Information Systems, Statistics, Commerce or similar field or e
2+ years of relevant work experience
Strong attention to detail
Excellent analytical and problem solving skills
Ability to clean and dive into messy data to find insights
Intermediate knowledge of MS Office product suite
Intermediate to advanced experience with relational databases (i.e. SQL, mySQL)
Experience with media buying platforms, online advertising, SSPs, or DSPs is a plus
Intermediate experience with data visualization software (Tableau, Dato
Experience with open source programming languages (R, Python) is a plus
Experience with web analytics software (Google Analytics, Omniture, e
Expedia Career
About you:
You'll dive into groundbreaking machine learning models, experiment and apply ne
You're passionate about asking and answering questions in large datasets, and you
You have a keen desire to tackle problems and live to find patterns and insights wit
You propose analytics strategies and solutions that challenge and expand the think
You're looking for a role with diverse learning opportunities, growing and having fu
Be enthusiastic in collaborating, developing relationships within the company, and
Your experiences:
MSc or PhD degree in machine learning, or computer science/statistics/Physics wit
Knowledge of Python programming language
Good programming practices, ability to write readable, fast, object-oriented code
Expertise in machine learning: framing business problems as machine learning prob
Good understanding of supervised, unsupervised and reinforcement learning (plus
Experience with common data science toolkits, such as Scikit-learn, Spark ML, perfe
Very good understanding of data technologies; Hadoop, Spark, and standard relatio
Experience working in a fast-moving commercial environment and excellent organi
Ability to work collaboratively, as well as to manage workflow in accordance with p
Good communication and team management skills to technical and business audie
Strong business/commercial sense to combine with analytics to help drive recomm
Working knowledge of statistics: hypothesis testing, confidence intervals and A/B t
Responsibilities:
You own pricing and sort from end-to-end, including all aspects of analytics, test de
You develop consumer and pricing insights from sophisticated data-driven analysis
You deploy, test, and manage pricing and sorting models.
You work closely with Product Managers and Engineers to design and execute site-
You provide management with market and/or trend information, needed to make
You lead the development and evolution of analytical models establishing objective
You manage cross-functional ad hoc research requests.
Qualifications:
looking for a Lead Data Scientist role to deliver Machine Learning Acceleration Project as part of Marketing Transformation initiative.
xecute challenging projects that provide the technology and processes to deliver data science models in a fast-paced, agile environment. D
and solve deep business problems by developing predictive models and analysis that make the best use of the wealth of Customer, Netw
ucts/services
al field such as Computer Science, Management of Information Systems, Data Science, Machine Learning, Engineering, or other relevant te
erience in an Agile environment with excellent understanding of the underlying Statistical, Machine Learning theory and Predictive Model
edictive models using advanced ML/DL techniques. Proficiency in building Supervised learning models [ (classification and regression) - tree
uages (SAS, Python, R, Spark/Scala)
ngineering and validations
NN, RNN) and use of AI/Deep Learning frameworks like MXNet, Caffe 2, Tensorflow, Theano, CNTK, and Keras is an added advantage
andard analytical/ML modern ML platforms like (SAS Viya, Cloudera, SageMaker, Azure ML)
s and/or multiple projects simultaneously, including (but not limited to) building descriptive, predictive and prescriptive solutions and dep
multiple tasks simultaneously
cation skills in order to communicate strategic and technical ideas to internal audiences to both inform and solicit buy-in from the end use
Q+, SARSA, TD, Monte Carlo) is preferred
, and implement AIOps solutions at RBC Technology Infrastructure (TI). Leveraging leading edge technologies and various data sets, you w
sing advanced machine learning methods focusing on tangible outcomes
s collaborating proactively with various business and technical units to identify business opportunities and designing innovative solutions t
a (structured/non-structured)
ethods, and statistical analysis
erful representations of results
sions and actions by telling a convincing story
s and executives
oduction-scale solutions
em in the daily analytics exercises
documentation skills
models (Supervised/Unsupervised)
thods and packages
deploying machine learning models
essing techniques and Apache Spark, Hadoop ecosystem, NoSQL/SQL databases
, Kubernetes, Openshift, PCF
d systems integration
rs, we are Canada’s largest diversified natural resource company looking to embark on our next chapter.
e charge of a company-wide renewal of technology and infrastructure – a high-tech transformation of mining into the next generation.
ientist, RACE21 brings deep understanding of big data to the teams, and helps in building and enabling big data analytics solutions. They a
R statistical software
arning, predictive maintenance) preferred
ems, using different tools
n, coding and testing patterns as well as experience in engineering software platforms and largescale data
ng of different data structures
% of the time
s, Mathematics, Statistics, Physics, Computer Science, Data Science, Engineering or equivalent experience
Preferably Python)
ks (e.g. Numpy, Scikit-learn, Scipy, Pandas)
ntelligence/analytics platforms (Teradata, Microstrategy, Cognos)
vant and approachable manner with both technical and non-technical audiences.
e large amounts of data for analysis.
and frameworks
anguages (such as Java, Javascript, C/C++, Perl, etc.)
ntific inquiry.
scientists on a variety of projects. Use both your IQ and EQ to support yourself, your team, and your company in achieving more inside and
cognition/engagement data relates to customer data and business metrics.
ms to develop Machine Learning and Natural Language Processing algorithms and scripts that will be built into software products.
boundaries on what is known about the “how” and “why” of the ways people work.
to analyse data such as feature usage and adoption and uncover insights to guide future product investments or business directions.
s of your stakeholders; erring on the side of quick communication and deep collaboration; strong sense of integrity and respect toward yo
learning practitioner and python programmer with a background in software engineering and in at least one of the following fields:
n a dynamic and multi-cultural environment, among the brightest minds sourced from 25 different countries
hods and models. With a practical mindset, you will bring these models into a production environment.
u to closely collaborate with our Engineering team to improve our models and push them through our release process.
m) or in Toronto (Canada)
ce or a related field.
ne learning stack
achine learning will push you to continuously educate yourself to become a hands-on data scientist.
u to work closely with the engineering team to deliver robust and scalable solutions while implementing state-of-the-art algorithms and pr
thematical background
ngineers in the field. You have had at least 2 years of work experience and have a passion for building state-of-the-art innovative computin
rn. You’ll be a part of an international team brought together by a culture of technical excellence, grit and integrity. You’ll find our compe
numpy, scikit-learn).
ducibility tooling (git, conda, pip, docker).
TensorFlow, PyTorch), and parallel computing libraries (dask, numba) is a plus.
mplement and deploy end-to-end Machine Learning solutions for ourselves and our clients. You’ll have the opportunity to work on challen
tificial Intelligence.
ter Science or equivalent work experience.
n new things.
nd have a strong grasp on source control and reproducibility tooling (git, conda, pip, docker).
t frameworks (e.g. flask, django).
that support production systems and know their advantages and disadvantages (e.g. aws s3 vs aws MongoDB)
dels, experiment and apply new ones, and apply analytics at scale in order to impact the business. Use the latest cloud and data technolog
ons in large datasets, and you are able to communicate that passion.
o find patterns and insights within structured and unstructured data.
hallenge and expand the thinking of everyone around you.
unities, growing and having fun while at it. You’re expected to stay ahead of the latest data science industry developments and coach the
hips within the company, and finding new business applications of data science and coaching more junior team members.
orts to help improve how we i) collect and influence customer intent, ii) understand the relevant product options and iii) rank those option
ning, data mining and statistical modelling to design and implement mathematical models and algorithms to solve real-world applications
ence efforts – an area of major focus for the company
alysis on the website to find out the effectiveness of our efforts
technologies and techniques and identifying and advising how they can be utilized throughout the range of potential use cases.
such as Keras, Tensorflow or Spark MLlib. Experience with programming in Python or Scala
ding an ability to communicate across business areas
antitative discipline (Computer Science, Machine Learning, Operations Research, Applied Mathematics, Industrial Engineering, Statistics), P
n supply/demand dynamics, price elasticity, customer’s behavior, and competitive strategies is a plus.
on segmentation strategies and predictive modeling.
ands-on coding experience.
ze revenue strategy
ment to efficiently provide effective support to growing insurance portfolio and highly-relevant custom experiences while maximizing reve
duct, marketing, and customer perspective
eliver additional insights
ds in our portfolio
financial, and statistical analysis
ming business needs
ess problems and partner with internal clients using consultative approach
rt raw data into actionable insights
ative fields with background both machine learning (PhD preferred) and software development. 3+ years industry experience and hands o
kills in at least one low level language like C++/Java/C, and scripting languages like Python/R/Scala
st-paced, agile environment. Data Scientist will be required to innovate and establish as a subject matter expert and a thought leader.
he wealth of Customer, Network and Channel information available across Rogers business units, channels and care and marketing platfor
as is an added advantage
prescriptive solutions and deploying these solutions in the cloud and/or on premise systems
s and various data sets, you will apply machine learning and statistical modelling techniques to facilitate informed decision-making and bu
esigning innovative solutions to optimize processes and promote informed decision-making.
ata analytics solutions. They apply complex and most current modelling techniques to existing data sets in order to find optimization and
data generation, feature engineering, model building, and performance evaluation)
omputer systems, and network devices.
y in achieving more inside and outside the office.
o software products.
ts or business directions.
tegrity and respect toward your colleagues (and everyone) that you express with helpfulness – you’re a team-player
deep learning libraries such as PyTorch / Tensorflow / Keras – Bonus if you are proficient in Spark
sis, Hierarchical Bayes; and Learning techniques such as Decision Trees, Boosting, Random Forests, Deep Learning,
atest cloud and data technologies to train and deploy machine learning models at scale. If you can prove your approaches are good, they'll
mizing algorithms
Cloud a plus
tical to be able to keep multiple balls in the air.
technical depth to the audience
of their activities
and care and marketing platforms. Collaborate with business primes from Wireless and Residential business to develop range of predictive
oosting, Decision Trees, Extra Trees, Regularized Greedy Forests), Generalized Linear Models, Discriminant models (LDA, MDA, FDA and QD
ormed decision-making and business process optimization. Moreover, designing and implementing end-to-end machine learning products
rder to find optimization and or improvement opportunities relevant to the context of the product being developed.
m Forests, Deep Learning, Neural Networks
ts debugging and testing procedures, and help optimizing our machine learning models and decision logic.
elieve in great teamwork, you must be eager to learn and bring an energetic and creative approach to work. We are looking for someone l
tials: free coffee, nuts, fruits, a ping pong table in Antwerp, and often home baked goods. Better yet, expect an agile and flat structure, dy
r approaches are good, they'll be quickly deployed to production.
to develop a range of predictive models using advanced Machine Learning and AI techniques and make business recommendations to
odels (LDA, MDA, FDA and QDA), and Unsupervised learning models (Isolation Forest, Clustering algorithms)
Month-4
Time series
Feature Engineering + Feature Selection
ensemble project (Packt)
Johns Hopkins
Projects >>
Month-5
PGM Coursera + CMU
http://www.cs.cmu.edu/~epxing/Class/10708-14/lecture.html
https://www.cs.cmu.edu/~epxing/Class/10708-20/lectures.html
https://www.cs.cmu.edu/~epxing/Class/10708-19/lectures/
https://www.youtube.com/playlist?list=PLoZgVqqHOumTqxIhcdcpOAJOOimrRCGZn
Apart from the MOOC by Daphne Koller as mentioned by Shimaa, you can look at the following courses on PGMs:
1. Machine Learning and Probabilistic Graphical Models by Sargur Srihari from University at Buffalo. You can find the video lectures and slides at this link: http://www.cedar.buffalo.edu/~sr...
2. Probabilistic Graphical Models by Andreas Krause from Caltech. You can find the slides at this link: http://courses.cms.caltech.edu/c...
3. Probabilistic Graphical Models by Eric Xing from CMU. Slides at this link: http://www.cs.cmu.edu/~epxing/Cl...
4. Probabilistic Graphical Models by David Sontag from NYU. Slides at this link: http://cs.nyu.edu/~dsontag/cours...
Time series analysis (Penn State, College of Science)
https://online.stat.psu.edu/stat510/lesson/11/11.1
https://medium.com/auquan/time-series-analysis-for-finance-arch-garch-models-822f87f1d755
https://www.twirpx.com/file/2941628/grant/
https://www.kodges.ru/
https://download.csdn.net/download/weixin_39516246/10944826
https://litmy.ru/knigi/programming/
Bayesian Russia + Duke Bayesian Statistics (very, very important) + University of California, Santa Cruz (mcmc-bayesian-statistics) (Bayesian Statistics: From Concept to Data Analysis) + Improving your statistical inferences (Eindhoven University of Technology)
Top Courses :
Udacity
Coursera
Udemy
Courses | Advanced skills | YouTube channels
Stanford 2018 (2020) | Game theory (https://ozonm | Saptarsi Goswami
CMU | Discrete Optimization | Hisham hallag Valeo
UoT | Discrete Models | Baghdad Student
BC | Stochastic analysis | Hourani
Learn from Data | Information theory |
Cornewall | Optimization and applications | springboard india
Hisham | Algorithms | Abhishek Thakur
Coursera Washin | PGM | INSAID
Applied AI | Distributed computing | Super Data Science
H2O practical machine learning (Coursera) | Algorithmic game theory and mechanism design | 365 Data Science
Yandex Big Data for machine learning | Modern statistics | AI Engineering
https://courses.analyticsvidhya.com/ | Uncertainty optimization and risk modeling | Krish Naik
Udacity ML | | Edureka
Advanced ML Coursera (Russia, HSE University) | | Simplilearn
Michigan | | upGrad
Johns Hopkins | | Kevin Markham
IBM | | 3Blue1Brown
365 Data Science (very important) | | StatQuest
Alerta Optimizing | | sentdex
Google Cloud | | Great Learning
New York for finance ?
Edureka
edX Columbia
edX Harvard
Udemy
Udacity Courses:
Data Analyst
Data Scientist
ML Done
Deep Learning
Self-Driving Car
Time series Udemy 365 Data Science
https://ozonmasters.ru/ml
pluralsight very important Practical
Russia course: ML | Russia course: Big Data | RL
Introductory lecture | Course program | https://dzone.com/articles/5-best-reinforcement-learning
Course structure, reporting, lecturer, | HDFS (Hadoop Distributed File System) | Practical Reinforcement Learning (Russia Coursera)
| Basic principles of HDFS a | Reinforcement Learning in Finance + advanced (NYU) Cour
Keywords: | Namenode and datanode.
| The concept of a block.
Data Science | Replication and fault tolerance.
Statistics | The process of reading a file.
Artificial Intelligence | The process of writing a file.
Data Mining | Cluster topology and proximity concept.
Machine Learning | MapReduce
Big Data | Introduction to the MapReduce paradigm.
| MapReduce program using command line utilities.
Statement of the main tasks of machine | MapReduce in Hadoop. The concept of mapper and reducer.
Supervised learning (with marked | Data flow, data locality.
Target function | Computing optimization, combiner functions.
Object | Hadoop streaming in Python.
Label | Hive. Database Management System on Top of Hadoop
Classification | Hive architecture and comparison with traditional DBMS.
Forecasting | HiveQL query language.
Object space | Managed and external tables.
Feature space | Partitions and buckets.
Feature extraction | Storage formats.
Task visualization | Custom functions and UDFs.
Error functions | Hive streaming in Python.
Empirical risk | Introduction to Apache Spark
Training sample | Why do we need Spark? What is the problem of Apache Hadoop?
Learning Optimization Tasks | Spark components and a brief history of development.
Algorithm model | Spark and SparkContext architecture.
Algorithm | Introduction to RDD. Resilient distributed dataset.
Training | Two types of operations and lineage graph.
Generalization ability | Caching
The scheme for solving the problem of | Pair RDD, joins and aggregations.
How are tasks solved | Broadcast variables and accumulators.
Unsupervised learning / with unall | Spark SQL
Semi-supervised learning | The motivation for creating Spark SQL, recalling RDD.
Transductive learning | How to create a DataFrame?
Reinforcement learning | Why do we need a schema?
Structured prediction | Spark SQL overview. Projections and selections.
Active learning | Built-in functions.
Online learning | Joins and introduction to query plan analysis.
Transfer learning | Counting aggregates and statistics.
Multitask learning | Custom functions.
Feature learning | Working with time and window functions.
Machine learning issues | Spark program optimization
Examples of model problems | Program execution model.
| Shuffle, partitioning.
Mathematics in Machine Learning: A Br | Shuffle, serialization.
Occam's razor | Optimization of user functions.
No Free Lunch theorem | Catalyst query optimizer.
Soccer oracle | Optimization examples.
Tweaks Details | Join algorithms.
Defining distributions | Optimization of joins in Spark.
Mean and deviation | Spark ML. Classification and regression
Conditional density, marginalization an | Vector and matrix operations.
Point estimation | Distributed matrices and SVD computation.
Maximum likelihood estimation | ML pipeline, architecture and components.
Kullback-Leibler divergence | Overview of Kaggle toxic comments challenge.
Covariance and correlation | Building a baseline solution.
Density estimation | Working with imbalanced samples.
Histogram approach | Feature engineering in Spark ML.
Parzen window approach | Calculation of quality metrics and cross-validation.
Normal distribution | Spark ML. Clustering and ALS
Central Limit Theorem | K-means algorithm in Spark.
Information theory | K-means to improve Kaggle toxic comments challenge.
The curse of dimensionality | Topic modeling, LDA, further improvement of the Kaggle toxic comments challe
Singular value decomposition (SVD) | ALS as a least-squares method with hidden variables.
Matrix differentiation | Building a recommendations pipeline using ALS.
Optimization | Industrial Spark ML
Unconstrained optimization methods | Pipeline overview.
Zero-order methods | Estimators and transformers.
First-order methods | Custom estimator.
Second-order methods | Custom transformer.
Gradient descent | Scikit-learn integration in the Spark ML pipeline.
Steepest gradient descent | Integration of XGBoost in the Spark ML pipeline.
Stochastic gradient descent | Distributed hyperparameter search.
Training: batch, online, mini-batch | Cross-validation
Gradient descent in machine learning | Structured streaming
Stationary points | Introduction to streaming data processing.
Newton's method | Distributed fault-tolerant Apache Kafka data bus.
Quasi-Newton methods | Structured streaming in Apache Spark and data delivery semantics.
Constrained optimization | Structured streaming and Spark ML. We are building an antifraud pipeline for real-ti
| NoSQL and Apache Cassandra
Metric algorithms | Why NoSQL is needed, comparison with relational databases.
Metric algorithms (distance-based) | Introduction to Apache Cassandra.
Nearest centroid (Nearest centroid alg | The concept of a node.
Proximity-based approach | The concept of the ring.
kNN in the classification problem | Keys for clustering and partitioning.
kNN in the regression problem | Writing data.
Justification of 1NN | Reading data.
Lazy and eager algor | Gossip protocol.
Weighted generalizations of kNN | Data compaction.
Various metrics: Minkowski, Euclidean | Cassandra CQL query language.
Applications of the metric approach: fuzzy table matching, Lencore, in DL, classification of texts
Effective Nearest Neighbor Search Techniques
Nadaraya-Watson regression
Quality control and model selection
Quality control problem
Model Selection in the broadest sense
Sampling Rules
Hold-out validation (held-out data, hold-out set)
Cross-validation
Bootstrap
Out-of-time validation
Local validation
Learning curves
Parameter search
Linear methods
Linear regression
Generalized linear regression
Singular (degenerate) matrix problem
Regularization. The main types of regularization
Ridge Regression
LASSO (Least Absolute Shrinkage and Selection Operator)
Elastic net
Feature Selection
Error with weights
Robust regression
Linear scoring models in the binary classification problem
Logistic Regression
Probit Regression
Multiclass Logistic Regression
Linear Classifier
Perceptron
Bounding the error function by a smooth function
SVM
Nonlinear methods
Linearity problem
Polynomial model
Kernel methods (kernel trick)
Kernel examples
Usage in SVM
Usage in regression
Kernelization
Mathematics of kernels
RBF, RBF networks
Decision trees
Decision Trees (CART)
Predicates / Branches
Tree answers
Split criteria in classification problems: misclassification criterion, entropy, Gini
Stopping criteria when building trees
Overfitting problem for trees
Pruning (post-pruning)
Classic algorithms for constructing decision trees: ID3, C5.0
Feature importance
Missing values
Categorical features
Comparison: trees vs linear models
Ensembles
Algorithm ensembles: examples and rationale
Committees (voting) / averaging
Bagging
Encoding / Transcoding Responses, ECOC
Stacking and Blending
Boosting: AdaBoost, Forward stagewise additive modeling (FSAM)
Manual methods
Homogeneous ensembles
Random forest
Universal methods
Random forest
OOB (out of bag)
Setting Method Parameters
Regions of stability
Feature importance
Boruta
ACE
RF computed proximity
Extremely randomized trees
Gradient boosting
Gradient boosting over trees
Gradient boosting iteration
Steepest descent
Step-shortening heuristic: shrinkage
Stochastic gradient boosting
Advanced optimization techniques
Modern gradient boosting implementations
Built-in control methods
Gradient boosting options
Case: Scoring Task (TKS)
Calibration
Case: Predicting Answers to Questions
Bayesian approach
Bayes formula
The optimal solution to classification problems
Minimizing average risk
Naive Bayes
Bayesian Machine Learning Approach
Maximum likelihood method
Clustering
Clustering task, types of clustering
k-means (Lloyd's algorithm)
Generalizations of k-means
Clustering Model Problems
Affinity propagation: clustering by message passing between points
Mean shift: density mode detection
Hierarchical clustering
Linkage Types
Minimum Spanning Tree Clustering
Spectral clustering
DBSCAN
Birch
Cure
Generative Models
EM
Gaussian Mixture Model (GMM)
Unsupervised learning
UL tasks
Dimensionality reduction
PCA
Nonlinear dimensionality reduction
Kernel PCA
t-SNE
Noise reduction
Data generation
Anomaly detection
Outlier and novelty detection (anomaly detection).
Anomaly detection methods: statistical tests, model tests, iterative methods, metric methods, task substitution methods, m
Association rules
Basic terms
Apriori AP
Recommender Systems
Recommender systems
Collaborative filtering: GroupLens algorithm, SVD, SVD ++, timeSVD ++, adaptation of SVD for social connections
One-class recommendation
Factorization machine, factorization machine with fields (FFM - field- aware factorization machine)
Knowledge-based Recommendations
Estimates of the mean, probability, and density. Weight schemes
Definition of the average: arithmetic mean, median, mode, Kolmogorov mean (after A.N. Kolmogorov), Cauchy mean.
Weight schemes.
Case "forecasting visits of supermarket buyers and the amounts of their purchases": matrix of visits, estimation of the probability of visits by recounting, weighted schemes in assessing the probability of visits, direct method for ass
Density recovery, weighted nonparametric methods, predicting the amount of purchases with their help, solving the joint estimation problem.
History of data analysis and infographics: Joseph Priestley, William Playfair, Charles Joseph Minard, Florence Nightingale, Joh
Recommendations on choosing the scale of graphs and scales, explanatory text, color and style of images, presentation of
Visualization goals.
Descriptive statistics: average, characteristic elements, scatter of values, absolute variations, relative variations, moments,
An example of visualization of descriptive statistics. The study of the parts of the sample (folds), visualization of the importance of features, primary actions in the analysis of a feature.
Visualization of individual features: charts, histograms, distribution densities, selection of the number of bins, transformation
Visualization of categorical features: histograms, pie charts and areas, clarification of the nature of the feature.
Visualization of a pair of features: correlation, dependence of features, independence of features, typical values, outliers, clusters. Scatter chart. Using noise for visualization. Pivot tables, triangular dependencies.
Visualization “algorithm response” - “algorithm response”. Visualization “algorithm response” - “feature”. Deformati
ations), ensembles of algorithms.
eviation MSE, its derivatives: RMSE, coefficient of determination R2, probabilistic and non-probabilistic justification of RMSE, Huber function, Logcosh, generalization of MAE and RMSE, percentage error functions (SMAPE, MAPE, PMAD), errors based on comparison with the benchmark (MRAE, REL_MAE, PB), normalized errors (MASE), asymmetric errors, errors accurate to the threshold, the use of error functions to generate features.
“Confusion Matrix” (error matrix), Accuracy (MCE), type I and type II errors, recall (TPR), specificity (TNR), precision, FPR (False Positive Rate), F1 measure, Cohen's Kappa, Weighted kappa, M
Quality in binary classification problems with the answer in the form of probability, scoring errors: logistic error function Log Loss, MSE, Misclassification Loss, Exploss; Hinge loss; AUROC, GINI (Lorenz curve),
Quality in multiclass problems: Hamming Loss, cross-entropy, Mean Probability Rate, MSE, MAE, averaging, generalization of F-measures, balanced accuracy. Different types of averaging of quality: macro, micro
Quality in recommendation tasks: precision on the first n elements, average precision on the first n elements, MAP, Concordant - Discordant ratio, Mean Reciprocal Rank (MRR), Cumulative Gain, Discounted Cumulative G
Edit distance.
Quality in a task with target values - intervals: Jaccard coefficient, Szymkiewicz-Simpson coefficient, Braun-Blanquet coefficient, Sörensen coefficient, Kulczynski coefficient, Ochiai coefficient, inclusion
Ways to configure specific error functions. Construction of a split criterion for optimizing AUC ROC. Tasks with an interval attribute. Minimizing Root Mean Square Percentage Error (RMSPE) with deformations. Derivation of gradient descent formulas for basic error methods and functions.
Data preparation
Fundamental data properties.
Types of data.
Data preprocessing.
Data transformation: renaming features, objects, feature values, type conversion; coding of categorical variable values; sampling; normalization; smoothing; creation of features; aggregation; generalization; deformation of values.
Data integration.
Feature generation
Types of numerical features.
Contextual features.
Service features.
Data leakage.
Real-valued features.
Temporal features (characteristics of time points, interaction of a pair of features, use for other features, use to generate
Geographical (spatial) features: spatial variables (projections on different axes, clustering, identification, binding, characteristics of the neighborhood, analysis of trajectories, deanonymization of data, use of context and the study of strangeness, distance generation and use for other features).
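As a quick illustration of the confusion-matrix measures named in the quality section above (recall/TPR, specificity/TNR, precision, FPR, F1), here is a minimal Python sketch. The function name binary_metrics and the example counts are made up for illustration and are not taken from the course material.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common binary-classification measures from confusion-matrix counts.

    tp/fp/fn/tn are the counts of true positives, false positives,
    false negatives and true negatives (hypothetical inputs).
    """
    recall = tp / (tp + fn)            # TPR: share of actual positives found
    specificity = tn / (tn + fp)       # TNR: share of actual negatives found
    precision = tp / (tp + fp)         # share of predicted positives that are correct
    fpr = fp / (fp + tn)               # false positive rate = 1 - specificity
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return {
        "recall": recall,
        "specificity": specificity,
        "precision": precision,
        "fpr": fpr,
        "f1": f1,
        "accuracy": accuracy,
    }


# Illustrative counts only.
metrics = binary_metrics(tp=40, fp=10, fn=20, tn=30)
```

Note that FPR and specificity always sum to 1, which is a handy sanity check when computing these by hand.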
Advanced skills:
Game theory (https://ozo | Coursera + University + Book + YouTube
Discrete Optimization
Discrete Models
Stochastic analysis
Information theory
Optimization and applications 2
algorithms
PGM
distributed Computing
Algorithmic game theory and mechanism design
Modern statistics
Uncertainty optimization and risk modeling
created a roadmap for exploring machine learning in 10 days. Of course, you would want to dig deeper into each of the
Day 1:
Basic terminology:
Day 2:
Optimization basics:
a. Terminology & Basic concepts: Convex optimization, Lagrangian, Primal-dual problems, Gradients & subgradients, ℓ1ℓ
b. Algorithms: Batch gradient descent & stochastic gradient descent, Coordinate gradient descent.
c. Implementation: Write code for stochastic gradient descent for a simple objective function, tune the step size, and get an intuition of the algorithm.
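One possible way to do the Day 2 implementation exercise is sketched below: stochastic gradient descent on a one-dimensional least-squares objective, in plain Python. The function name sgd_least_squares, the toy data, and the default step size are assumptions chosen for illustration; the roadmap does not prescribe a particular objective.

```python
import random


def sgd_least_squares(xs, ys, step_size=0.5, epochs=200, seed=0):
    """Fit y ~ w*x + b by SGD on the per-sample loss 0.5 * (w*x + b - y)**2."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)               # visit samples in a random order each epoch
        for i in indices:
            error = (w * xs[i] + b) - ys[i]
            w -= step_size * error * xs[i]  # d(loss)/dw = error * x
            b -= step_size * error          # d(loss)/db = error
    return w, b


# Toy, noise-free data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 0.25, 0.5, 0.75, 1.0]
ys = [2.0 * x + 1.0 for x in xs]
w, b = sgd_least_squares(xs, ys)
```

To get the intuition the exercise asks for, try rerunning with different step_size values: much smaller steps converge slowly, while steps that are too large for the scale of x make the iterates diverge.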
Day 3:
Classification:
a. Logistic Regression
b. Support vector machines: Geometric intuition, primal-dual formulations, notion of support vectors, kernel trick, understanding of hyperparameters, grid search.
c. Online tool for SVM: Play with this online SVM tool (scroll down to “Graphic Interface”) to get some intuition of the al
Day 4:
Regression:
a. Ridge regression
Clustering:
Day 5:
Bayesian methods:
a. Basic terminology: Priors, posteriors, likelihood, maximum likelihood estimation and maximum-a-posteriori inference.
c. Latent Dirichlet Allocation: The generative model and basic idea of parameter estimation.
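To make the Day 5 terminology concrete, the sketch below contrasts maximum likelihood estimation with maximum-a-posteriori inference for a coin-flip (Bernoulli) model. The Beta(2, 2) prior, the function names, and the counts are assumptions chosen for illustration; any other prior would change the MAP value.

```python
def bernoulli_mle(heads, tosses):
    """Maximum likelihood estimate of the heads probability."""
    return heads / tosses


def bernoulli_map(heads, tosses, alpha=2.0, beta=2.0):
    """MAP estimate under a Beta(alpha, beta) prior.

    The posterior is Beta(alpha + heads, beta + tails); its mode is
    (heads + alpha - 1) / (tosses + alpha + beta - 2), which is valid
    when both posterior parameters are greater than 1.
    """
    return (heads + alpha - 1.0) / (tosses + alpha + beta - 2.0)


# Illustrative data: 7 heads in 10 tosses.
mle_estimate = bernoulli_mle(7, 10)
map_estimate = bernoulli_map(7, 10)
```

With these numbers the prior pulls the MAP estimate from 7/10 toward 1/2, giving 8/12; with a uniform Beta(1, 1) prior the two estimates coincide.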
Day 6:
Graphical models:
Days 7–8:
Neural Networks:
Day 9:
Miscellaneous topics:
a. Decision trees
b. Recommender systems
d. Multi-armed bandits
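For the multi-armed bandits item, a minimal epsilon-greedy sketch is given below. It is only one of several standard strategies; the function name epsilon_greedy, the Bernoulli reward model, and the arm probabilities are assumptions for illustration.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with an epsilon-greedy policy.

    true_means are the (hidden) success probabilities of each arm.
    Returns the pull counts and the running reward estimates per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random arm
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the sample mean for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return counts, estimates


# Three hypothetical arms; the last one is the best.
counts, estimates = epsilon_greedy([0.2, 0.5, 0.8])
```

Changing epsilon shows the exploration/exploitation trade-off: a larger value spends more pulls on weaker arms, while a value of zero can lock onto whichever arm looked good first.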
Day 10: (Budget day)
You can use the last day to catch up on anything left from previous days, or learn more about whatever
topic you found most interesting / useful for your future work.
of the topics listed to have a working knowledge of them: https://hackernoon.com/th
I think the best three books for doing research on machine learning:
https://blog.floydhub.com/
https://blog.floydhub.com/a-pirates-guide-to-accuracy-precision-recall-and-other-scores/
https://blog.floydhub.com/guide-to-hyperparameters-search-for-deep-learning-models/
Pang-Ning Tan, Michael Steinbach, Anuj Karpatne & Vipin Kumar. Introduction to Data Mining
Yurii Nesterov. Introductory Lectures on Convex Optimization: A Basic Course. Springer
1- action oriented 4
2-self development 4
3-customer focus 3.5
4-plan & aligns 3
5-communication effective 3.5
Canada :
Experience + 3G + 4G + 5G + Data Eng + ML + Big Data
P.E
Master
MBA
Start :
initiative 1-Hisham Asem
T.T 2- udacity
Ok 3-
Ok 4-
A.I Master
ML
Deep Learning
Time Series Analysis
IoT
BlockChain
Agenda of Meeting
strategic vision for 5G
5G innovations and vision
5G Large Scale Deployment in MENA
5G Core and network Slicing
RAN Evolution
Roadmap to mobile 5G
Technical difference between 4G and 5G Slicing
5G Business Models & Monetisation
5G IoT baseband software developer
C + C++ + Python
MWC Los Angeles Topics
Asia Singapore Topics
AI + ML + DL 3G + 4G
1-Helwan important Math Topics + Old man 3 Lecture Alex 3G ready
2-andro ML alex 4G ??
3-androw DL
4- DataCamp Time Series Analysis
5-udemy Time Series Analysis
6- Hisham asem
7-Seraj
8-LinkedIn Data Scientist
9-Mutaz
10-Husam Hourani
11-Big Data
12-Stanford & MIT ( machine learning )
Mit >> Math
Walid Yousef >> probability
ML Andro
ML >> Yaser abu Mustafa
Optimization and Algorithms
ML
Reporting
case studies
ML Hisham andrew
ML Udacity Udacity
ML Udemy Udemy
Applied Machine Learning (Washington)
Projects & Idea ( Optimization )
data manipulation
Web Scraping in Python
5G software Developer
C
C++
Data Structures Walid Yousef
Algorithms
Statistics
Data analysis
learn Data
datacamp-course-roadmap
p.com/courses/tech:python
p.com/tracks/data-analyst-with-python
Career tracks
Data Scientist Data Analyst
Introduction to Python Introduction to Data Science in Python
Intermediate Python for Data Science Intermediate Python for Data Science
Python Data Science Toolbox (Part 1) Python Data Science Toolbox (Part 1)
Python Data Science Toolbox (Part 2) Intro to SQL for Data Science
This course is part of the Applied Data Science with Python Specialization
Johns Hopkins University 10 Courses
Tableau
Fundamentals of Scalable Data Science (IBM)
BhmDQq7LnTz0
TensorFlow in details
g/specializations/big-data
g/specializations/big-data-engineering
Mit Udacity analyticsvidhya
Predictive Analytics for Business applied-machine-learning-beginner-to-prof
Deep Learning Become a Data Scientist natural-language-processing-nlp
Parallel Programming
Concurrent Programming
Distributed Programming
Android Developer
Ios Developer
IoT
5G
Information Retrieval
soft skills
writing email
MLlib
Spark/MLlib
Cassandra
MongoDB
ElasticSearch
Docker
Agile experience
CouchDB
cloud services (AWS, GCP, Azure)
REST APIs
Docker, Kubernetes
Spark in general and MLlib specifically (PySpark, Scala, SparkR)
ETL tool like SSIS
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
data automation and ETL tools ie: WhereScape, SSIS, Informatica
analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Agile software development and tools ie: JIRA, Azure DevOps,
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
LinkedIn Courses
BlockChain
Agile
Strong data profiling, cleaning, mining and technical documentation skills
causal inference
data analyst
AI
IBM Applied AI Coursera
NLP
Software Engineering
book Hands-On ML >> Repeat with Code + Summary Notes
Hisham Asem + Hands-On Again + Kaggle
Udacity
Udacity
data-camp
step by step Book
Chieh Wu + BayesGroup.ru
https://mlwhiz.com/blog
http://www2.stat.duke.edu/~rcs46/bayes18.html
AAILab Kaist Youtube Channel >> https://aai.kaist.ac.kr/xe2/
CMU
https://sites.google.com/site/cs228tspring2012/
Ahmed Fathi + Walid + Mit Gilberton + Visual Draw Youtube
youtube channel >> intrigano : https://www.youtube.com/watch?
first 3 chapter Entropy
Walid + ahmed bazi + stanford + NPTEL + AlROOMI Arabic
https://dzone.com/articles/xgboost-a-deep-dive-into-boosting?edition=590295&utm_source=Zone%20Newsletter&utm_m
walid yousef
Udacity
Udacity
Udacity
Udacity
Walid Yousef
re that can be accessed through a graphical user interface, standard terminal applications, or a Java API. ( Machine Learning without p
roviding a consistent set of verbs that help you solve the most common data manipulation challenges
Udemy
ch data and navigate the Elastic Stack so you can do anything from tracking query load to understanding the way requests flow throug
ata-driven,interactive data analytics and collaborative documents with SQL, Scala and more.
is, topic modeling
analytics solutions and the expertise you need to build a data-driven enterprise.
https://analyticsindiamag.com/10-popular-automl-tools-developers-can-use/
PySpark or SparkR
ent tools (Jupyter, RStudio), ideally on a platform such as IBM Watson Studio, Anaconda, Databricks or GCP)
https://www.coursera.org/specializations/improve-english
Advance Courses
https://www.youtube.com/watch?edufilter=NULL&list=PLyGDRqxp7CWmcw1igY3nF0qjuUXLE0jAi&v=KPDubbl_WWE
Hammad Books
Object-Oriented Analysis, Design and Implementation: An Integrated Approach
Microsoft SQL Server 2012 T-SQL Fundamentals
Domain-Driven Design Distilled
Berthold Vöcking
Foundations of Programming Languages
Introduction to Parallel Computing
Packtpub
Nando de Freitas Youtube Videos
Hand on ML ( Book ) + Again with Code in Details with all Topics step by step from interne
4 Courses 1- Discrete Math and Analyzing Social Graphs 2-Calculus and Optimization for Machine Learning 3- First Steps in
Ben1994 (Youtube Channel)
Probability - The Science of Uncertainty and Data (EDX)
khan-academy
http://nlp.chonbuk.ac.kr/
Coursera (Imperial College Youtube >> george soilis ) https://www.youtube.c
PCA >> https://www.youtube.com/watch?edufilter=NULL&list=PL2jykFOD1AWa-
Game Theory Stanford Youtube (Freemium Courses Channel )
Applied Optimization for Wireless- (IIT Kanpur July 2018)
Coursera very important Mathematics for Data Science Specialization ( 4 courses )
ng in Java Specialization
erminal applications, or a Java API. ( Machine Learning without programming
cking query load to understanding the way requests flow through your apps.
andro theory very important + walid + yasser (theory) + a intro to st (http://faculty.marshall.usc.edu/gareth-
ails with all Topics step by step from internet Coursera with also Youtube ( Andrw )
Optimization for Machine Learning 3- First Steps in Linear Algebra for Machine Learning 4-Probability Theory, Statistics and Explorato
Probability for Data Scientists Book MIT RES.6-012 Introduction to Probability, Spring
https://ocw.mit.edu/courses/electrical-engineering-and-c
(IIT Kanpur July 2018) Descriptive Statistics Using "R" Udacity (stat > Desc >> Infer )>> profe leanod >> s
7 Coursera San Diego Coursera Algorithm I & II Princeton University
business
Experience with developing business requirements, use cases and user stories in a data analytics context
Experience with data warehousing and business intelligence tools and platforms
Exposure to best practices in data management and data governance practices
Knowledge of big data technologies (Microsoft Azure, Google Cloud Platform, Cassandra, Spark, etc.) and advanced analytics (Azure Machine Learning, Goo
Bonus points:
Some experience with analyzing time-series data
Familiarity with analyzing big datasets using tools such as Apache Spark
Some experience with R and related machine learning libraries
Additional experience with deep learning frameworks such as TensorFlow and PyTorch
Experience with accessing data from other datastores in the AWS ecosystem such as ElasticSearch, S3, and DynamoDB
Exposure to working with software development teams and using tools such as Git, GitHub, and Zenhub
You’ve worked with different types of data infrastructures (e.g., data warehouses or data lakes)
Strong command of SQL and working with multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Bonus points:
Qualifications
Advanced degree in Computer Science or Computer Engineering with a focus on ML or NLP (PhD preferred)
Minimum of 3 years of applied experience in ML, deep learning and NLP
Have expert understanding of machine learning and NLP tasks such as classification, feature engineering, information extraction, structured prediction, sentiment analysis, Q/A, NER and topic modelling
Fully understand different neural networks (LSTM, CNN, RNN, seq2seq, BERT etc.), different word embedding models and transfer learning.
edureka!
Coursera Knowledge of uncertainty quantification metho
Operations Research
UDEMY Master the Coding Intervie
Coursera Stanford University Pluralsight
s (Azure Machine Learning, Google Cloud Machine Learning, SAS Grid, Splunk, etc.) (asset)
al Machine Learning
Lawrence Leemis
Plan : 30 Day
Month 4 Time series Udemy
Time series Coursera
Time series DataCamp
Machine Learning Washington Univ
Regression
Start DL Hisham + ahmed Fathi + with andro + Riad Almadani important Lectures
Month 6 / 4 - 5 Hour / Day DL Hisham with Andro
TensorFlow Coursera + Book
NN Design
Pytorch
Keras
Recommender System Udemy & Coursera
Geoffrey Hinton NN Coursera
Month 7 Reinforcement Learning
Udemy SVM Course
Ensemble Projects
Ensemble Bagging Xgboosting Stacking HandBook
Unsupervised Handbook
Packt Hand-On
ensemble analyticsvidhya path 2020
Month 8 DataCamp
Data Scientist Career Track
Machine Learning Career Track
SQL + R
all Projects
Hisham Projects >> Udemy projects 1 + 2 >> Udacity Projects >> DataCamp Projects >> analytic
Theano
Caffe, Caffe2
MXNet
H2O
Weka
CNTK
Chainer
Recommender System
deeplearning4j Deep Learning for Java
text mining techniques such as sentiment analysis, topic modeling
y: Deepchem
AutoML tools > Splunk , DataRobot , H2O , Rapid Miner , Big ML
Julia , Rust , Go
AWS infrastructure (SQS, SageMaker, Lambda)
tidyverse >> Visualization R-Programming
pth understand dplyr >> Grammar
Shiny >> build interactive web apps straight from R.
IBM Watson
GIS/spatial analysis; graph theory/network analysis
Git
ng Specialization Coursera
ure and algrithms
th ( Graph Theory )
ttern + Functional programming
2021
Month 3
Path:
>> Data analyst + Dat
Again to refresh your knowledge ML+DS San Diego
Month 4 Search Internet All Topics in Details Yandex
Yaser Abu-Mostafa Learning from Data Cloudera
A/B testing, Udacity
InterView Questions Udemy All
Udemy ML A-Z but R-programming Data Engineering, Big
Fast.ai (DL Projects )
Udemy Coursera
Month 5 Quora Question for interview
CMU ML University Lectures Theory ( from Syllabus)
DL + ML all Video + material ( Pluralsight )
DS + ML Interview Questions
Month 6
Month 9
Month 11
Month 12
2021
Group6:
Big Data:
Hadoop, Spark, PySpark or SparkR, MapReduce, MLlib
Databases: Postgres, Mongo, SQL, NoSQL
Cassandra, ElasticSearch, CouchDB
cloud data warehouses - ie: Snowflake, Azure Data Warehouse, etc.,
a variety of databases - ie: SQL, PostgreSQL, Azure SQL, Oracle,
data automation and ETL tools ie: WhereScape, SSIS, Informatica
analytical tools - ie: SSRS, Cognos, PowerBI, Tableau,
Google BigQuery
AWS ecosystem such as ElasticSearch, S3, and DynamoDB
multiple relational databases and data warehouses (Redshift, Snowflake, Postgres, MySQL, etc…)
Understanding of GCP/Azure Data Engineering Stack, Big Data Tools and Technologies ( HDFS, HBase, Spark, Kafka)
Frameworks: Spark, Airflow, DataBricks, ONNX, Kafka, Netty
Databases: MySQL, Snowflake, S3/Parquet
Data mining experience working with Relational, NoSQL and Graph databases
Knowledge of RPA automation tools such as UIPath or Blue Prism
Splunk
Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
Some of the popular DBMS include: MySQL, SQL Server, Oracle, IBM DB2, PostgreSQL and NoSQL databases (MongoDB, Couch
Group7:
Agile Scrum and/or kanban method
Agile software development and tools ie: JIRA, Azure DevOps,
Group8:
visualization tools (e.g., Power BI, Tableau, Shiny)
data visualization tools (Spotfire, Tableau, Qlik)
Experience in Tableau, Apache SuperSet, Looker or similar BI tools.
Experience in BI platform and data visualization tools (QlikView, Microstrategy, Tableau, Power BI, etc.)
Tableau, Power BI
Group9:
Docker, Kubernetes
Knowledge of management and productivity environments and tools (e.g., Jira, Git, Bitbucket, Jenkins)
Git, GitHub, and Zenhub
Knowledge of CI tools and processes (Jenkins, TeamCity, Bitbucket pipelines, etc);
Experience with container-type environment: Docker, Kubernetes, Openshift, PCF
Experience with Version Controlled data pipelines (Pachyderm)
programming
Java problem Solving
C/C++
BlockChain
Web Developer
JavaScript
QL and NoSQL databases (MongoDB, CouchDB, DynamoDB, HBase, Neo4j, Cassandra, Redis)
Jobs :: book LinkedIn Jobs need Canada remotely work
data analyst Real-World Machine Learning
fdsfsd procom Loay Egypt Team
Data scientist statistics + probability Python/R/Scala abdallah
ML (Hadoop/Spark
Big data Experience with AWS
ML frameworks/ libraries (e.g. Scikit-lear
Hadoop, Spark, Redis, HBase, Kafka, etc.
Signal processing
Geo-spatial data processing
Natural Language processing
Social network analysis
Recommender system
Adaptive experimentation techniques
Experience with data visualisation tools, such as D3.js, GGplot, Tableau, TIBCO Spotfire.
Proficiency in using query languages such as SQL, Hive, Pig
Experience with NoSQL databases, such as MongoDB, Cassandra, Hbase
personal projects, Kaggle, publications, or presentations at meetups/conferences
NLP frameworks/libraries e.g. Gensim, SpaCy is a plus
Caffe, Tensorflow, PyTorch
Experience with analytical tools supporting data analysis (eg. Tablea
AWS services including SageMaker
udemy :
3>Udacity
ML Michigan University
ML Washington University
ML IBM
ML pluralsight + advanced Python
ML Edureka
ML LinkedIn
ML mlcourse.ai. Lecture 0. Introduction
Edureka (https://www.edureka.co/masters-program/big-data-architect-training)
Udemy
Deep Learning
Hisham +andrew +ahmed Fathi
Mutiz saad
Geoffrey Hinton NN Coursera
Udacity + Udemy
Nvidia Courses
David Barber’s Bayesian Reasoning and Machine Learning
Cloud Computing
AWS
salesforce
Cloud Architect Master Program (Edureka)
OpenStack
Data Science Specialist with experience of Data Analysis and Machine Learning projects.
Solid knowledge of Pandas, Numpy, Scikit-learn, Spark, SQL, Tableau, and AWS
York University Big Data Analytic
WeCloudData + Brainstation
Adaptable data analysis skills with statistical business intelligence
Strong programming skills with Python and its toolkits (Pandas, NumPy, SciPy, etc.)
Proficiency in database management system: MySQL, NoSQL
Scalable data processing platforms: Hadoop, MapReduce, Spark, Hive
needed for Jobs
Skills : Done
Sklearn on-Going
Design hypothesis testing, backtesting, model validation, and data visualization systems
SageMaker
GitHub or Kaggle
git, mercurial, jenkins, travis, jira, asana, etc.
Source control experience, preferably GIT
Weka
Experience with popular DevOps tools such as (Puppet, Chef, Ansible, Vagrant, Docker, Jenkins, Maven, Se
Experience with high-performance Deep Learning frameworks such as TensorFlow, PyTorch, Theano, Caffe
Experience with popular machine vision frameworks in python and other languages such as Keras, Caffe, T
AWS, Azure, GCP, Cloudera, Databricks, Snowflake
using technologies like; git, mercurial, jenkins, travis, jira, asana, etc.
Published work in Data Science related journals or conferences such as ICML, NIPS, JML, KDD, and INFORM
project available in a public forum like GitHub or Kaggle
Contributing to influential Open Source Projects like sklearn, XGBoost, tidyverse, Tensorflow, pytorch, Kafk
Scikit-Learn, H2O, Keras, TensorFlow
(numpy, scipy, pandas, scikit-learn, tensorflow/keras/pytorch, etc.)
linear/logistics regression discriminant analysis, bagging, random forest, Bayesian model, SVM, neural netw
Hadoop, Spark, Kafka
relational SQL and NoSQL databases, including Postgres and Cassandra.
Data and Model pipeline and workflow management tools: Azkaban, Luigi, Airflow, Dataiku, etc.
stream-processing systems: Storm, Spark-Streaming,
Knowledge of VBA, SQL, MariaDB, and MCG IAM Web Harness software mandatory
statistics skills, such as distributions, statistical testing, regression, etc.
Experience with data visualization tools, such as Power BI, QlikView
Advanced pattern recognition and predictive modeling experience
Knowledge of one of the following: Business Objects, Tableau, Cognos, Looker, Power BI
Big Data technologies such as Hadoop, Kubernetes, Cassandra
Experience of data wrangling and data munging, using Big Data technologies
project management skills
a quantitative field
Agile environment >> Scrum
ETL processes
Hands-on experience with the following platforms/tools - Spark, Redshift / Postgres, AWS, Linux, Hive / Pres
e.g., S3, EC2, EMR, Redshift, DynamoDB, Kinesis, etc.
analyze A/B tests
Advanced knowledge of data visualization techniques and software tools (eg. R, Qliksense, Qlikview, HTML
Working knowledge of ETL tools (eg. Informatica, Ab Initio, Talend).
Experience with causal inference techniques, experimental design and/or A/B testing
Experience with creating visuals and dashboards in BI tools (e.g. Looker, Tableau, Power BI, Google Data S
visualization tools (e.g. Chartio, Looker, Tableau)
Experience building shallow or deep learning models (GBDT, CNN, RNN, LSTM), toolkits e.g. Matlab, RStudi
Experience constructing SQL queries (using Postgres or a similar platform)
visualization tools such as Tableau, PowerBI,
Experience working with microservice architectures/Docker/Containerization
e SQL, Hadoop, Spark, BigTable or DynamoDB
TensorFlow, Spark/MLlib or
Apply statistical inference to draw conclusions from data
Create essential performance metrics.
Experience utilizing both qualitative analysis (e.g., content analysis, phenomenology, hypothesis testing) an
Experience with distributed data processing systems (e.g. Spark, Redshift)
Big Data Platforms and tools (e.g. Cloudera, Hortonworks, MapR, Hadoop, Pig, Hive, etc.)
Impala, and SQL – for queries
NoSQL databases (ex. HBase, MongoDB,ArangoDB, Neo4J) – for behavioral analysis
Data formats – ex. JSON, flat files, Parquet, ORC files, Avro
Extract-Transform-Load (ETL) processes
Data visualization tools (ex. Tableau, Qlik, IPython, etc.)
Source control experience, preferably GIT
Weka
(i.e. Classification, ranking, segmentation, multivariate regression and/or pattern recognition techniques
Professional experience using unsupervised learning techniques - clustering, word embeddings, dimension
Kubernetes
HTTP, JSON, REST.
Postgres, Mongo, SQL, NoSQL.
Knowledge of uncertainty quantification methods (Bayesian methods, confidence intervals, probabilistic gra
Python (numpy, pandas, sklearn, xgboost, TensorFlow)
Design hypothesis testing, backtesting, model validation, and data visualization systems
algorithms and data structures
SageMaker
social network analysis methods
Understanding of statistics (e.g., hypothesis testing, regression, signal-to-noise ratio, confidence bounds)
Understanding of programmatic modeling (e.g., mathematical optimization, confidence
statistical software (ex. dplyr, Pandas)
Machine learning (supervised and unsupervised methods) and exploratory/statistical data analysis (such as
Domain experience in math and statistical methods such as hypothesis testing, confidence intervals, and va
graph algorithms and semantic Web
navy mission systems
Knowledge of Machine Learning concepts: e.g., cross-validation, regularization, boosting, bootstrapping, et
Experience with one or more Business Intelligence and visualization tools (Business Objects, Tableau, Cha
Working knowledge of one or more statistical analysis packages: R, SPSS, SAS, Numpy/Scipy, etc.
Working knowledge of one or more SQL languages: Oracle, MySQL, PostgreSQL, Redshift, etc.
https://www.udemy.com/course/tensorflow-2-practical-advanced/
DYMA Half rate Reinforcement learning with Tensorflow 2.0
sticsearch etc. BCCH https://rubikscode.net
https://github.com/vinhvu200/MazeAI
https://github.com/rabieifk/Prison_Break_Machine_Learning
https://arxiv.org/pdf/1810.09967.pdf
https://mc.ai/choosing-a-deep-reinforcement-learning-library/
https://modelzoo.co/category/reinforcement-learning
https://deeplizard.com/learn/video/FU-sNVew9ZA
https://blog.varunajayasiri.com/ml/dqn.html
https://www.youtube.com/channel/UC7ZVvEo7-B7lA6LY2MVX72A/pla
https://github.com/dennybritz/reinforcement-learning/tree/master/DQN
https://www.endtoend.ai/rl-weekly/
https://joshgreaves.com/reinforcement-learning/introduction-to-reinfor
tools (Business Objects, Tableau, ChartIO, JMP, etc.) is a plus
https://www.youtube.com/watch?v=Pka0DC_P17k
https://dzone.com/articles/trading-strategies-using-deep-reinforcement
University of Alberta
RNN, LSTM), toolkits e.g. Matlab, RStudio, Weka, MLLib and frameworks PyTorch, TensorFlow, CNTK
phenomenology, hypothesis testing) and quantitative analysis techniques (e.g., clustering, regression, pattern recogn
ratory/statistical data analysis (such as linear models, multivariate analysis, predictive modeling and stochastic mode
sis testing, confidence intervals, and various probability distributions
e, Vagrant, Docker, Jenkins, Maven, Selenium, or Jira), provisioning, infrastructure as code, and other DevOps concep
g, regression, statistical inference, collaborative filtering, and natural language processing, experimental design, socia
cikit-learn, numpy, pandas, jupyter, matplotlib, scipy, nltk, spacy, keras, tensorflow
orks such as TensorFlow and PyTorch, and toolkits such as Tensor2Tensor, Sockeye, and OpenNMT
tion and/or evaluation, hypothesis testing and experimental design
e ongoing insights.
ebook Insights, etc.
, Datalab, Redshift, etc
. Streaming Data Platform, Kafka, Logstash, Teradata, Hadoop, etc.), and data formats (e.g. SQL, NoSQL, AWS S3, JSO
ch as clustering, classification, regression, decision trees, neural nets, support vector machines, ensemble modeling and text min
g AMI’s, SageMaker
h as S3, EC2, EMR, SageMaker, ECS, Docker, Gitlab CI, Python packaging, command-line executions and shell scripting
Big Data and Data Engineering
Hadoop+Spark
Scala,
NoSQL databases and unstructured/semi-structured dat
https://maropost.breezy.hr/p/1e186b8808be
Scala and Spark for Big Data and Machine Learning (Ude
Spark and Python for Big Data with PySpark (Udemy)
Udacity
master Big Data
https://www.edureka.co/all-courses
arning: A Hands-on Tutorial in Python
vanZhou/Reinforcement-learning-with-tensorflow
993/reinforcement-learning-sutton
om/channel/UC7ZVvEo7-B7lA6LY2MVX72A/playlists
m/reinforcement-learning/introduction-to-reinforcement-learning/
les/trading-strategies-using-deep-reinforcement-learni
https://www.quora.com/As-a-fresher-should-I-learn-Ha
#1 Big Data Hadoop Certification Training-Edureka
#2 Big Data Specialization by UC San Diego-Coursera
#3 Big Data Architect-Simplilearn
#4 Become a Data Engineer – Udacity
#5 Big Data Hadoop Certification Training-Simplilearn
#6 The Ultimate Hands-On Hadoop – Tame your Big Dat
#7 Taming Big Data with MapReduce and Hadoop – Ha
#8 Taming Big Data with Apache Spark and Python – Hands On!-Udemy
#9 Learn Big Data: The Hadoop Ecosystem Masterclass-Udemy (https://www.udemy.com/course/hands-on-hadoop-mastercla
#10 Hadoop MAPREDUCE in Depth | A Real-Time course on Mapreduce-Udemy
11 Learn By Example: Hadoop, MapReduce for Big Data problems-Udemy
12-Yandex
13-Cloudera CCP Spark and Hadoop Developer certificat
14- Data Science Council of America (DASCA)
15-Big Data University
16-https://data-flair.training/
17-https://www.npntraining.com
18-Pluralsight
19-LinkedIn: Architecting Big Data Applications: Batch Mode Application Engineering
SQL
Managing Big Data with MySQL by Coursera
Beginner’s Guide to PostgreSQL by Udemy
High-Performance MySQL
Hive
Accessing Hadoop Data using Hive by Big Data University
Learning Apache Hadoop Ecosystem Hive by Udemy
Apache Hive Documentation
Pig
Apache Pig 101 by Big Data University
Programming Hadoop with Apache Pig by Udemy
Apache Storm
Amazon Kinesis Documentation
Amazon Kinesis Streams Developer Resources by Amazon Web Services
Apache Spark
Data Science and Engineering with Apache Spark by edX
Apache Spark Documentation
Book – Learning Spark
HDFS
Big Data and Hadoop Essentials by Udemy
Big Data Fundamentals by Big Data University
Hadoop Starter Kit by Udemy
Apache Hadoop Documentation
Cloud
Big Data Technology Fundamentals by Amazon Web Services
Big Data on AWS by Amazon Web Services
Apache Kafka
https://www.analyticsvidhya.com/blog/2017/03/big-data-learning-path-for-all-engineers-and-data-scientists-out-there/
The complete Apache Kafka course for beginners by Udemy
Learn Apache Kafka Basics and Advanced topic by Udemy
Apache Kafka Documentation
Book – Learning Apache Kafka
Apache Zookeeper
Apache Zookeeper Documentation
Book – Zookeeper
Impala
Scala
Functional Programming in Scala Specialization Coursera
Google
Cloudera
IBM
or machines, ensemble modeling and text mining techniques such as sentiment analysis, topic modeling and entity extraction
https://www.eduonix.com/learn-machine-learning-by-building-projects
https://www.eduonix.com/deep-learning-neural-networks-python-keras-for-dummies
Python Machine Learning Book
Pattern Recognition and Machine Learning Book + Walid Yousef
https://www.udemy.com/course/time-series-analysis-in-python/
Khan Academy Math
The Complete Python Course for Machine Learning Engineers (master python for ML Engineer ) Udemy
AWS Machine Learning: A Complete Guide With Python
https://aws.amazon.com/certification/certified-machine-learning-specialty/
fast.ai
Machine Learning EDX
Two Excellent Book Companions:Introduction to Statistical Learning, Hands-On Machine Learning with Scikit-Learn and Tensor
IBM
Washington University
Advanced Machine Learning Specialization — Coursera
LinkedIn Specialization Topics very important
AWS Machine Learning Certificate
Google ??
https://randlow.github.io/categories/cat_machine-learning/ ( important project ) Credit risk
https://www.edureka.co/masters-program
https://www.dezyre.com/article/top-10-machine-learning-projects-for-beginners/397
asterclass-Udemy (https://www.udemy.com/course/hands-on-hadoop-masterclass-tame-the-big-data/)
Pytorch
https://github.com/imhgchoi/pytorch-implementations
advance Topics :: Machine Learning from Youtube
https://www.youtube.com/watch?edufilter=NULL&list=PLqJm7Rc5-EXFv6RXaPZzzlzo93Hl0v91E&v=ZT8LszMo0D4
https://www.youtube.com/watch?edufilter=NULL&list=PLqJm7Rc5-EXFUOvoYCdKikfck8YeUCnl9&v=XLHB-Aktxw0
https://machinelearningmastery.com/start-here/ (jason Brownlee)
CBCSL teaching Youtube Channel explains math of machine learning
https://www.packtpub.com/data/machine-learning
Bhavesh Bhatt ( Youtube Channel ) very important
d tools is desirable.
orking analysis, feature engineering, etc.
dis, neo4j, etc.) for efficient ML feature extraction and data transformations.
chniques such as sentiment analysis, topic modeling and entity extraction
ML
Coursera 10 Course:
Udacity
Udemy 10 Course :
Edx 5 Courses :
Youtube Channel 20 Top
Free lessons: MATH ADVANCED - Multivariate Calculus For Machine Learning By Samuel J. Cooper Imperial College London
https://github.com/umer7/Deep-Learning
Neural Networks for Machine Learning by the University of Toronto (taught by Geoffrey Hinton) via Coursera
Creative Applications of Deep Learning with TensorFlow by Kadenze I + II + III (https://www.kadenze.com/courses/creative-ap
Reinforcement Learning Specialization (4 courses) Alberta Coursera
https://medium.com/free-code-camp/dive-into-deep-learning-with-these-23-online-courses-bf247d289cc0
http://course18.fast.ai/index.html
http://course18.fast.ai/part2.html
https://course.fast.ai/videos/?lesson=5
https://www.edx.org/professional-certificate/ibm-deep-learning
https://cs230.stanford.edu/
https://www.youtube.com/watch?fbclid=IwAR0HxtFrUjsVtUYl_q2VE_xQ8CN5H-k-5LHpYrQcZ-zCgQN1f7FNVoh_FBA&edufilter
Youtube Channel :: Graphical Models [Max Planck Institute for Intelligent Systems]
https://www.youtube.com/watch?fbclid=IwAR02Ix6gO-x4PpCfyH_ZbqodCuzEPyaM5IU44NJRhaL_pfC94DxwVt5z4lc&index=1&
https://blog.floydhub.com/best-deep-learning-courses-updated-for-2019/
Deep Learning For Coders by Jeremy Howard, Rachel Thomas, Sylvain Gugger - fast.ai
CS224n: Natural Language Processing with Deep Learning by Christopher Manning, Abigail See - Stanford
CS231n: Convolutional Neural Networks for Visual Recognition by Stanford
MIT Deep Learning by MIT
Computer Vision
interview Data scientist & Machine learning
very very important
Computer Vision Sp
Nvidia Courses https://github.com/vlgiitr/DL_Topics
http://nitin-panwar.github.io/Top-100-Data-science-interview-questions/?utm_campaign=News&utm_med
https://www.quora.com/profile/Prasoon-Goyal
https://www.coursera.org/specializations/tensorflow-in-practice
YrQcZ-zCgQN1f7FNVoh_FBA&edufilter=NULL&list=PLdxQ7SoCLQANQ9fQcJ0wnnTzkFsJHlWEj&v=Bn_jRbQcmV4
44NJRhaL_pfC94DxwVt5z4lc&index=1&edufilter=NULL&list=PLeyrCtKXMqvbafdWvljuORVu__CPlPEaB&v=ju1Grt2hdko
EDX
AnalyticsVidhya
Advantage Actor-Critic
nforcement Learning
on to Reinforcement Learning
ecision Processes
by Dynamic Programming
ee Prediction
ction Approximation
dient Methods
n and Exploitation
data-science/graduate-programs/master-data-science-and-artificial-intelligence/tuition
data-structures-and-algorithms-nanodegree
$1436
/4month
5 Courses
(https://github.com/AdityaGupta030697/Applied-Machine-Learning-Coursera) (https://github.com/villeristi/applied-machine-learning-in-
https://channel9.msdn.com/Tags/stephan-t-lavavej
C++: From Beginner to Expert – Udemy
Beginning C++ Programming – From Beginner to Beyond – Udemy
NPTEL
The C++ Programming Language, 4th Edition
b.com/villeristi/applied-machine-learning-in-python)
m/jiadaizhao/Advanced-Machine-Learning-Specialization)
HTML + CSS + XML + JSON
https://www.levels.fyi/?compare=Microsoft,Facebook&track=Software%20Engineer
https://hackernoon.com/deeplearning-101-coursera-vs-udemy-vs-udacity-b4eb3d
-vs-udemy-vs-udacity-b4eb3de06dbe
Time Plan
Building an Effective Machine Learning Workflow with scikit-learn By Data School 129$
https://www.analyticsvidhya.com/blog/2020/01/learning-path-data-scientist-machine-learning-2020/
SQL 100 Pages LinkedIn
https://www.analyticsvidhya.com/blog/2020/01/learning-path-nlp-2020/
Tensorflow2 Keras API ( Udemy )
RanJ Hindi >> Mathematical Concepts with example
Hisham ML
Hand-on Book ML
Month 1 -2020
probability Waleed https://end-to-end-machine-learning.teachable.com/cour
Stat Waleed statQuest
probability for DS Book jbstatistics.com
Brandon Foltz https://projects.iq.harvard.edu/stat110/youtube
linear algebra waleed
Month 2 -2020
linear algebra waleed
Linear algebra Gilbert
Linear Algebra from Ahmed Fathi
information theory ( entropy, cross entropy, KL ) aim:
STS Udacity des + infer 1-all everything about ML in parall
Johns Hopkins STS 2-Expanded my knowledge
statquest Youtube Channel Bayesian >> Gaussian >> MCMC >
MCMC + Bayesian + Gaussian 3-DL >> NLP
Discrete Math >> Graph theory 4-Big-Data + Data Engineering
PGM 5-IELTS
Linear algebra Ahmed Fathi All ML (curricula) Udacity
Month 3 -2020 stop all >> start ML Udacity
Udacity Machine Learning
Win Kaggle
Discrete Mathematics ( Graph Theory ) then >> bayesian >> ML (coursera download )
DS with algorithms Waleed + Coursera or Udacity IBM data science Coursera
SQL Query datacamp data scientist + ML
Calculus including maximizing and minimizing algebraic equati PGM
Introduction to Version Control git,github time-series
OOP Python + refactoring Hand-On all Topics with details
udemy > learn R
Month 4 -2020 NLP
Bayesian
Applied Michigan
ML > John Hopkins
appliedAI
Alberta Machine Intelligence Institute
Feature Engineering & Feature Selection ( check Correlation matrix Topic )
R-Programmin A-Z
Projects > udacity + udemy + hisham Kaggle + Analytics Vidhya 27 Projects + Udemy Kaggle master
Time Series
RL
DL Hisham + Ahmed Fathi >> Coursera Andrew + Russia IBM AI Engineering Professional Certificate
Geoffrey Hinton
TensorFlow + Keras + PyTorch
NLP >> Udacity + Coursera
PGM + Optimization
Advanced Python deep dive + Django + Flask + web scraping + Design Patterns + Data Structures and Algorithms
ine-learning.teachable.com/courses/000-fou
Gaussian
Bayesian
Frequentist sta
A/B test
Markov Models
>> Linear Algebra
>> Discrete Math >> Graph theory
Probability Graphical models
then ML Theory Topics in details
DL >> RL
1-all everything about ML in parallel (DS and algorithm) Plan-2021 ( IELTS Exam ) + ( Canada P.E ) + ( Coursera Certificate )
2-Expanded my knowledge 1+2+3-Month Big Data Course Edureka >> Coursera >> >>
Bayesian >> Gaussian >> MCMC >> PGM >> University Lectures 4 BlockChain Course + Quantum Computing
5 Software Engineering Skill ( Hammad )
4-Big-Data + Data Engineering
AICourse
DRL AI
Plan-2020
Book ML A Probabilistic Perspective + element
all Topics of Lectures syllabus
Plan for ML Practical
1+2 Prob + STS + Linear
maybe 3 for PGM
4+5 ML revision Hands-on with Topics in details from all
6+7+8 DL Hisham with Andrew >> Udacity
Book Ian + illustrated + TensorFlow + PyTorch + Keras
9 Fully Reinforcement Learning
10 NLP + (R +SAS + SPSS)
11+12 ?? ML Time-Series Analysis
11+12 ?? Projects ************
OOP + DS with algorithms + refactoring Python
cloud services (AWS, GCP, Azure) + Kubernetes + Docker
Plan 2020-2021
ML Coursera
Deep Coursera
Alberta RL Coursera
Data structure and algorithm certificate
PGM Coursera
IBM data science Coursera + IBM AI Coursera
GCP Coursera
ML Udacity
DL Udacity
Self driving Car Udacity
practical Time series Certificate Coursera
Big Data San Diego Coursera
course “Strategic Thinking”! #strategicthinking #leadership
GCP Google
advance Machine learning Coursera
RF
2G Full
3G
4G VoLTE
5G
Peace is Every Step: The Path of Mindfulness in Everyday Life, Thich Nhat Hanh
Thinking, Fast and Slow , Daniel Kahneman
Make Time: How to Focus on What Matters Every Day , Jake Knapp, John Zeratsky
Essentialism: The Disciplined Pursuit of Less , Greg McKeown
Subliminal: How Your Unconscious Mind Rules Your Behavior , Leonard Mlodinow
Deep Work: Rules for Focused Success in a Distracted World , Cal Newport
Team Leader Book
2019 Data Science Bowl
Uncover the factors to help measure how young children learn
https://www.kaggle.com/c/data-science-bowl-2019/discussion/127469
Topics : Instructor: Sargur Srihari, Department of Computer Sc
1. Introduction
1. Machine Learning-Overview(28MB)
2. Python and ML Frameworks(13.9MB) Code
3. Linear Algebra(4.5MB) Code
4. Example: Curve Fitting(934KB)
5. Probability Theory(4.9MB) Code
6. Numerical Computation(1.4MB)
7. Decision-Theory(488KB)
8. Information Theory(715KB)
2. Probability Distributions
1. Discrete Distributions(1MB)Code
1. Biology(4.5MB)
2. Feed-forward Network Functions(669KB)
3. Network Training(2.6MB)
4. Backpropagation(8.7MB)
5. The Hessian Matrix(562KB)
6. Regularization in Neural Networks(1.2MB)
1. Convolutional Networks(4.9MB)
2. Soft Weight Sharing(1.2MB)
7. Mixture Density Networks (634KB)
8. Bayesian Neural Networks(716KB)
9. Deep Learning Overview(5.2MB)
10. See course on Deep Learning
6. Kernel Methods
1. Kernel Methods(6.3MB)
2. Radial Basis Function Networks(812KB)
3. Gaussian Processes(6.8MB)
7. Sparse Kernel Machines
1. Support Vector Machines(5.4MB)
2. SVM for Overlapping Distributions(1.3MB)
3. Multiclass SVMs (1.4MB)
4. Relation to Logistic Regression (446KB)
8. Probabilistic Graphical Models
1. Approximate Inference(180KB)
2. Variational Inference(3.3MB)
3. Variational Mixture of Gaussians(1MB)
11. Sampling Methods
1. Need for Sampling (6.6MB)
2. Basic Sampling Methods(2.5MB)
3. Markov Chain Monte Carlo Sampling(815KB)
4. Gibbs Sampling(1.2MB)
12. Continuous Latent Variables
1. Principal Components Analysis
See Section 3.2 of course on Data Mining
2. Nonlinear Latent Variable Models
Basic concepts
Linear regression
Linear algebra, Ridge regression
Logistic regression
MVN, LDA/QDA
Naive Bayes; Beta-Binomial model
Bayesian concept learning; Beta-Binomial; Dirichlet-Multinomial
Bayesian parameter estimation for Gaussians, generative classifiers, linear and logistic regression
Decision theory ; model selection
Midterm
Feature selection
L1 regularization
Machine Learning-Overview(28MB)
Python and ML Frameworks(13.9MB) Code
Linear Algebra(4.5MB) Code
Example: Curve Fitting(934KB)
Information Theory(715KB)
Probability Distributions
Discrete Distributions(1MB)Code
Gaussian Distribution(833KB) Code
Gaussian Bayesian Networks(738KB)
Neural Networks
Biology(4.5MB)
Feed-forward Network Functions(669KB)
Network Training(2.6MB)
Backpropagation(8.7MB)
The Hessian Matrix(562KB)
Regularization in Neural Networks(1.2MB)
Convolutional Networks(4.9MB)
Soft Weight Sharing(1.2MB)
Mixture Density Networks (634KB)
Bayesian Neural Networks(716KB)
Deep Learning Overview(5.2MB)
See course on Deep Learning
Kernel Methods
Kernel Methods(6.3MB)
Radial Basis Function Networks(812KB)
Gaussian Processes(6.8MB)
Sparse Kernel Machines
Support Vector Machines(5.4MB)
SVM for Overlapping Distributions(1.3MB)
Multiclass SVMs (1.4MB)
Relation to Logistic Regression (446KB)
K-means Clustering(1.4MB)Code
Gaussian Mixture Models(1.5MB) Code
Latent Variable View of EM(1.1MB)
Bernoulli Mixture Models(3.1MB)
Probabilistic Classifiers (Probability Slides, Notes on Probability) Overfitting and complexity; training, validation, tes
Non-Parametric Models to Matlab (II)
Ensemble Methods Week 3 Classification problems; decision boundarie
Neural Networks
More Neural Networks
Even More Neural Networks
Convolutional Neural Networks
More CNNs, Boosting
Part 2: Data Science 573 and 575
The second set of notes are from courses I've taught in UBC's Master of Data Science (MDS) program in 2017 and 2018, which
Structure Learning
Sequence Mining
Semi-Supervised Learning
PageRank
Markov Chains and Monte Carlo
Part 3: Computer Science 540
The third set of notes is from the January-April 2019 offering CPSC 540, a graduate-level course on machine learning. Related r
Videos covering the first month of material in the 2016 offering are available here. Note that the material has gone through so
A. Fundamentals
340 Overview
Fundamentals of Learning
Convex Optimization (Notes on Norms)
B. Large-Scale Machine Learning
Gradient Descent Convergence
Coordinate Optimization
Stochastic Subgradient
SGD Convergence Rate
Stochastic Average Gradient
DAG Models
More DAGs
Undirected Graphical Models
Approximate Inference
Log-Linear Models
Boltzmann Machines
E. Discriminative Models
Conditional Random Fields
Structured SVMs
Deep Structured Models
Fully-Convolutional Networks
Recurrent Neural Networks
Long Short Term Memory
F. Bayesian Learning
Bayesian Statistics
Empirical Bayes
Hierarchical Bayes
Topic Models
More Approximate Inference
Non-Parametric Bayes
VAEs and GANs
Part 4: Machine Learning Reading Group
The final set of notes are topics that I have not covered in a formal course, but where I've given overviews in our machine lear
Parallel and Distributed Machine Learning
Online, Active, and Causal Learning
Reinforcement Learning
Overview of Other Large/Notable Topics
Syllabus
ass organization, topics overview, software etc.
Problems, data, and tools; Visualization; Matlab
SSE; gradient descent; closed form; normal equations;
; training, validation, test data, and introduction
lems; decision boundaries; nearest neighbor methods
Topics covered include: Algorithmic models of learning. Learning classifiers, functions, relations, grammars, probabilistic models, value functions, behaviors and programs from experience. Bayesian, maximum a posteriori, and minimum description length frameworks. Parameter estimation, sufficient statistics, decision trees, neural networks, support vector machines, Bayesian networks, bag of words classifiers, N-gram models; Markov and Hidden Markov models, probabilistic relational models, association rules, nearest neighbor classifiers, locally weighted regression, ensemble classifiers. Computational learning theory, mistake bound analysis, sample complexity analysis, VC dimension, Occam learning, accuracy and confidence boosting. Dimensionality reduction, feature selection and visualization. Clustering, mixture models, k-means clustering, hierarchical clustering, distributional clustering. Reinforcement learning; Learning from heterogeneous, distributed, data and knowledge. Selected applications in data mining, automated knowledge acquisition, pattern recognition, program synthesis, text and language processing, internet-based information systems, human-computer interaction, semantic web, and bioinformatics and computational biology.
or Mid-term
achine learning. Related readings and assignments are available from the course homepage. This course is intended as a continuation on C
erial has gone through some substantial improvement since then.
iews in our machine learning reading group.
tended as a continuation on CPSC 340 and the notation in this course is almost the same, except that we switch to using superscripts to re
Self Notes on ML and Stats. Course Material
Machine learning and statistics tie into many different fields, in
Contents
Multivariate Adaptive Regression Spl SVM derivation: convex optimization, Hilbert spaces, repr
AUC
Precision
Recall
Specificity
Mean absolute percentage error
Root mean square error
Algorithms
Linear regression: Usually performed through OLS
Logistic regression
Naive Bayes
K-Nearest Neighbors
K means clustering
Classification and regression trees(CARTs)
Support vector machines
AdaBoost
Random forest
ARIMA
Decision Trees
ID3
CHAID
C4.5, C5.0
Hierarchical Clustering
Miscellaneous
Curse of dimensionality
No free lunch theorem
Occam's Razor
Deep Learning
Neural Networks
Bayesian neural nets
Deep Boltzmann Machine(DBM)
Gradient descent is an optimization algorithm used to find the values of parameters (coefficients) of a fu
In this variation of gradient descent (stochastic gradient descent), the update to the coefficients is performed for each training instance, rather than at the end of the batch of instances.
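The two update schedules can be compared side by side. This is a minimal sketch, assuming a one-input linear model y = w*x + b with squared error; the function names, learning rates, epoch counts, and toy data are illustrative assumptions and do not come from the notes.

```python
import random

def batch_gd(xs, ys, lr=0.05, epochs=500):
    """Batch gradient descent: one coefficient update per full pass."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Average the squared-error gradient over every training instance
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def sgd(xs, ys, lr=0.01, epochs=500, seed=0):
    """Stochastic gradient descent: one update per training instance."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    order = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(order)  # visit instances in a fresh random order
        for i in order:
            err = w * xs[i] + b - ys[i]
            w -= lr * 2 * err * xs[i]
            b -= lr * 2 * err
    return w, b

xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]  # generated from y = 2x + 1, no noise
w_batch, b_batch = batch_gd(xs, ys)
w_sgd, b_sgd = sgd(xs, ys)
```

On this noise-free data both schedules should approach w = 2 and b = 1; with noisy data the stochastic version would typically keep jittering around the optimum unless the learning rate is decayed.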
3 a)Bootstrapping
In the context of machine learning with bootstrapping, we’re drawing random samples from another sam
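One way to read the bootstrapping note is as resampling with replacement until every class has the same number of samples. The sketch below follows that reading; the function name `balanced_bootstrap`, the `per_class` parameter, and the toy data are hypothetical.

```python
import random

def balanced_bootstrap(samples, labels, per_class, seed=0):
    """Draw `per_class` samples with replacement from each class."""
    rng = random.Random(seed)
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)
    new_samples, new_labels = [], []
    for label, members in by_class.items():
        # random.choices samples with replacement, so a small class
        # can still contribute `per_class` draws
        new_samples.extend(rng.choices(members, k=per_class))
        new_labels.extend([label] * per_class)
    return new_samples, new_labels

samples = [10, 11, 12, 13, 14, 15, 16, 17, 90, 91]
labels = ["neg"] * 8 + ["pos"] * 2          # highly unbalanced classes
boot_x, boot_y = balanced_bootstrap(samples, labels, per_class=5)
```

The minority class is oversampled here, so duplicates of its members are expected in the result.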
3 b) Cross Validation
Cross-validation (also called rotation estimation) is a method to estimate how well a model generalizes o
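The rotation of splits can be sketched in plain Python. This is an illustrative version of k-fold cross-validation under simple assumptions: folds are contiguous index ranges, and the "model" is a stand-in that just predicts the mean of the training targets. The function names are hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs; each fold is held out once."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

def cross_val_mse(ys, k):
    """Average held-out squared error of a predict-the-training-mean model."""
    fold_errors = []
    for train_idx, test_idx in k_fold_indices(len(ys), k):
        mean_y = sum(ys[i] for i in train_idx) / len(train_idx)
        mse = sum((ys[i] - mean_y) ** 2 for i in test_idx) / len(test_idx)
        fold_errors.append(mse)
    return sum(fold_errors) / k

folds = list(k_fold_indices(10, 3))
score = cross_val_mse([1.0, 1.0, 1.0, 1.0], k=2)
```

In practice the data would usually be shuffled before splitting; that step is left out here to keep the fold boundaries easy to inspect.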
4 ) LDA
Parametric, linear, Classification algorithm used to classify more than two categories
If you have more than two classes, then the Linear Discriminant Analysis is the preferred linear classifica
22 a) Linear Regression
Different techniques can be used to prepare or train the linear regression equation from data, the most
OLS:
When we have more than one input we can use Ordinary Least Squares to estimate the values of the coe
Gradient Descent:
When there are one or more inputs you can use a process of optimizing the values of the coefficients by
There are extensions of the training of the linear model called regularization methods. These seek to bot
Lasso Regression: where Ordinary Least Squares is modified to also minimize the sum of the absolute values of the coefficients (called L1 regularization).
Ridge Regression: where Ordinary Least Squares is modified to also minimize the sum of the squared coefficients (called L2 regularization).
Remove Noise
Remove Collinearity
Gaussian Distributions. Linear regression will make more reliable predictions if your input and output var
Rescale Inputs: Linear regression will often make more reliable predictions if you rescale input variables
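For a single input, the OLS, Lasso, and Ridge estimators named in this section all have closed forms, which makes the effect of each penalty easy to see. The sketch below assumes the objective sum((y - w*x - b)^2) + l1*|w| + l2*w^2 with an unpenalized intercept; `fit_line` and its parameters are illustrative names, not part of the notes.

```python
def fit_line(xs, ys, l1=0.0, l2=0.0):
    """Fit y = w*x + b by least squares with optional L1/L2 penalty on w."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    # L1 soft-thresholds the numerator; L2 inflates the denominator
    shrunk = max(abs(sxy) - l1 / 2, 0.0)
    w = (shrunk if sxy >= 0 else -shrunk) / (sxx + l2)
    b = my - w * mx
    return w, b

xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.0]
w_ols, b_ols = fit_line(xs, ys)              # no penalty: ordinary least squares
w_ridge, _ = fit_line(xs, ys, l2=10.0)       # slope shrinks toward zero
w_lasso, _ = fit_line(xs, ys, l1=1000.0)     # large L1 penalty zeroes the slope
```

The Ridge slope is smaller than the OLS slope but stays non-zero, while a sufficiently large Lasso penalty drives the slope exactly to zero.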
22 b) Logistic regression
It is a linear, parametric classification algorithm. One vs Rest logistic regression can be used to classify m
Logistic regression is named for the function used at the core of the method, the logistic function. The lo
Logistic regression is a linear method, but the predictions are transformed using the logistic function
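The transformation can be shown directly: a linear combination of the inputs is passed through the logistic (sigmoid) function to give a value between 0 and 1. The coefficients below are made-up numbers for illustration only.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real number into the range 0 to 1."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)  # this branch avoids overflow for large negative z
    return ez / (1.0 + ez)

def predict_proba(x, coef, intercept):
    """Linear score of the inputs, squashed by the logistic function."""
    z = intercept + sum(c * xi for c, xi in zip(coef, x))
    return sigmoid(z)

# Hypothetical coefficients: z = 0.5*2.0 + 1.0*(-1.0) = 0, so p = 0.5
p = predict_proba([2.0, -1.0], coef=[0.5, 1.0], intercept=0.0)
```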
22 c) Naive bayes
Classification algorithm
It is called naive Bayes or idiot Bayes because the calculation of the probabilities for each hypothesis are
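The simplification amounts to multiplying per-attribute probabilities instead of estimating the joint P(d1, d2, d3|h). A small sketch, assuming the priors and conditional probabilities are already known; every number and name here is invented for illustration.

```python
def naive_bayes_posterior(priors, likelihoods, observation):
    """Posterior over hypotheses, treating attributes as independent given h."""
    scores = {}
    for h, prior in priors.items():
        score = prior
        for attr, value in observation.items():
            # P(d1, d2, ... | h) is approximated by the product of P(di | h)
            score *= likelihoods[h][attr][value]
        scores[h] = score
    total = sum(scores.values())
    return {h: s / total for h, s in scores.items()}

priors = {"play": 0.6, "stay": 0.4}
likelihoods = {
    "play": {"weather": {"sunny": 0.8, "rainy": 0.2},
             "wind": {"weak": 0.7, "strong": 0.3}},
    "stay": {"weather": {"sunny": 0.3, "rainy": 0.7},
             "wind": {"weak": 0.4, "strong": 0.6}},
}
posterior = naive_bayes_posterior(
    priors, likelihoods, {"weather": "sunny", "wind": "weak"}
)
```

Here the unnormalized scores are 0.6 * 0.8 * 0.7 = 0.336 for "play" and 0.4 * 0.3 * 0.4 = 0.048 for "stay", giving a posterior of 0.875 for "play".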
22 d )KNN
When KNN is used for regression problems, the prediction is based on the mean or the median of the K most similar instances.
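A minimal sketch of KNN regression, assuming Euclidean distance as the similarity measure (the notes do not name one); the function name and toy data are illustrative.

```python
from statistics import mean, median

def knn_regress(train_x, train_y, query, k=3, use_median=False):
    """Predict a numeric target from the k nearest training instances."""
    # Euclidean distance from the query to every training instance
    scored = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, query)) ** 0.5, y)
        for x, y in zip(train_x, train_y)
    )
    neighbours = [y for _, y in scored[:k]]
    return median(neighbours) if use_median else mean(neighbours)

train_x = [[1.0], [2.0], [3.0], [10.0]]
train_y = [1.0, 2.0, 6.0, 50.0]
pred_mean = knn_regress(train_x, train_y, [2.0], k=3)
pred_median = knn_regress(train_x, train_y, [2.0], k=3, use_median=True)
```

The three nearest targets are 2.0, 1.0, and 6.0, so the mean prediction is 3.0 and the median prediction is 2.0; the far-away point at x = 10 does not influence either.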
22 f) CART
It is a nonlinear algorithm
For classification the Gini cost function is used which provides an indication of how pure the leaf nodes a
The most common stopping procedure is to use a minimum count on the number of training instances a
You can use pruning after learning your tree to further lift performance. The complexity of a decision tre
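The Gini cost used to score a candidate split can be written in a few lines. This sketch uses the standard definition, 1 minus the sum of squared class proportions, weighted by the size of each child node; the function names are illustrative.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of one node: 0.0 means the node is perfectly pure."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def split_gini(left, right):
    """Size-weighted Gini impurity of a two-way split."""
    n = len(left) + len(right)
    return len(left) / n * gini(left) + len(right) / n * gini(right)

pure_split = split_gini(["a", "a"], ["b", "b"])    # each side holds one class
mixed_split = split_gini(["a", "b"], ["a", "b"])   # each side is a 50/50 mix
```

A tree learner would evaluate this cost for each candidate split and keep the one with the lowest value.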
22 g) SVM
Non parametric
support vector machine is a generalization of a simple and intuitive classifier called the maximal margin c
support vector machine, which is a further extension of the support vector classifier in order to accomm
A hyperplane is a subspace whose dimension is one less than that of its ambient space
The maximal margin hyperplane (also known as the optimal separating hyperplane) is the separating hyperplane that is farthest from the training observations.
22 k) Decision Trees
There are many good ways to decide the variable which should be used for splitting. Below are a few:
Q and A
Yes, the rotation is necessary because it maximizes the differences between the variance captured by th
It is a straight effect. If the components are not rotated, then it will diminish eventually, and one must us
It assumes that all the features in the data set are important, equal and independent.
4)What is the difference between stochastic gradient descent (SGD) and gradient descent (GD)?
Both algorithms are methods for finding a set of parameters that minimize a loss function by evaluating
In standard gradient descent, you'll evaluate all training samples for each set of parameters. This is akin
In stochastic gradient descent, you'll evaluate only 1 training sample for the set of parameters before up
The Box-Cox transformation is a generalized "power transformation" that transforms data to make the d
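The power transformation itself is short enough to write out. This sketch uses the standard one-parameter Box-Cox formula for positive data; choosing the parameter `lam` (normally done by maximum likelihood) is outside its scope, and the function name is illustrative.

```python
import math

def box_cox(values, lam):
    """One-parameter Box-Cox transform; requires strictly positive inputs."""
    if any(v <= 0 for v in values):
        raise ValueError("Box-Cox is only defined for positive values")
    if lam == 0:
        # The limit of (v**lam - 1) / lam as lam -> 0 is the natural log
        return [math.log(v) for v in values]
    return [(v ** lam - 1) / lam for v in values]

sqrt_like = box_cox([1.0, 4.0, 9.0], lam=0.5)   # square-root style transform
log_like = box_cox([1.0, 10.0, 100.0], lam=0)   # log transform
```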
If the training set is small, high bias / low variance models (e.g. Naive Bayes) tend to perform better beca
If training set is large, low bias / high variance models (e.g. Logistic Regression) tend to perform better be
PCA is a method for transforming features in a dataset by combining them into uncorrelated linear comb
These new features, or principal components, sequentially maximize the variance represented (i.e. the fi
As a result, PCA is useful for dimensionality reduction because you can set an arbitrary variance cutoff.
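The variance cutoff can be illustrated without running a full PCA. The sketch below assumes the per-component variances (the eigenvalues of the covariance matrix) have already been computed by some PCA routine and only shows how a cutoff picks the number of components to keep; the function name and numbers are hypothetical.

```python
def components_for_cutoff(variances, cutoff=0.95):
    """Smallest number of leading components whose variance share >= cutoff."""
    ordered = sorted(variances, reverse=True)
    total = sum(ordered)
    cumulative = 0.0
    for k, var in enumerate(ordered, start=1):
        cumulative += var
        if cumulative / total >= cutoff:
            return k
    return len(ordered)

# Hypothetical component variances: shares are 40%, 30%, 20%, 10%
k_80 = components_for_cutoff([4.0, 3.0, 2.0, 1.0], cutoff=0.8)
k_all = components_for_cutoff([4.0, 3.0, 2.0, 1.0], cutoff=1.0)
```

With an 80% cutoff the first two components cover only 70% of the variance, so three components are kept.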
Support Vector Machine Learning Algorithm performs better in the reduced space. It is beneficial to perf
10)How will you find the correlation between a categorical variable and a continuous variable ?
You can use the analysis of covariance technique to find the correlation between a categorical variable a
13)Explain what a local optimum is and why it is important in a specific context, such as K-means clusteri
20)What do you understand by statistical power and how do you calculate it?
21)What’s the Central Limit Theorem and what are its practical implications?
If the variability of true values along the regression line is not constant, then this condition is known as heteroskedasticity.
Good resources:
https://remicnrd.github.io./the-...
cs tie into many different fields, including decision theory, information theory, functional analysis (Hilbert spaces), convex optimization, an
ata mining: classification, clustering, regression, ranking, density estimation Chapters Available as Individual PDFs
Shannon Theory
scovery (CRISP-DM, KDD) Fourier Transforms
Sparse Regularization
Convex Analysis
Gradient Descent Methods
Non Smooth Optimization
Theory of Sparse Regularization
Compressed Sensing
classification and ranking) Machine Learning
eling (for density estimation), including sampling techniques Deep-Learning
d generalization bounds: Hoeffding bounds, Chernoff bounds (derived from Markov's bound), McDiarmid's inequality, VC bound
parameters (coefficients) of a function (f) that minimizes a cost function (cost)
ent descent is referred to as batch gradient descent. Batch gradient descent is the most common form of gradient descent described in m
erformed for each training instance, rather than at the end of the batch of instances
andom samples from another sample to generate a new sample that has a balance between the number of samples per class. This is usefu
e how well a model generalizes on a training dataset. In cross-validation we split the training dataset into N number of splits and then sepa
wo categories
on equation from data, the most common of which is called Ordinary Least Squares.
to estimate the values of the coefficients. The Ordinary Least Squares procedure seeks to minimize the sum of the squared residuals.
the values of the coefficients by iteratively minimizing the error of the model on your training data. This operation is called Gradient Desc
ation methods. These seek to both minimize the sum of the squared error of the model on the training data (using Ordinary Least Squares)
thod, the logistic function. The logistic function, also called the sigmoid function
babilities for each hypothesis are simplified to make their calculation tractable. Rather than attempting to calculate the values of each attr
on and regression
tion of how pure the leaf nodes are (how mixed the training data assigned to each node is).
The complexity of a decision tree is defined as the number of splits in the tree. Simpler trees are preferred. They are easy to understand
sifier called the maximal margin classifier
ambient space
lane optimal separating hyperplane), which is the separating hyperplane that optimal separating hyperplane is farthest from the training o
hm used for discovering relationships between a categorical response variable and other categorical predictor variables.
nish eventually, and one must use a lot of various components to explain the data set variance.
independent.
ize a loss function by evaluating parameters against data and then adjusting.
h set of parameters. This is akin to taking big, slow steps toward the solution.
the set of parameters before updating them. This is akin to taking small, quick steps toward the solution.
yes) tend to perform better because they are less likely to be overfit.
ession) tend to perform better because they can reflect more complex relationships
e variance represented (i.e. the first principal component has the most variance, the second principal component has the second most, an
fitting an SVM?
uced space. It is beneficial to perform dimensionality reduction before fitting an SVM if the number of features is large when compared to
a continuous variable ?
context, such as K-means clustering. What are specific ways of determining if you have a local optimum problem? What can be done to av
then this condition is known as heteroskedasticity.
aces), convex optimization, and probability. We will cover introductory material from most or all of these areas.
Syllabus
Introduction to different paradigms of machine learning Syllabus
Linear prediction, Regression Supervised Learning: Decision Trees and K-Nearest-N
Maximum Likelihood, MAP, Bayesian ML Models (Linear Regression and Logistic Regression),
Regularization, Generalization, Cross Validation and Flat Clustering, Gaussian Mixture Models (via Ex
Basics of Optimization Manifold Learning; Assorted Topics: Boosting, Reduc
Linear Classification, Logistic Regression, Naïve Bayes Topic Models for Text.
Support Vector Machines
Kernel Methods
Neural Networks, Backpropagation
Convolutional Neural Networks
amples per class. This is useful when we’d like to model against a dataset with highly unbalanced classes.
number of splits and then separate the splits into training and test groups. We train on the training group of splits and then test the mode
(using Ordinary Least Squares) but also to reduce the complexity of the model (like the number or absolute size of the sum of all coefficien
lculate the values of each attribute value P(d1, d2, d3|h), they are assumed to be conditionally independent given the target value and ca
or variables.
nent has the second most, and so on).
ession and Logistic Regression), Model Selection (AIC/BIC/Cross-validation, etc.), Feature Selection, Learning Theory; Unsupervised Learnin
Gaussian Mixture Models (via Expectation Maximization), Linear Dimensionality Reduction and Matrix Factorization, Nonlinear Dimensiona
ssorted Topics: Boosting, Reductions, Structured Prediction, Ranking, Semi-supervised Learning, Active Learning, Reinforcement Learning,
of splits and then test the model on the test group of splits. We rotate the splits between the two groups many times until we’ve exhauste
Learning Curves
Definition and illustration (complex models versus simple models)
Linear Regression example (learning curves for noisy linear target)
Learning Diagram
Components of learning (target function, hypothesis set, learning algorit
Input probability distribution (unknown distribution, bin, Hoeffding)
Error measure (role in learning algorithm)
Noisy targets (target distribution)
Where the VC analysis fits (affected blocks in learning diagram)
Learning Paradigms
Nonlinear Transformation
Basic method (linearity in the parameters, Z space)
Illustration (non-separable data, quadratic transform)
Generalization behavior (VC dimension of a nonlinear transform)
Occam's Razor
Definition and analysis (definition of complexity, why simpler is better)
Overfitting
The phenomenon (fitting the noise)
Sampling Bias
Definition and analysis (Truman versus Dewey, matching the distribution
Support Vector Machines
SVM basic model (hard margin, constrained optimization)
The solution (KKT conditions, Lagrange, dual problem, quadratic program
Soft margin (non-separable data, slack variables)
Nonlinear transform (Z space, support vector pre-images)
Kernel methods (generalized inner product, Mercer's condition, RBF ker
Validation
Introduction (validation versus regularization, optimistic bias)
Model selection (data contamination, validation set versus test set)
Cross Validation (leave-one-out, 10-fold cross validation)
VC Dimension
Growth function (dichotomies, Hoeffding Inequality)
ny times until we’ve exhausted all the variations Examples (growth function for simple hypothesis sets)
Break points (polynomial growth functions)
Bounding the growth function (mathematical induction, polynomial bou
Definition of VC Dimension (shattering, distribution-free, Vapnik-Chervo
VC Dimension of Perceptrons (number of parameters, lower and upper b
Interpreting the VC Dimension (degrees of freedom, Number of example
t is most unlikely in real data
om the observations to the hyperplane, and is known as the margin.
The Learning Problem - Introduction; supervised, unsupervised, and reinforcement learning. C
ywords below -- Is Learning Feasible? - Can we generalize from a limited sample to the entire space? Relations
The Linear Model I - Linear classification and linear regression. Extending linear models throug
blending, before and after the fact) Error and Noise - The principled choice of error measures. What happens when the target we
Training versus Testing - The difference between training and testing in mathematical terms. W
osterior, unknown versus probabilistic) Theory of Generalization - How an infinite model can learn from a finite sample. The most imp
The VC Dimension - A measure of what it takes a model to learn. Relationship to the number o
roximation-generalization tradeoff) Bias-Variance Tradeoff - Breaking down the learning performance into competing quantities. T
The Linear Model II - More about linear models. Logistic regression, maximum likelihood, and
Neural Networks - A biologically inspired model. The efficient backpropagation learning algori
Overfitting - Fitting the data too well; fitting the noise. Deterministic noise versus stochastic n
s, sample, PAC) Regularization - Putting the brakes on fitting the noise. Hard and soft constraints. Augmented
is, training data) Validation - Taking a peek out of sample. Model selection and data contamination. Cross valid
ng: search for green sample) Support Vector Machines - One of the most successful learning algorithms; getting a complex
Kernel Methods - Extending SVM to infinite-dimensional spaces using the kernel trick, and to
Radial Basis Functions - An important learning model that connects several machine learning m
n, model selection) Three Learning Principles - Major pitfalls for machine learning practitioners; Occam's razor, sa
Epilogue - The map of machine learning. Brief views of Bayesian learning and aggregation met
rror, CIA, supermarket)
in learning diagram)
bility estimation)
transform)
a nonlinear transform)
est neighbor)
learning, pseudo-inverse)
s global, EM algorithm)
regularization)
approximation)
soft-order constraint, augmented error)
ral networks)
e error, choosing a regularizer)
oise, stochastic noise)
tor pre-images)
t, Mercer's condition, RBF kernel)
othesis sets)
and reinforcement learning. Components of the learning problem. Introduction to Machine Learning
to the entire space? Relationship between in-sample and out-of-sample. General information and basic concepts. Overview
xtending linear models through nonlinear transforms. Supervised machine learning theory
happens when the target we want to learn is noisy. The supervised Machine Learning problem setup. C
sting in mathematical terms. What makes a learning model able to generalize? Linear methods for regression
a finite sample. The most important theoretical result in machine learning. Error functions for regression. Least squares: analy
. Relationship to the number of parameters and degrees of freedom. Linear methods for classification
ce into competing quantities. The learning curves. Error functions for classification. The perceptron a
ion, maximum likelihood, and gradient descent. Artificial neural networks
ackpropagation learning algorithm. Hidden layers. Artificial neural networks: multilayer perceptron a
istic noise versus stochastic noise. Kernel functions and support vector machines
d soft constraints. Augmented error and weight decay. Definition and properties of Kernel functions. Supp
egression. Least squares: analytical and iterative methods. Regularized least squares. The Delta rule. Examples.
lassification
assification. The perceptron algorithm. Novikoff's theorem. Separations with maximum margin. Generative learning algorithms and Gauss
works: multilayer perceptron and radial basis functions network. Application to classification and to regression problems.
support vector machines
rties of Kernel functions. Support vector machines for classification and regression problems.
ne learning techniques. Clustering algorithms: EM algorithm and k-means algorithm. Kernel Density Estimation.
ing and control
rcement learning. Markov decision processes and Bellman equations. Values and Temporal Difference methods. Q-learning and the Sarsa
g. Notes on deep learning, transductive learning and other hot topics. Challenging applications.
(classification and regression), unsupervised learning (clustering and density estimation) and semi-supervised learning (reinforcement and
Overfitting and underfitting. Generalization bounds. Complexity of a model: Vapnik-Chervonenkis dimension and Rademacher complexity.
earning algorithms and Gaussian discriminant analysis. Naive Bayes. Logistic regression. Multinomial regression.
n problems.
Lecture 3: 1/24/11
Ridge Regression and PCA
lecture notes pdf
Lecture 4: 1/26/11
The Central Limit Theorem; Large Deviations; and R
lecture notes pdf
Lecture 5: 1/30/11
The Moment Method; Convex Duality; and Large/M
lecture notes pdf
Lecture 6: 2/2/11
Hoeffding, Chernoff, Bennett, and Bernstein Bound
lecture notes pdf
Lecture 7: 2/7/11
Feature Selection, Empirical Risk Minimization, and
Lecture 8: 2/9/11
Feature Selection and Chi^2 Tail bounds
lecture notes pdf
Lecture 9: 2/14/11
Risk vs. Risk: Some terminology differences betwee
lecture 0 notes pdf
Empirical Processes
lecture 9 notes pdf
Lecture 10: 2/16/11
Bracketing Covering Numbers
erminology differences between Stats and ML Linear threshold functions, perceptron algorithm
defined risk analogously, causing some confusion) Risk bounds
Concentration inequalities
Uniform convergence
sion and Ridge Regression Minimax strategies for log loss, linear loss, and quadratic loss
Universal portfolios
Rademacher Averages
sition and Linear Prediction
al Covering Numbers
nd Packing Numbers
ent Descent
nd the VC dimension
Here is a tentative outline for the course:
Nonnegative Matrix Factorization [slides]
D. Lee and S. Seung. Learning the Parts of Objects by Nonnegative Matrix Factorization, Nature 1999.
S. Vavasis. On the Complexity of Nonnegative Matrix Factorization, SIOPT 2009.
S. Arora, R. Ge, R. Kannan and A. Moitra. Computing a Nonnegative Matrix Factorization -- Provably, STOC 2012.
S. Arora, R. Ge and A. Moitra. Learning Topic Models -- Going Beyond SVD, FOCS 2012.
S. Arora et al. A Practical Algorithm for Topic Modeling with Provable Guarantees, ICML 2013.
M. Balcan, A. Blum and A. Gupta. Clustering under Approximation Stability, JACM 2013.
C. Hillar and L. Lim. Most Tensor Problems are NP-hard, JACM 2013.
E. Mossel and S. Roch. Learning Nonsingular Phylogenies and Hidden Markov Models, STOC 2005.
A. Anandkumar, D. Foster, D. Hsu, S. Kakade and Y. Liu A Spectral Algorithm for Latent Dirichlet Allocation, NIPS 201
A. Anandkumar, R. Ge, D. Hsu and S. Kakade. A Tensor Spectral Approach to Learning Mixed Membership Commun
N. Goyal, S. Vempala and Y. Xiao. Fourier PCA, STOC 2014.
U. Feige and J. Kilian. Heuristics for Semirandom Graph Problems, JCSS 2001.
Sparse Coding
Sparse Recovery, Incoherence and Uncertainty Principles
Alternating Minimization via Approximate Gradient Descent [slides]
B. Olshausen and D. Field. Emergence of Simple-cell Receptive Field Properties by Learning a Sparse Code for Natur
D. Spielman, H. Wang and J. Wright. Exact Recovery of Sparsely-Used Dictionaries, COLT 2012.
S. Arora, R. Ge, T. Ma and A. Moitra. Simple, Efficient and Neural Algorithms for Sparse Coding, Manuscript 2014.
B. Barak, J. Kelner and D. Steurer. Dictionary Learning and Tensor Decomposition via the Sum-of-Squares Method, M
S. Geman and D. Geman. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images, Trans.
A. Dempster N. Laird and D. Rubin. Maximum Likelihood from Incomplete Data via the EM Algorithm, J. Royal Statis
S. Dasgupta. Learning Mixtures of Gaussians, FOCS 1999.
S. Arora and R. Kannan. Learning Mixtures of Separated Nonspherical Gaussians, Annals of Applied Probability 2005
A. Kalai, A. Moitra and G. Valiant. Efficiently Learning Mixtures of Two Gaussians, STOC 2010.
A. Moitra and G. Valiant. Settling the Polynomial Learnability of Mixtures of Gaussians, FOCS 2010.
M. Belkin and K. Sinha. Polynomial Learning of Distribution Families, FOCS 2010.
Discussion: Is nature an adversary? And if not, how can we model and exploit that?
A. Bhaskara, M. Charikar, A. Moitra and A. Vijayaraghavan. Smoothed Analysis of Tensor Decompositions, STOC 201
E. Candes and B. Recht. Exact Matrix Completion via Convex Optimization, FOCM 2009.
V. Chandrasekaran, P. Parrilo, B. Recht and A. Willsky. The Convex Geometry of Linear Inverse Problems, FOCM 201
P. Jain, P. Netrapalli and S. Sanghavi. Low-rank Matrix Completion using Alternating Minimization, STOC 2013.
M. Hardt. Understanding Alternating Minimization for Matrix Completion, FOCS 2014.
B. Barak and A. Moitra. Tensor Prediction, Rademacher Complexity and Random 3-XOR, Manuscript 2015.
Q. Berthet and P. Rigollet. Computational Lower Bounds for Sparse PCA, COLT 2013.
V. Chandrasekaran and M. Jordan. Computational and Statistical Tradeoffs via Convex Relaxation, PNAS 2013.
1. Introduction
2. Minimax formulation for learning, distribution free and adversarial learning settings, uniform guarante
n, Nature 1999.
3. Statistical Learning Framework
on -- Provably, STOC 2012.
4. Empirical risk minimization and Regularized empirical risk minimization
5. Deriving algorithms through relaxation and minimax analysis for online learning
STOC 2005.
s, FOCS 2010.
us adversaries, various notions of regret) connections to
Lecture 1 : Introduction, course details, what is learning theory, learning framewor
Reference : [1] (ch 1 and 3)
ing settings, uniform guarantees and no free lunch theorems
hattering dimension
Lecture 5 : Statistical Learning: MDL continued, infinite classes [lec5]
onvergence (iid)
ntial fat-shattering dimension
Lecture 9 : Statistical Learning: Properties of Rademacher complexity, examples [le
rtingale uniform convergence
Lecture 10 : Statistical Learning: Examples continued, Covering numbers, Pollard bo
Lecture 13 : Online Learning: Halving, Exponential weights, minimax rate for bit pre
Tue, Aug 25
ng theory, learning frameworks [slides] [notes]
Introduction, history, overview, and administrivia.
ameworks [lec2] Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Introduction to statistica
Theodoros Evgeniou, Massimiliano Pontil, and Tomaso Poggio, Statistical learning t
Ulrike von Luxburg and Bernhard Schölkopf, Statistical learning theory: models, con
Tomaso Poggio and Steve Smale, The mathematics of learning: dealing with data, N
Cosma Shalizi, Learning theory (formal, computational or statistical) (http://cscs.um
Thu, Aug 27
te classes [lec5] Tue, Sep 1
[notes]
macher Complexity, Growth function [lec6] Concentration inequalities: Markov, Chebyshev, McDiarmid (bounded differences i
Dimension, Massart's finite lemma [lec7] Torben Hagerup and Christine Rüb, A guided tour of Chernoff bounds, Information
Gábor Lugosi, Concentration-of-measure inequalities, lecture notes, 2003-2009
ma continued [lec7] Colin McDiarmid, Concentration, Probabilistic Methods for Algorithmic Discrete Ma
Terence Tao, Concentration of measure (http://terrytao.wordpress.com/2010/01/0
cher complexity, examples [lec9] Thu, Sep 3
[notes]
Covering numbers, Pollard bound [lec10] Formulation of the learning problem: concept and function learning; realizable case
ollard bound, Dudley Bound [lec11] Dana Angluin, Queries and concept learning, Machine Learning, vol. 2, no. 4, pp. 31
at-shattering dimension, learnability [lec12] Leslie Valiant, A theory of the learnable, Communications of the ACM, vol. 27, no. 1
Tue, Sep 8
ights, minimax rate for bit prediction [lec13] [notes]
Formulation of the learning problem, continued: agnostic (model-free) learning; co
Descent [lec14]
Dana Angluin, Queries and concept learning, Machine Learning, vol. 2, no. 4, pp. 31
David Haussler, PAC learning model, and decision-theoretic generalizations, with ap
Leslie Valiant, A theory of the learnable, Communications of the ACM, vol. 27, no. 1
Thu, Sep 10
Tue, Sep 15
[notes]
Empirical Risk Minimization: abstract risk bounds and Rademacher averages -- stoc
Peter Bartlett and Shahar Mendelson, Rademacher and Gaussian complexities: risk
ential Rademacher Complexity [lec19] Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification:
Thu, Sep 17
[notes]
Vapnik-Chervonenkis classes: shatter coefficients; VC dimension; examples of VC cl
Anselm Blumer, Andrzej Ehrenfeucht, David Haussler, and Manfred Warmuth, Lear
Gábor Lugosi, Pattern classification and learning theory, in Principles of Nonparame
Tue, Sep 22
Thu, Sep 24
[notes]
Binary classification: bounds for simple VC classes (linear and generalized linear dis
Peter Bartlett, Michael Jordan, and Jon McAuliffe, Convexity, classification, and risk
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification:
Tue, Sep 29
Thu, Oct 1
No class: Allerton conference
Tue, Oct 6
Thu, Oct 8
[notes]
Binary classification, continued: reproducing kernel Hilbert spaces and kernel mach
Peter Bartlett, Michael Jordan, and Jon McAuliffe, Convexity, classification, and risk
Olivier Bousquet, Stéphane Boucheron, and Gábor Lugosi, Theory of classification:
Tue, Oct 13
Thu, Oct 15
[notes]
Regression with quadratic loss
Tue, Oct 20
Thu, Oct 22
Thu, Oct 29
Tue, Nov 3
Thu, Nov 5
[notes]
Stability of learning algorithms: learnability without uniform convergence; average
Olivier Bousquet and André Elisseeff, Stability and generalization, Journal of Machin
Alexander Rakhlin, Sayan Mukherjee, and Tomaso Poggio, Stability results in lear
Shai Shalev-Shwartz, Ohad Shamir, Nathan Srebro, and Karthik Sridharan, Learnabi
Moritz Hardt, Ben Recht, and Yoram Singer, Train faster, generalize better: stability
Kobbi Nissim and Uri Stemmer, On the generalization properties of differential priv
Tue, Nov 10
Thu, Nov 12
Online learning: basic model; regret; regret bounds for online convex and strongly
Martin Zinkevich, Online convex programming and generalized infinitesimal gradien
Elad Hazan, Amit Agarwal, and Satyen Kale, Logarithmic regret algorithms for onlin
Nicolò Cesa-Bianchi, Alex Conconi, and Claudio Gentile, On the generalization abilit
Jacob Abernethy, Alekh Agarwal, Peter Bartlett, and Alexander Rakhlin, A stochasti
Tue, Nov 17
Thu, Nov 19
Tue, Dec 1
[notes]
Minimax lower bounds: binary classification under a margin assumption; reduction
Pascal Massart and Élodie Nédélec, Risk bounds for statistical learning, Annals of S
Bin Yu, Assouad, Fano, and Le Cam, in Festschrift for Lucien Le Cam, edited by D. Po
se progresses. Each topic will come with links to reference materials; key references will be highlighted. To get a rough idea of the materia
ugosi, Introduction to statistical learning theory, in Advanced Lectures in Machine Learning (O. Bousquet, U. von Luxburg, and G. Rätsch, ed
so Poggio, Statistical learning theory: a primer, International Journal of Computer Vision, vol. 38, no. 1, pp. 9-13, 2000
al learning theory: models, concepts, and results (http://arxiv.org/abs/0810.4752), 2008
f learning: dealing with data, Notices of the American Mathematical Society, vol. 50, no. 5, pp. 537-544, 2003
al or statistical) (http://cscs.umich.edu/~crshalizi/notebooks/learning-theory.html), Jan 09, 2011 [A nice succinct summary, with lots of use
Chernoff bounds, Information Processing Letters, vol. 33, no. 6, pp. 305-308, 1990 [Short and sweet]
, lecture notes, 2003-2009
ds for Algorithmic Discrete Mathematics, pp. 1-46, 1998
tao.wordpress.com/2010/01/03/254a-notes-1-concentration-of-measure/), Jan 03, 2010
eoretic generalizations, with applications to neural nets, in Mathematical Perspectives on Neural Networks, Lawrence Erlbaum Associates,
d Rademacher averages -- stochastic inequalities for ERM; Rademacher averages (structural results, Finite Class Lemma); introduction to VC
nd Gaussian complexities: risk bounds and structural results, Journal of Machine Learning Research, vol. 3, pp. 463-482, 2002
ugosi, Theory of classification: a survey of recent advances, ESAIM Probability and Statistics, vol. 9, pp. 323-375, 2005 (Section 3 only)
near and generalized linear discriminant rules); surrogate loss functions; margin-based bounds
nvexity, classification, and risk bounds, Journal of the American Statistical Association, vol. 101, no. 473, pp. 138-156, 2006
ugosi, Theory of classification: a survey of recent advances, ESAIM Probability and Statistics, vol. 9, pp. 323-375, 2005
uniform convergence; average and uniform stability of learning algorithms; the role of convexity and strong convexity; stability of Stochasti
neralization, Journal of Machine Learning Research, vol. 2, pp. 499-526, 2002
Poggio, Stability results in learning theory, Analysis and Applications, vol. 3, no. 4, pp. 397–417, 2005
nd Karthik Sridharan, Learnability, stability, and uniform convergence, Journal of Machine Learning Research, vol. 11, pp. 2635-2670, 2010
ter, generalize better: stability of stochastic gradient descent, preprint, 2015
margin assumption; reduction to finite testing on a binary hypercube (Assouad's lemma); extra log factor for rich VC classes; information-t
statistical learning, Annals of Statistics, vol. 34, no. 5, pp. 2326-2366, 2006.
Lucien Le Cam, edited by D. Pollard, E. Torgersen, and G. Yang, pp. 423-435, 1997, Springer-Verlag.
get a rough idea of the material, check out the schedules from past offerings: Fall 13, Fall 14.
onvexity; stability of Stochastic Gradient Descent; connection between differential privacy, stability, and generalization
s sampling)
ls, EM Algorithm) Proximal gradient method
rkov Models)
Conjugate functions
Dual decomposition
Newton's method
Quasi-Newton methods
Gauss-Newton method
Lectures from previous years
Conic optimization and interior-point methods
Conic optimization
Barrier functions
Path-following methods
Symmetric cones
First-order methods
Smoothing
Cutting-plane methods
Ellipsoid method
ds for Large-Scale Systems EE236A - Linear Programming (Fall Quarter 2013-14) CSE 291: Topics in unsupervised learning
Prof. L. Vandenberghe, UCLA Time
optimization
nsupervised learning
5 in EBU3B 4138
al clustering [4/15]
rojection [5/13]
Time
TuTh 11-12.30 in CSE 2154
ns of bounded independent random variables
Instructor:
Sanjoy Dasgupta
Office hours Tue 2-4 in CSE 4138
ses, Occam-style bounds Administrative details
Course requirements: There will be periodic homework assignments as well as a final project.
The axiomatic formulation of entropy that I presented is one of many, but my personal favorit
Aczel, Forte, Ng. Why Shannon and Hartley entropies are "natural". (Find it in JSTOR, or email
Here's the paper for the species distribution problem we discussed:
Phillips, Dudik, Schapire. A maximum entropy approach to species distribution modeling.
Homework 1, due 1/31.
Bayesian inference for exponential families (Jan 24,29,31)
Here's the paper on modeling amino acid distributions using Dirichlet mixtures:
Sjolander, Karplus, Brown, Hughey, Krogh, Mian, Haussler. Dirichlet mixtures: a method for im
listic embeddings Homework 2, due 2/12.
Gaussian models: conditioning, linear regression, kernel trick, Bayesian model selection, Gaus
A good basic reference on Gaussian processes is the following book, available online:
ents as well as a final project. Lecture 7 Chernoff's Bound and Hoeffding's Inequality
Lecture 8 Classification Error Bounds This course will provide an introduction
Lecture 12 Complexity Regularization for Squared Error Loss Linear threshold functions, perceptron a
Lecture 13 Maximum Likelihood Estimation Risk bounds
panying readings Lecture 14 Maximum Likelihood and Complexity Regularization Concentration inequalities
Lecture 15 Denoising II: Adapting to Unknown Smoothness Uniform convergence
Lecture 16 Wavelet Approximation Theory Rademacher averages; combinatorial di
Lecture 17 Denoising III: Spatial Adaptivity Convex surrogate losses for classificatio
Lecture 18 Introduction to VC Theory Game-theoretic formulations of predicti
Lecture 19 The VC Inequality Minimax strategies for log loss, linear lo
to natural language processing. Lecture 20 Applications of VC Theory Universal portfolios
nd 3 of the following fantastic text: Online convex optimization
Neural networks
ure models.
: Peter Bartlett.
urs: Wed 9:00-10:00, 399 Evans; Thu 11:00-12:00; 723 SDH.
ek. Office Hours: Wed 5:00-6:00 and Thu 4:00-5:00; 283H Soda.
e will provide an introduction to the theoretical analysis of prediction methods, focusing on statistical and computational aspects. It will c
ation inequalities
onvergence
her averages; combinatorial dimensions
urrogate losses for classification
oretic formulations of prediction problems
strategies for log loss, linear loss, and quadratic loss
nvex optimization
c gradient methods
as I-projection
Fall 2013
Overview
Machine learning is an exciting and fast-moving field of computer science with many recent consumer applications (e.g., Micro
General information
Instructor:
Prof. David
dsontag {@ | at} cs.nyu.edu
Grader:
Chen-Chien Wang
Grading: problem sets (50%) + midterm exam (25%) + project (20%) + participation (5%). Problem Set policy
Pre-requisites: Basic Algorithms (CS 310) is required, but can be taken concurrently. Students should be very comfortable with
Books: No textbook is required (readings will come from freely available online material). If an additional reference is desired,
Schedule
Loss functi
Barber 17.1-2 (stop before 17.2.1) on least-squares regression, 29.1.1-4 (review of vector algebra)
4
Sept 12 (Th)
Support vector machines [Slides]
See above. Also:
5
Sept 17 (Tues)
Support vector machines (continued) [Slides]
If you would like a second reference, see these notes (sections 5-8)
Bishop, Section 6.2, Section 7.1 (except for 7.1.4), and Appendix E
Optional: For more advanced kernel methods, see chapter 3 of this book (free online from NYU libraries)
7
Sept 24 (Tues)
Kernel methods & optimization
8
Sept 26 (Th)
Learning theory [Slides]
VC-dimension
Notes on learning theory
10
Oct 3 (Th)
Learning theory (continued) [Slides]
11
Oct 8 (Tues)
Nearest neighbor methods [Slides]
Oct 22 (Tues)
Midterm exam
14
Oct 24 (Th)
Clustering [Slides]
K-means
Hastie et al., Sections 14.3.6, 14.3.8, 14.3.9, 14.3.12
Hierarchical clustering
See above.
16
Oct 31 (Th)
Clustering (continued) [Slides]
Spectral clustering
Hastie et al., Section 14.5.3
17
Nov 5 (Tues)
Introduction to Bayesian methods [Slides]
18
Nov 7 (Th)
Naive Bayes [Slides]
19
Nov 12 (Tues)
Logistic regression [Slides]
21
Nov 19 (Tues)
EM algorithm (continued) [Slides]
22
Nov 21 (Th)
Hidden Markov models [Slides]
Notes on HMMs
Tutorial on HMMs
Murphy, Chapter 17
Bishop, Sections 8.4.1, 13.1-2
23
Nov 26 (Tues)
Dimensional
Notes on PCA
More notes on PCA
25
Dec 5 (Th)
Collaborative filtering
[Slides]
26
Dec 10 (Tues)
Applications in computational biology [Slides]
An introduction to graphical models
Dec 12 (Th)
Project presentations (group 1)
Dec 17 (Tues)
10-11:50am
Project presentations (everyone else)
During final exam slot. Note the special time! Same location.
Acknowledgements: Many thanks to the University of Washington, Carnegie Mellon University, UT Dallas, Stanford, UC Irvine,
Reference materials
Machine learning books
Trevor Hastie, Rob Tibshirani, and Jerry Friedman, Elements of Statistical Learning, Second Edition, Springer, 2009. (Can be dow
David Barber, Bayesian Reasoning and Machine Learning, Cambridge University Press, 2012. (Can be downloaded as PDF file.)
Probability
Chapter 2 of either Murphy or Bishop (see also Bishop Appendix B)
Review notes from Stanford's machine learning class
Sam Roweis's probability review
Linear algebra
Bishop Appendix C
Online class from MIT
Review notes from Stanford's machine learning class
Sam Roweis's linear algebra review
Calculus
Bishop Appendix D and E (Lagrange multipliers)
Notes from MIT on Lagrange multipliers
Dan Klein's Lagrange Multipliers without Permanent Scarring
Optimization
Convex Optimization by Stephen Boyd and Lieven Vandenberghe. (Can be downloaded as PDF file.)
sumer applications (e.g., Microsoft Kinect, Google Translate, iPhone's Siri, digital camera face detection, Netflix recommendations, Google
ould be very comfortable with basic mathematical skills in addition to good programming skills. Some knowledge of linear algebra and mu
dditional reference is desired, the following books are good options. Bishop's book is easier to read, whereas Murphy's book has more dep
on see Hastie, Section 7.10 (pg. 250).
UT Dallas, Stanford, UC Irvine, Princeton, and MIT for sharing material used in slides and homeworks.
Queen's University
HEC Montreal
Ryerson University
Carleton University
University of Waterloo
https://skoolville.com/blog/canada-universities-with-masters-in-dat
in Data Analytics
of Data Analytics
usiness Analytics
omputer Science – Data Analytics
dan Certificate
-degree/data-science-analytics/canada-data-science-analytics
Applied Data Science and Big Data weCloud Data
https://weclouddata.com/courses/data-science-diploma
Springboard
BrainStation
Which is a better deal: a Masters in Data Science or Data Science Bootcamp?
https://www.coursecompare.ca/best-data-analytics-certification/
brainstrom
York University School of Continuing Studies
The G. Raymond Chang School of Ryerson
University of Toronto School of Continuing Studies Continuing Education, Ryers
https://lanterninstitute.ca/
13 weeks
Faster way to get a job than a master's degree?
https://www.switchup.org/bootcamps/weclouddata
The University of Toronto School of Continuing Studies Boot Camps offer a 12-week, full-time
York
McGill
Waterloo
Toronto
https://www.coursecompare.ca/subject/data-science-courses/
https://www.ryerson.ca/graduate/datascience/admission/faq/
OSAP student financial assistance
https://www.ryerson.ca/sfa/
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/student_group/
https://www.ontario.ca/page/how-apply-osap
Important Dates
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/important-dates/
National Student Loans Service Centre (NSLSC)
https://www.csnpe-nslsc.canada.ca/en/funding-options
https://www.ryerson.ca/sfa/govt_aid/osap/fulltime/
https://mscac.utoronto.ca/concentrations/data-science
https://web.cs.toronto.edu/graduate/admissions
OSAP and Tuition for University of Toronto (in 4 minutes) Youtube Channel
OSAP Aid Estimator
https://osap.gov.on.ca/AidEstimator1920Web/enterapp/enter.xhtml