AI Lab Design
Building an AI Lab
Based on GPT technology
PI: Michael Gendron
Co-PI: Abdullah Oguz
Contents
Introduction
Steps to Create an AI Lab at CCSU
Necessary Technology
Development of Curriculum
Project Ideas for Students in an AI Lab
Other Student and Faculty Activities in the Lab
Exploring the Applications and Advantages of an AI Lab: Enhancing Research, Learning, and Collaboration
Budget
Proposed Timeline
Introduction
Artificial Intelligence (AI) is rapidly becoming one of the most critical technologies of our time.
As AI evolves, it presents numerous opportunities for students to gain practical knowledge and
skills while contributing to research and development in the field. Building an AI laboratory that
is well-equipped and designed to enhance student learning is an essential step toward preparing
students for the future of work.
A significant goal of this lab is to make New Britain and surrounding communities AI-informed.
Students in the AI curriculum will accomplish this by engaging the community and
sharing what they learn with the general public. A required community service course could be
part of the lab curriculum. As we build the lab, we can also garner public support by publicizing
that CCSU is on the ground floor of AI, and educating the general public is a good start. We should
pursue a series of articles in the NB Herald and other local outlets to build community curiosity
and interest in CCSU as a resource.
This concept paper presents a step-by-step guide that educational institutions can use to build
an AI laboratory. It also outlines a curriculum to provide students with the necessary skills to
work in AI. Additionally, this paper provides examples of potential use cases for the lab.
AI is an interdisciplinary domain. We propose creating a lab and, once it is set up, forming a
multidisciplinary council of CCSU faculty to guide further use of the AI Lab. Below are our initial steps.
iii. Department of Defense (DOD): The DOD offers national security and
defense research funding. If the AI lab deals with applications related to
these areas, the DOD could be a potential funding source.
iv. Private foundations: Several private foundations fund research and
facilities in science and technology. For example, the Chan Zuckerberg
Initiative, the Bill and Melinda Gates Foundation, and the Moore
Foundation all offer funding opportunities for research and infrastructure.
Other potential foundations include the Alfred P. Sloan Foundation, Arnold
Ventures, Intel Foundation, and IBM Foundation.
v. Industry partnerships: It may be possible to secure funding through
partnerships with industry organizations interested in the research
conducted at an AI lab. These partnerships could include technology
companies, healthcare organizations, or defense contractors. OpenAI, the
creator of ChatGPT, and Google, the creator of Bard, are two
organizations we could approach for this partnership. The PI is already in
conversations with Grammarly to determine their interest.
2. Choose a suitable location: The lab requires a dedicated campus space. It should have
ample room for workstations, collaboration areas, and any necessary hardware.
3. Equip the lab: With a location secured and a budget in hand, we can proceed with
equipping the lab, including setting up powerful workstations, acquiring necessary
software, and providing access to relevant datasets and resources (see the next section for
details).
4. Expertise: Drs. Gendron and Oguz will provide initial expertise in artificial
intelligence, natural language processing, and machine learning. Their guidance and
mentorship are invaluable for students working on GPT-related projects. We will invite
other faculty to collaborate after setup.
5. Encourage interdisciplinary collaboration: The AI lab must create a cooperative
environment to bring together students and faculty from diverse disciplines, including but
not limited to computer science, linguistics, psychology, and design. By encouraging this
cross-disciplinary collaboration, the lab can leverage broader perspectives and skills,
fostering innovative problem-solving and creativity in AI research and application.
6. Create a supportive community: A welcoming and inclusive environment where
students can share ideas, collaborate on projects, and learn from one another is essential.
This environment can help create a strong sense of belonging and motivation.
7. Develop a curriculum (see below): An AI-focused curriculum must be designed to
cover the fundamentals of AI, NLP, and machine learning, as well as advanced topics
specific to LLMs and AI, such as fine-tuning, data collection, and evaluation methods.
8. Offer workshops and training sessions: Regular workshops, seminars, and training
sessions led by experts in the field will complement the curriculum and provide practical,
hands-on learning experiences. These sessions can cover the technical and ethical aspects
of LLM development and deployment.
9. Establish a Council: An interdisciplinary council with members from across CCSU
can be created to enhance innovation and collaboration and to maximize the use of the
lab. This council can provide oversight and strategic direction for the lab.
10. Promote research opportunities: Students should be encouraged to participate in
research projects, publish papers, and present their findings at conferences. This
participation can help establish the lab's reputation and contribute to the broader LLM and
AI research community.
11. Monitor progress and adapt: Metrics should be developed to evaluate the lab's
performance regularly and adjust as needed. These adjustments may include updating
equipment, refining the curriculum, or changing research priorities based on new
developments in the field.
Necessary Technology
Below is a more detailed description of the equipment and resources needed to set up an AI lab:
1. Hardware:
a. High-performance computers: Powerful workstations with the latest CPUs and
GPUs should be provided to handle intensive tasks like model training and fine-
tuning. We should ensure these computers have sufficient RAM and storage for
large datasets and temporary files.
b. Servers and storage: A local or cloud storage solution should be set up to host
datasets, models, and other resources. This storage allows for easier collaboration
among lab members and ensures data security.
c. Networking: High-speed internet and a secure Wi-Fi network should facilitate
seamless communication and data transfer between workstations, servers, and the
cloud. Build a two-ring network: a research ring for PII data and an open ring for
teaching.
d. Peripherals: Workstations should be equipped with necessary peripherals like high-resolution
monitors, ergonomic keyboards, and mice to create a comfortable
student working environment.
2. Software:
a. Operating systems: Given CCSU's predominant use of Windows and macOS
computers, the lab should primarily use Windows and macOS PCs, with
preference given to Windows systems. The lab will consider using virtual
machines if Linux becomes necessary for specific tasks. This strategy would
allow Linux environments to run on existing hardware, providing the flexibility to
use Linux's advanced capabilities for specific AI research tasks without a
dedicated Linux machine.
b. AI frameworks and libraries: The lab should ensure access to prominent AI
frameworks and libraries like TensorFlow, PyTorch, and Hugging Face
Transformers, which are vital for developing LLMs.
c. OpenAI API access: The lab should arrange secure access to the OpenAI APIs,
allowing experimentation with ChatGPT and other AI models offered by OpenAI.
d. Development tools: The lab needs to install development tools such as the
Visual Studio Code and PyCharm Integrated Development Environments (IDEs),
the Anaconda distribution, and Jupyter Notebooks to streamline coding and
debugging processes.
e. Collaboration tools: The lab should adopt version control systems like Git and
collaboration platforms like GitHub or GitLab to manage code repositories and
foster teamwork.
f. Data visualization and analysis tools: The lab should incorporate software tools
like Matplotlib, Seaborn, or Tableau, which are essential for visualizing and
analyzing research data.
3. Datasets and resources:
a. Access to datasets: The lab should ensure access to publicly available and
proprietary datasets pertinent to LLM research. These datasets may include text
corpora, conversational data, and domain-specific collections.
b. Data collection tools: The lab should provide tools and resources necessary for
web scraping, data annotation, and other data collection methods to help students
build custom datasets for their projects.
c. Documentation and tutorials: The lab should maintain a library with relevant
documentation, tutorials, and educational materials to facilitate students' learning
about LLMs and related technologies.
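As an illustration of how the software list above could be verified on a lab workstation, the following is a minimal sketch, not a prescribed lab procedure. It assumes Python is installed and that the packages use their usual import names (tensorflow, torch, transformers, matplotlib, seaborn). The function names check_packages and format_report are hypothetical and introduced only for this example.

```python
import importlib.util

# Assumed import names for the frameworks and tools named in the software list.
REQUIRED_PACKAGES = ["tensorflow", "torch", "transformers", "matplotlib", "seaborn"]


def check_packages(names):
    """Return a dict mapping each top-level package name to True if importable."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


def format_report(status):
    """Build a plain-text report with one line per package, sorted by name."""
    lines = []
    for name, installed in sorted(status.items()):
        lines.append(f"{name}: {'installed' if installed else 'MISSING'}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Print the status of every required package on this workstation.
    print(format_report(check_packages(REQUIRED_PACKAGES)))
```

A check like this only confirms that a package can be located. It does not confirm the package version or that GPU support is configured.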
Development of Curriculum
The interdisciplinary council should develop a comprehensive AI lab curriculum covering
foundational and advanced AI, NLP, and machine learning topics. The curriculum could draw on
existing CCSU courses, introduce new techniques, and lead to a new degree. Below is a detailed
outline of the critical areas to include in the curriculum:
1. Introduction to Artificial Intelligence (AI):
a. History and evolution of AI
b. Types of AI (Narrow AI, General AI, Superintelligence, and LLMs)
c. AI applications across various industries
2. Fundamentals of Natural Language Processing (NLP):
a. Basic concepts in linguistics
b. Tokenization, stemming, and lemmatization
c. Part-of-speech tagging and named entity recognition
d. Syntax and dependency parsing
e. Sentiment analysis and text classification
3. Basics of Machine Learning (ML):
a. Supervised, unsupervised, and reinforcement learning
b. Standard ML algorithms and techniques
c. Feature engineering and data preprocessing
d. Model evaluation and validation
e. Overfitting, underfitting, and model selection
f. Application of ML principles to the training of LLMs
4. Introduction to Deep Learning:
a. Artificial neural networks and their components
b. Activation functions, loss functions, and optimization algorithms
c. Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs)
d. Basics of transfer learning
5. Advanced NLP with Deep Learning:
a. Word embeddings and pre-trained language models
b. Transformer architecture and attention mechanisms
c. State-of-the-art models like Google's BERT (Bidirectional Encoder
Representations from Transformers) and Bard, OpenAI's ChatGPT, and their
variants.
6. ChatGPT-specific topics:
a. Overview of the GPT architecture and its evolution (GPT-1, GPT-2, GPT-3,
GPT-3.5, and GPT-4)
b. Preprocessing and tokenization for ChatGPT
c. Fine-tuning techniques for domain-specific applications
d. Model evaluation metrics and best practices
e. Techniques for controlling and improving model output
7. Ethics and Responsible AI:
a. Bias, fairness, and transparency in AI models
b. Privacy and data protection concerns
c. AI safety and robustness
d. Responsible deployment and use of AI systems
e. Regulatory and policy considerations
8. Practical applications and projects:
a. Building LLM-powered chatbots, virtual assistants, and Q&A systems
b. Content generation, summarization, and translation
c. Sentiment analysis, emotion detection, and personality modeling
d. Domain-specific applications in healthcare, finance, law, and more
9. Research and innovation:
a. Current research trends and challenges in AI, NLP, and ChatGPT
b. Identifying research gaps and opportunities
c. Methods for conducting and disseminating research
Faculty would deliver the curriculum through lectures, seminars, hands-on workshops, and
self-paced learning materials. By covering these topics, the curriculum will give students a solid
understanding of LLMs and their related technologies, preparing them for research, development,
and innovation in the field.
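To suggest what a hands-on workshop exercise for the attention mechanisms topic (item 5b) might look like, the following is a minimal sketch of scaled dot-product attention for a single query. It uses only the Python standard library. The function names and the toy vectors are assumptions chosen for illustration, and a real course would more likely use PyTorch or TensorFlow.

```python
import math


def softmax(scores):
    """Convert raw scores into non-negative weights that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Scaled dot-product attention for one query vector.

    Each key is scored against the query, the scores are scaled by the
    square root of the vector dimension, and the softmax weights are used
    to average the value vectors.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, output


if __name__ == "__main__":
    # Toy example: two identical keys receive equal attention weights.
    weights, output = attention(
        query=[1.0, 0.0],
        keys=[[1.0, 0.0], [1.0, 0.0]],
        values=[[2.0, 0.0], [0.0, 2.0]],
    )
    print(weights, output)
```

In an exercise of this kind, students could change the keys and observe how the weights shift toward the key most similar to the query.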
Budget
The IT department is preparing the budget under George Claffey's direction.
Proposed Timeline
Fall 2023
Claffey, Gendron, Oguz
• Gendron and Claffey finalize planning and present to deans, provost, and president for final approval.
• Claffey and the IT department secure room, furniture, and hardware.
• Gendron and Oguz oversee the building of the lab and the installation of hardware/software.
• Create draft proposals for the new curriculum.
Interdisciplinary Committee
• Create an interdisciplinary committee to oversee the AI Curriculum
o Gendron will be the initial chair.
• Create curriculum for submission in November.
• Once courses and curriculum are approved, create course shells.
• Marketing starts for the new program.
Spring 2024
Claffey, Gendron, Oguz
• Build IVAI LLM for the CCSU Website (proof of concept).
Interdisciplinary Committee
• Plan for the new program
Fall 2024 - Start to offer the new program.