Professional Documents
Culture Documents
Project Ideas and Challenges
Project Ideas and Challenges
Product Related
AI (Generic)
Background
Data is now everywhere and exponentially getting larger in volume and type. There is a serious
problem in identifying data that is fake and authentic data. Due to the lack of controls and lack of
regulations of AI data, numerous problems arise, including data whose sources are unknown, data
that is used to make inaccurate conclusions as well as lead to wasted resources on storage,
maintenance, and processing of false data. Fake data can also be used to train algorithms, leading
to biased results.
Challenge / Requirement
The ask is to create a solution that can somehow tag every piece of original work, whether images,
video, audio or text) with a unique digital signature or watermark that is easy to verify for the public
but impossible for bad actors to crack as it does not follow human identifiable patterns. The
watermark should be permanent and non-erasable and cannot be deleted even if the digital asset is
copied or duplicated. The solution should also give data owners complete authority on who can and
who cannot train AI models with their data.
Deliverables
1. A digital watermark algorithm that can embed digital watermarks into data sets.
4. A method for verifying the integrity of data streams by extracting the digital watermarks
5. A method for controlling access to the data sets based on the presence of digital watermarks
1. Robustness: The watermark should not be easily removed or rendered useless when altered.
2. Reversibility: The watermark should be reversible.
3. Quality: The watermark should not reduce the purpose or quality of the data.
4. Security: The watermark should be secure and not easily broken or removed
AI (Generic)
Background
Generative Artificial Intelligence(GenAI), has gained significant attention in recent years. These
models are known for their large size and require substantial computational resources for both
training and making predictions.
Challenge / Requirement
Create a framework for open-source GenAI models (LLM and Diffusion based Model),
compress and optimize them with various techniques and compare the performance and accuracy
aspects for both training and inference perspectives.
Deliverables
The use of existing tools like Openvino and similar types of tools can be used and compared.
3. Prompting Framework For LLM
Nature of Challenge
Product Related
AI (Generic)
Background
Large Language Models (LLMs) are gaining popularity in various fields, including engineering.
A new subject called "Prompting" is emerging, which involves using these LLMs effectively.
Challenge / Requirement
Design and develop an innovative framework for Prompting Large Language Models (LLMs) in
engineering applications. The framework should explore and propose effective methods for
utilizing LLMs to solve specific engineering tasks. It must provide clear guidelines on how to
prompt LLMs with appropriate instructions and constraints to achieve accurate and relevant
outputs. The challenge aims to leverage the power of LLMs and their language understanding
capabilities to enhance engineering problem-solving. In the solution, the user should also be able
to upload the relevant documents in the form of pdf, word, ppt, excel etc. type of files and write
the appropriate prompts to get the relevant information from the uploaded documents.
Deliverables
The solution should try to focus on using open-source LLM models and must have comparative
reports of the accuracy and performance of the open-source LLM models with the benchmark
ones like ChatGPT or BARD.
4. Auto generation/modification of prompts based on factual content
of answers
Nature of Challenge
Product Related
AI (Generic)
Background
Prompt engineering is emerging are where writing a right prompt is an art that minimizes the
hallicination and increases the factual content and conciseness of generated output from LLM.
There is need of a system that automatically generates the right prompt or correct the earlier prompt
to get the right answer.
Challenge / Requirement
build a ML model that should generate and re-generates prompt that induces zero short learning, or
few shot learning or teach LLM model with examples until large language model generates the right
answer that should have minimum hallucinations and maximum relevant factual information.
Deliverables
light weight LLM model or alternate DL model that controls regenerative prompt.
light weight LLM model or alternate DL model that controls regenerative prompt.
Product Related
AI (Generic)
Background
Customer feedback from various sources, such as surveys, reviews, social media, or direct
communication are analyzed to extract the key insights that may help improve
product/services. Prioritizing of improvements based on impact, feasibility, cost, product vision and
goals needs to be justified with evidence and logic. The improvised product also needs to
be thoroghly tested to avoid any negative side effects or unintended consequences before product
launch.
Challenge / Requirement
1. develop a LLM based pipeline that should read and extract actionable insight (strength, weakness,
customer preferences etc) for any specific product/product family preferably medical device. multiple
open source product review can be used specific to one product family
2. A generative AI model should generate new product specification considering customer feedback
and competing products to minimize the hallucination.
3. LLM should provide step by step reasoning/justification for any change recommended by LLM
4. Using A/B testing or causal model to do predict impact on revenue or any other factors
Deliverables
A LLM Model with interfaces to interact based on new logs that are input
1. develop a LLM based pipeline that should read and extract actionable insight (strength, weakness,
customer preferences etc) for any specific product/product family preferably medical device. multiple
open source product review can be used specific to one product family
2. A generative AI model should generate new product specification considering customer feedback
and competing products to minimize the hallucination.
3. LLM should provide step by step reasoning/justification for any change recommended by LLM
4. using A/B testing or causal model to do predict impact on revenue or any other factors
AI (Medical)
6. Generation of New molecules similar to existing drug
Nature of Challenge
Process Related
AI (Medical)
Background
Drug discovery is a critical and challenging areas in medicine. Drug discovery involves a very
prolonged and costly process of identifying potential drug candidates and testing their efficacy
and safety.
Challenge / Requirement
Generative AI can significantly accelerate this process by using deep learning algorithms to
identify potential drug candidates based on a large dataset of compounds and their properties.
Virtual compounds are generated by AI algorithms and tested in silico, which means in a
computer simulation rather than a laboratory. This decreases the time and cost involved in the
discovery of new drugs.
Deliverables
Process Related
AI (Medical)
Background
The predictive capabilities of generative AI play a crucial role in data interpretation and decision-
making. Researchers can rely on AI-generated insights to predict mass spectra, propose potential
chemical structures, and guide the allocation of resources. This results in robust and reliable data
analysis, reducing human error and increasing the confidence in research outcomes.
Moreover, the economic advantages of generative AI extend beyond cost savings. By minimizing the
wastage of resources, it contributes to sustainable research practices, aligning with the global push
for environmental responsibility.
Challenge / Requirement
Collect diverse, high-quality mass spectrometry data for specific defined goal. Choose an
appropriate generative AI model like VAEs, GANs, or Transformers. Train the model using the
dataset, optimizing parameters and loss functions. Employ the model for virtual experimentation,
simulating mass spectra for different scenarios, and predicting the effects of experimental variables.
Utilize the model for optimal experimental design, identifying conditions likely to yield informative
results. Integrate the generative AI into data augmentation pipelines to enhance dataset diversity.
For compound identification, employ the model to propose potential chemical structures matching
observed spectra.
Deliverables
The solution should address critical factors such as accurate virtual experimentation for efficient
optimization of experimental setups, precise compound identification through AI-generated
predictions, streamlined data augmentation to enhance dataset diversity, and collaboration with
domain experts to ensure validation. Key factors include the reduction of resource-intensive physical
experiments, cost-effective utilization of specialized instruments, and the integration of responsible
AI practices to ethically drive research efficiency.
Nature of Challenge
Product Related
AI (Medical)
Background
There are lot of advancement in medical industry where SW are used for connectivity, diagnosis,
and Patient health data storage. Though medical devices and software are tested for cyber security
threats using various software and manual processes, there are always security vulnerability that
can remain or come in through upgrades.
Challenge / Requirement
Design a software solution using opensource AI stacks to detect security vulnerability of medical
devices by considering
The solution can consider usage of AI for cyber security threat detection. The tool can learn and get
trained continuously and automatically to help improve the threat detection in the medical and life
science devices.
Deliverables
1. create SW application which can run on any platform such as computer , mobile etc.
3. Software developed using open AI or any other stack shall be able to detect the threat in
application mentioned in 1 and 2.
[note: SW applications 1 and 2 can be free SW as well and can be downloaded from internet if
available but it shall be medical device application.]
SW tool developed shall be able to find 1. applicable standards for cyber security 2. identify threat in
any connected device or SW.
Product Related
AI (Medical)
Background
Sepsis is a life threatening condition. Studies have shown that early identification of sepsis following
rapid initiation of antibiotic treatment improves patient outcomes. A generative AI algorithm that can
simulate different scenarios for treating sepsis.
Challenge / Requirement
Generative AI creates realistic models for medical simulations and training purposes. With hyper-
realistic training environments, healthcare professionals can engage in complex medical procedures
and improve their skills and decision-making abilities without real-world consequences.
Deliverables
Product Related
AI (Medical)
Background
Generative AI can help improve the accuracy and efficiency of using machine learning in
combination with medical imaging techniques, such as CT and MRI scans. Machine learning models
can automatically identify abnormalities in images and alert doctors to potential issues.
1. A generative AI algorithm like generative adversarial networks (GANs) is trained on real patient
data.
2. The generative AI model generates new images that are similar to the input data but can result in
better machine learning due to increased volume or diversity in the data.
3. AI model is trained on the synthetic data.
Challenge / Requirement
Deliverables
A POC that can demonstrate creation of medical images and can be used to train other models that
to automatically identify abnormalities in images and alert health provider potential issues.
Product Related
AI (Medical)
Background
The goal of medical scribes is to pare down documentation burden on clinicians. Physicians can
spend up to six hours per day logging notes into an EHR, cutting into time with patients and
contributing to burnout.
Challenge / Requirement
Clinical documentation service that allows providers to automatically create medical notes using
Generative AI. Providers to build clinical applications that use speech recognition and generative AI
to create transcripts of patient visits, identify key details and create summaries that can be entered
into an electronic health record.
Deliverables
The source of generated text in the summary from the original transcript, making it easier for doctors
to review clinical notes.
AI(Others)
12. Image Semantic Segmentation
Nature of Challenge
Product Related
AI (Others)
Background
Semantic segmentation is the task of classifying each and very pixel in an image into a class. These
may be needed for various applications such as self driving cars, robotic system, damage detection
et all. Collection of thousands and millions of images and annotation for Semantic segmentations
and scenario labelling involves considerable efforts and is time consuming.
Challenge / Requirement
Generation of images for autonmous car scenario sush as road marking, highways, streets, road
users et all.
Annotaion of the generated images for semantic segmentation and scenario labelling
Deliverables
Expected delivery is using generative AI we should be able to create the images for these scenarios
road users, road markings, high ways, streets etc.
Nature of Challenge
Product Related
AI (Others)
All major video streaming services or apps are competing to ensure the best quality of experience
(quick video start, no buffering, quality playback without Audio/Video issues etc). Video & audio
analysis for application in the area of Live video streaming on various OTT devices forms the
backbone of collecting metrics on the end user Quality of experience in the Video streaming world.
This is required to identify and benchmark golden batch of KPI's which the engineering team should
meet and improve.
Because of the various device platforms and complexities of video streaming (device hardware,
network speed, concurrency etc) and the ask to validate KPI or defects that only the human senses
(eyes & ears) can detect, the audio & video analysis on Video playback on devices has been a
challenge.
Challenge / Requirement
Develop compute vision based generic audio/video analysis algorithms leveraging AI/ML models to
come up with a best in class solution for Video & audio analysis on streaming devices
(mobiles/STB/Smart TV/Roku/FireTV/AppleTV etc) in Hitech & Media industry segment.
Deliverables
•Live Stream Video Analysis compute vision No Reference algorithm (Real time video monitoring to
identify anomalies like macro blocking, blur, motion blur, poor quality, colour distortions etc)
•No Reference compute vision algorithm for Video Analysis able to detect objects, patterns and
motion (no source video)
•Audio Analysis in video stream playback ( language detection, Audio quality metrics, Deliberate vs
unexpected audio Loss)
•Closed Captioning sync with Audio during video playback (no source video)
•video playback A/V Quality at Content-AD transition (video and audio anomalies when switching
between ad and content and vice versa) - (Ads are dynamic and not know prior)
14. Identification Of Flood Prone Area in Urban Settlement
Nature of Challenge
Product Related
AI (Others)
Background
India regularly witnesses severe floods during the monsoon season. Monsoons are becoming
more erratic because of climate change and global warming. Flood water can deviate from its
usual course due to various factors, such as obstacles, urban development, landscape changes, etc.
One of the challenge of Urban areas is to prepare for eventuality such as floods in NEW
areas and having safety measures in place to protect human lives and at the same time restrict
damages.
Challenge / Requirement
AI based predictive analysis solution to predict flood risk prone area based on aspects such as
change in landscape, obstacles et all. The solution should also be able to visualize the flood
water path possibilities by analyzing the landscape changes.
Deliverables
POC of the AI model which can identify obstacles, change in landscape etc from the images
Product Related
AI (Others)
Background
In Contruction segment contracting and sub-contracting there are multiple RFP submitted by
multiple vendors again any tendor, assessing those RFP in terms of technical, financial, legal and
work scope perspective is very time-consuming task for tendor issuing organization. We are
looking for GenAI based solution that can generate a summary report for all RFP submitted
against tendor in terms of financial efficiency, timeline, legal requirements, qualification criteria,
technical parameters and previous track records.
Challenge / Requirement
1. UI should have ability to upload multiple RFP documents against any given tendor document
2. It should Tabulaize the requirements from tendor and categorize them into legal, finance,
complaince etc
3. It should read content from all RFP from multiple vendors highlighting the vaguness of the
contractual requirements from corresponding RFP
5. It should recommend top 3 vendors with justification who has potential to qualify for contract
Deliverables
working POC
AI - Powered Project Management Assistance
Nature of Challenge
Product Related
AI (Others)
Background
Project management involves various tasks, such as planning, organizing, and monitoring a
project from initiation to completion. Project managers often face challenges, such as managing
project timelines, resources, and budgets.
Challenge / Requirement
Develop an AI-powered project management tool that analyzes project timelines, resource
allocation, and risks. It can suggest optimal schedules and identify potential bottlenecks.
Deliverables
A working prototype that can suggest optimal schedules and identify potential bottlenecks
through chat based interfaces.
AI (Software Engineering)
Nature of Challenge
Product Related
Background
Software testing is a way of checking if a desirable system works as expected and meets the
requirements. To do this, you need to create test cases, which are specific scenarios or situations
that you use to test the system. Test case generation is one of the most challenging stages in
software testing.In most of the cases, test case generation involves considerable manual efforts
prone to manual error.
Challenge / Requirement
Deliverables
- A set of test cases that are generated by the proposed solution and by the manual method, along
with their quality and coverage metrics, such as correctness, completeness, consistency,
traceability, etc.
- A presentation that summarizes the main points of the report and the prototype or demo, and
showcases the advantages and limitations of the proposed solution, productivity improvement
compared to the manual method.
This challenge statement focuses on reducing testing life cycle, effort reduction, testing time
span reduction, cost reduction, reducing human errors etc.
Nature of Challenge
Product Related
Background
The API ecosystem is currently burgeoning. Every application is now API-based, and API-driven
development is the norm. With the proliferation of APIs, there is a critical need to develop industry-
standard APIs that are designed for the long term and are easy to access. While API standards and
templates exist, no publicly available material will ensure an API meets all the standards of a good
API design. Developers spend days writing and modifying APIs, and there is no standard way of
developing APIs. The need for more valuable resources to create APIs is exacerbated as thousands
of new developers enter the workforce from academic institutions. Creating sustainable APIs that
follow all standards is thus a critical need for the software industry. Finally, the challenge is the need
for API is fast-paced and APIs need to be developed at a much higher rate than even a few years
back.
Challenge / Requirement
Create a generic LLM that can be trained on various APIs and can provide clear guidelines and code
to create a new API based on the specific prompts. The LLM should also have the ability to be
trained on newer APIs. The outputs should be well-defined APIs that can be used in any code.
Ideally the output APIs should be in the most common programming languages like Java, Python.
Go and Node.js
Deliverables
2. All code and models should be developed from scratch and not copied from any online/offline
sources
4. The LLM should follow all ethical AI, trustable AI and explainable AI criteria.
5. The LLM should have the ability to learn from new data and inputs.
17. AI assisted software development using LLMs
Nature of Challenge
Product Related
AI (Software Engineering)
Background
In the current process of software development, each stage (architecture, code development, testing
etc.) goes through a completely new start and hence takes significant time in overall delivery
schedule. Using LLM, the development cycle can be enhanced, enriched and also validated within
minimum time.
Challenge / Requirement
LLMs can have broader impact in terms of prompt engineering, close domain LLMs, private LLMs for
code development and testing to tackle the overall problem statement of software development cycle.
The solution referred here will be an AI assistance to the software developer.
Deliverables
An overall framework architecture diagram along with details about each block of the development
cycle where LLMs can be plugged into. It may also contain the developed microservices and whole
of the code containers to execute all the blocks of the software development cycle.
1. Prompt engineering
2. Type of LLMs
3. Accuracy
4. validation
Abstract Lev
AI (Software Engineering)
Background
Root cause analysis is one of challenging task considering collecting data from various sources in
various format infer insight from those data and create mind map for root cause analysis. This
specific problem becomes more complex when human has to read, infer and deduce root cause
from machine
Challenge / Requirement
We are looking for a system that must include LLM as component that should perform following
actions.
4. it can generate a mind map based on RCA techniques like 5 whys, Pareto analysis etc.
5. system should assign probability score for all potential root causes
Deliverables
Working Prototype
19. GenAI based end to end regression testing life cycle automation
Nature of Challenge
Product Related
AI (Software Engineering)
Regression testing is one of major time taker in testing lifecycle eventhough with advent of various
automation testing tools, creation and maintenance of automation script code are still a major
challenge, in addition tracing back automation script to manual test cases then to requirement is
cumbersome. in nutshell if we have genAI based solution if we provide requirement document then
can it create all related artifacts with minimal human validation.
Challenge / Requirement
3. It should generate following artifacts from requirement documents in order Test Scenario> test
case description> test steps> Automation test script > Execution log analysis report
4. system should have feature of human validation and modification at each steps
Deliverables
working prototype
Abstract Level
Have a Question / doubt
Nature of Challenge
Product Related
AI (Software Engineering)
Background
Develop an AI-driven code refactoring tool that suggests improvements to existing code. The
generative AI can analyze code quality, performance, and best practices to propose refactoring
solutions.
Deliverables
A working POC of the tool that can take a code as input and does code refactoring. The
generative AI can analyze code quality, performance, and best practices to propose refactoring
solutions.
Product Related
AI (Software Engineering)
Background
In the Python language ecosystem, there are over 1000s of libraries that available. The intricate
web of dependencies between them often gives rise to significant challenges within a working
environment. Additionally, the process of incorporating new versions of these libraries
invariably demands extra effort from developers. There is a need for a practical solutions in the
form of effective recommendations centered around compatible library versions that can aid the
developers
Challenge / Requirement
An AI based model designed to analyze various libraries listed in the requirements.txt file. The
model checks for compatibility, suggests appropriate library versions, scans the code, and offers
essential recommendations for identifying obsolete code or adapting to changes in functionality
within new versions.
Deliverables
AI ( Transportation)
Product Related
AI (Transportation)
Background
A interactive digital cockpit is the future of automotive interiors. A car can effcetively be seen
as a computer on wheel running millions of lines of software code. Software informs everything
from AD/ADAS to other configurable features. This is what leads to a software defined vehicle
in which the software informs almost everything. A Software-Defined Vehicle (SDV) is any
vehicle that manages its operations, adds functionality, and enables new features primarily or
entirely through software. Digital twin of a SDV can help in testing and monitoring before and
after deployment.
The digital twin of the HMI interface of Digital Cockpit encompasses the following features :
Instrument Cluster
Driver Cabin Monitoring
Heads up Display
Infotainment Systems
Challenge / Requirement
Building a Digital Twin of a Digital Cockpit for a software defined vehicle. The digital twin of
this HMI interface to encompass the following features :
The digital twin should include 3D responsive model, exploded view of scene composition,
Simulations and scenarios, AR/VR enablement.
Deliverables
Product Related
AI (Transportation)
Background
3D LIDAR is a device mounted on autonomous vehicles to scan area surrounding the car in a
definite range to gather scene information. Obstacle or objects in such a scanned scene are
captured as 3D point cloud and are not obvious to reveal the information of shape of such
obstacles or objects. Extracting such shape information as 3D key points will form unique
signature of the obstacle or object.
Challenge / Requirement
Develop and demonstrate a software algorithms that extracts shape information as 3D key points
of objects. These points can be stored in an output file with a suitable label.
Kitti data set of LIDAR point cloud from Velodyne LIDAR mounted on a travelling vehicle
available in public domain on internet.
Deliverables