0% found this document useful (0 votes)

44 views16 pages

Flipkart Business Analyst Interview Questions

The document outlines the interview process for a Business Analyst position at Flipkart, detailing key SQL and Python questions, data handling strategies, and estimation techniques. It includes examples of SQL queries for window functions, indexing, and data retrieval, as well as Python functions for handling missing data and finding sequences. Additionally, it discusses case studies and managerial questions related to project management and team disagreements.

Uploaded by

sathish

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views16 pages

Flipkart Business Analyst Interview Questions

Uploaded by

sathish

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

FLIPKART Business Analyst Interview

Experience (1-3 years)

CTC - 14 LPA
--------------------------------------------------------------

SQL
1. What are window functions, and how do they differ from aggregate
functions? Can you give a use case?
Explanation:

• Window Functions perform calculations across a set of table rows that are related
to the current row, without collapsing the rows into a single summary result.

• Aggregate Functions, on the other hand, return a single result for a group of rows,
reducing the number of rows in the result set.

Use Case:

If you want to calculate a running total or rank without losing the row-level granularity,
window functions are useful.

Example Query:

Use Case: Calculate the running total of sales for each salesperson.

Schema:

CREATE TABLE Sales (

SalesID INT,

SalesPerson VARCHAR(50),

SaleAmount INT,

SaleDate DATE
);

INSERT INTO Sales VALUES

(1, 'Alice', 200, '2025-01-01'),

(2, 'Alice', 150, '2025-01-02'),

(3, 'Bob', 300, '2025-01-01'),

(4, 'Bob', 100, '2025-01-03'),

(5, 'Alice', 250, '2025-01-03');

Query:

SELECT

SalesPerson,

SaleAmount,

SaleDate,

SUM(SaleAmount) OVER (PARTITION BY SalesPerson ORDER BY SaleDate) AS

RunningTotal

FROM Sales;

Result:

SalesPerson SaleAmount SaleDate RunningTotal

Alice 200 2025-01-01 200

Alice 150 2025-01-02 350

Alice 250 2025-01-03 600

Bob 300 2025-01-01 300

Bob 100 2025-01-03 400

2. Explain indexing. When would an index potentially reduce
performance, and how would you approach indexing strategy for a large
dataset?
Explanation:

• An index improves the speed of data retrieval operations by creating a structure (like
a B-tree) for faster lookups.

• Downside of Indexing:

o Indexes increase storage requirements.

o They slow down write operations (INSERT, UPDATE, DELETE) because the
index needs to be updated.

When Indexing May Reduce Performance:

• Small Tables: Full table scans are often faster than index lookups.

• Frequent Updates: When the table has frequent write operations, maintaining
indexes adds overhead.

• Unused Indexes: An index on columns rarely queried wastes resources.

Indexing Strategy for Large Datasets:

1. Index Frequently Queried Columns: Use indexes on columns often used in

WHERE, JOIN, GROUP BY, and ORDER BY clauses.

2. Avoid Over-Indexing: Index only what is necessary.

3. Composite Indexes: Create multi-column indexes for queries involving multiple

columns.

4. Regular Monitoring: Use tools like EXPLAIN or Query Analyzer to ensure indexes are
effective.

Example:

CREATE INDEX idx_customer_id ON Orders (CustomerID);

3. Write a query to retrieve customers who have made purchases in the

last 30 days but did not purchase anything in the previous 30 days.
Example Query:

Schema:

CREATE TABLE Purchases (

PurchaseID INT,

CustomerID INT,

PurchaseDate DATE

);

INSERT INTO Purchases VALUES

(1, 101, '2024-12-15'),

(2, 102, '2024-12-25'),

(3, 101, '2024-11-10'),

(4, 103, '2024-12-01'),

(5, 102, '2024-11-01');

Query:

WITH RecentPurchases AS (

SELECT CustomerID

FROM Purchases

WHERE PurchaseDate >= CURDATE() - INTERVAL 30 DAY

PreviousPurchases AS (

SELECT CustomerID

FROM Purchases

WHERE PurchaseDate BETWEEN CURDATE() - INTERVAL 60 DAY AND CURDATE() -

INTERVAL 30 DAY

)
SELECT DISTINCT CustomerID

FROM RecentPurchases

WHERE CustomerID NOT IN (SELECT CustomerID FROM PreviousPurchases);

4. Given a table of transactions, find the top 3 most purchased products

for each category.
Example Query:

Schema:

CREATE TABLE Transactions (

TransactionID INT,

ProductName VARCHAR(50),

Category VARCHAR(50),

Quantity INT

);

INSERT INTO Transactions VALUES

(1, 'ProductA', 'Category1', 10),

(2, 'ProductB', 'Category1', 5),

(3, 'ProductC', 'Category1', 20),

(4, 'ProductD', 'Category2', 15),

(5, 'ProductE', 'Category2', 10),

(6, 'ProductF', 'Category2', 25);

Query:

WITH RankedProducts AS (

SELECT

Category,
ProductName,

Quantity,

RANK() OVER (PARTITION BY Category ORDER BY Quantity DESC) AS Rank

FROM Transactions

SELECT

Category,

ProductName,

Quantity

FROM RankedProducts

WHERE Rank <= 3;

Result:

Category ProductName Quantity

Category1 ProductC 20

Category1 ProductA 10

Category1 ProductB 5

Category2 ProductF 25

Category2 ProductD 15

Category2 ProductE 10

5. How would you identify duplicate records in a large dataset, and how
would you remove only the duplicates, retaining the first occurrence?
Example Query:

Schema:

CREATE TABLE Employees (

EmployeeID INT,

Name VARCHAR(50),

Department VARCHAR(50)

);

INSERT INTO Employees VALUES

(1, 'Alice', 'HR'),

(2, 'Bob', 'Finance'),

(3, 'Alice', 'HR'),

(4, 'Charlie', 'IT'),

(5, 'Bob', 'Finance');

Query to Identify Duplicates:

SELECT Name, Department, COUNT(*)

FROM Employees

GROUP BY Name, Department

HAVING COUNT(*) > 1;

Query to Remove Duplicates (Retain First Occurrence):

WITH CTE AS (

SELECT

EmployeeID,

Name,

Department,

ROW_NUMBER() OVER (PARTITION BY Name, Department ORDER BY EmployeeID) AS

RowNum

FROM Employees

)
DELETE FROM Employees

WHERE EmployeeID IN (

SELECT EmployeeID

FROM CTE

WHERE RowNum > 1

);

PYTHON
1. Write a Python function to find the longest consecutive sequence of
unique numbers in a list.
Explanation:

The problem is to find the longest subarray where all the elements are unique and
consecutive. This can be solved using a sliding window technique:

1. Use a set to track unique elements in the current window.

2. Use two pointers (start and end) to expand and contract the window as needed.

3. Update the maximum length of the subarray when the condition is met.

Code:

def longest_consecutive_sequence(nums):

if not nums:

return 0, []

unique_set = set()

start = 0

max_length = 0

longest_seq = []
for end in range(len(nums)):

while nums[end] in unique_set:

unique_set.remove(nums[start])

start += 1

unique_set.add(nums[end])

current_length = end - start + 1

if current_length > max_length:

max_length = current_length

longest_seq = nums[start:end + 1]

return max_length, longest_seq

# Example usage

nums = [1, 2, 3, 1, 4, 5, 6, 2, 7, 8]

length, sequence = longest_consecutive_sequence(nums)

print("Longest Length:", length)

print("Longest Sequence:", sequence)

Example Output:

For the input [1, 2, 3, 1, 4, 5, 6, 2, 7, 8]:

mathematica

CopyEdit

Longest Length: 6

Longest Sequence: [1, 4, 5, 6, 2, 7]

2. If you’re working with a large dataset with missing values, what Python
libraries would you use to handle missing data, and why?
Libraries to Use:

1. pandas:

o Provides powerful tools like fillna(), dropna(), and isnull() to handle missing
data effectively.

o Suitable for structured datasets like dataframes.

2. numpy:

o Provides numpy.nan for identifying missing values in arrays and operations

like np.isnan() to detect and manipulate them.

o Useful for numerical computations with arrays.

3. scikit-learn:

o Offers the SimpleImputer and IterativeImputer classes for statistical

imputation.

o Best for preprocessing data before machine learning tasks.

4. pyjanitor (optional):

o Built on top of pandas, it simplifies cleaning operations, including handling

missing data.

Examples:

Example Dataset:

import pandas as pd

import numpy as np

data = {

"Name": ["Alice", "Bob", "Charlie", None],

"Age": [25, np.nan, 30, 22],

"Salary": [50000, 60000, None, 45000]

}

df = pd.DataFrame(data)

print("Original Dataset:\n", df)

Example 1: Using pandas

# Drop rows with missing values

df_dropped = df.dropna()

print("\nAfter Dropping Missing Values:\n", df_dropped)

# Fill missing values with a default value

df_filled = df.fillna({

"Name": "Unknown",

"Age": df["Age"].mean(),

"Salary": df["Salary"].median()

})

print("\nAfter Filling Missing Values:\n", df_filled)

Example 2: Using scikit-learn

from sklearn.impute import SimpleImputer

# Impute missing values in the "Age" and "Salary" columns

imputer = SimpleImputer(strategy="mean")

df[["Age", "Salary"]] = imputer.fit_transform(df[["Age", "Salary"]])

print("\nAfter Imputation Using Scikit-learn:\n", df)

Example Output:

Original Dataset:

Name Age Salary

0 Alice 25.0 50000.0

1 Bob NaN 60000.0

2 Charlie 30.0 NaN

3 None 22.0 45000.0

After Dropping Missing Values:

Name Age Salary

0 Alice 25.0 50000.0

After Filling Missing Values:

Name Age Salary

0 Alice 25.000000 50000.0

1 Bob 25.666667 60000.0

2 Charlie 30.000000 47500.0

3 Unknown 22.000000 45000.0

After Imputation Using Scikit-learn:

Name Age Salary

0 Alice 25.0 50000.0

1 Bob 25.7 60000.0

2 Charlie 30.0 47500.0

3 None 22.0 45000.0

Guesstimates
1. Estimate the number of online food delivery orders in a large
metropolitan city over a month:
• Assume population:
o Large metropolitan city population ≈ 10 million.

o Online food delivery users ≈ 40% (4 million).

• Assume order frequency per user:

o Regular users (20%): 15 orders/month.

o Occasional users (80%): 4 orders/month.

• Estimate orders:

o Regular users: 0.2 × 4M × 15 = 12M orders.

o Occasional users: 0.8 × 4M × 4 = 12.8M orders.

• Total monthly orders:

o 12M + 12.8M = 24.8M orders/month.

2. How many customer service calls would a telecom company receive

daily for a customer base of 1 million?
• Assume complaint rate:

o Active users (95% of 1M): 950,000.

o Daily issue rate: 2%.

• Breakdown of issues:

o Billing (30%): 5,700 calls/day.

o Network (40%): 7,600 calls/day.

o Other (30%): 5,700 calls/day.

• Total daily calls:

o 950,000 × 0.02 = 19,000 calls/day.

Case Studies
1. A sudden decrease in conversion rate is observed in a popular product
category. How would you investigate the cause and propose solutions?
• Data Analysis:

o Analyze traffic sources for sudden drops.

o Compare conversion rates across platforms and devices.

o Study user demographics and behavior trends.

• Operational Factors:

o Check inventory and pricing issues.

o Identify recent policy changes (return policies, delivery fees).

o Monitor seasonal trends or competitor campaigns.

• Technical Investigation:

o Audit website or app performance (loading time, errors).

o Identify bugs in the checkout process.

• Solutions:

o Optimize product pages and pricing.

o Run A/B testing for checkout improvements.

o Launch targeted marketing campaigns.

2.Imagine the company is considering adding a new subscription model.

How would you evaluate its potential impact on customer lifetime value
and revenue?
• Market Research:

o Survey customer willingness to pay for subscriptions.

o Study competitor subscription offerings.

• Revenue Impact:

o Calculate incremental revenue from expected subscribers.

o Factor in cannibalization of existing one-time purchases.

• CLV Analysis:

o Assess changes in average customer tenure.

o Include upsell opportunities and churn rate reduction.

• Implementation Feasibility:

o Evaluate operational costs for managing subscriptions.

o Plan loyalty benefits and exclusivity perks.

Managerial Questions
1. Describe a time when you faced conflicting priorities on a project. How
did you manage your workload to meet deadlines?
Managing conflicting priorities on a project:

• Identify and prioritize:

o Break tasks into urgent vs. important.

o Align priorities with organizational goals.

• Delegate and negotiate:

o Reassign tasks to team members based on expertise.

o Negotiate extended deadlines or resource allocation.

• Time management:

o Use tools like Gantt charts or Kanban boards.

o Allocate focused time slots for high-priority tasks.

• Communicate effectively:

o Update stakeholders on progress and challenges.

o Seek guidance from managers to resolve bottlenecks.

2. How would you handle a disagreement within the team on an analytical

approach?
Handling a disagreement within the team on an analytical approach:

• Encourage open dialogue:

o Facilitate a brainstorming session to hear all perspectives.

o Promote a culture of constructive feedback.

• Use data as the arbitrator:

o Test multiple approaches with sample data.

o Choose the method that delivers the best outcome.

• Promote collaboration:

o Merge ideas into a hybrid approach if feasible.

o Assign team members to analyze pros and cons objectively.

• Escalate if needed:

o Seek guidance from a senior manager if the disagreement persists.

o Ensure the final decision aligns with organizational goals.

12 IP HY BluePrint-Assignment 2024
No ratings yet
12 IP HY BluePrint-Assignment 2024
9 pages
(Viral) Kamal Kaur Viral Video Original Link
No ratings yet
(Viral) Kamal Kaur Viral Video Original Link
5 pages
XII Sample Paper Complete Syllabus-1 Answer Key
No ratings yet
XII Sample Paper Complete Syllabus-1 Answer Key
16 pages
XII Practical File 2023-24
No ratings yet
XII Practical File 2023-24
35 pages
EXL Data Analyst Interview Questions
No ratings yet
EXL Data Analyst Interview Questions
43 pages
QP Xii Ip Hy 2023-24
No ratings yet
QP Xii Ip Hy 2023-24
9 pages
Myntra Data Analyst Interview Questions
No ratings yet
Myntra Data Analyst Interview Questions
34 pages
IP Class 12 Practical Questions
No ratings yet
IP Class 12 Practical Questions
20 pages
IP Practicals
No ratings yet
IP Practicals
30 pages
PRACTICAL LIST CLASS-XII (INFO. PRACTICALS - fINAL PDF
100% (1)
PRACTICAL LIST CLASS-XII (INFO. PRACTICALS - fINAL PDF
8 pages
65 (Hs Xii A SC Com Ip 22)
No ratings yet
65 (Hs Xii A SC Com Ip 22)
11 pages
Xii Ip Hy 23 24
No ratings yet
Xii Ip Hy 23 24
13 pages
Class Xii Ip - Sahodaya Set2 2023
No ratings yet
Class Xii Ip - Sahodaya Set2 2023
13 pages
Sample Questions For XII IP
No ratings yet
Sample Questions For XII IP
59 pages
Ip Practical Revised Paper
No ratings yet
Ip Practical Revised Paper
9 pages
Yash Dbms
No ratings yet
Yash Dbms
56 pages
IP 12th
No ratings yet
IP 12th
45 pages
CBSE Class 6 Maths Practice Worksheets
100% (1)
CBSE Class 6 Maths Practice Worksheets
2 pages
Xii PB Ip 2019-20 QP Set-H PDF
No ratings yet
Xii PB Ip 2019-20 QP Set-H PDF
6 pages
Walmart Data Analyst Interview Experience
No ratings yet
Walmart Data Analyst Interview Experience
10 pages
Theories in Nursing Informatics
No ratings yet
Theories in Nursing Informatics
31 pages
XIIInfo Pract S E 273
No ratings yet
XIIInfo Pract S E 273
8 pages
Interview Questions
No ratings yet
Interview Questions
24 pages
Kis W Class 12 Practical File
No ratings yet
Kis W Class 12 Practical File
31 pages
Xii Ip Ncert 065-Book Questions
No ratings yet
Xii Ip Ncert 065-Book Questions
18 pages
12 CS Set A Anskey
No ratings yet
12 CS Set A Anskey
16 pages
Lab Programmes Adwaith
No ratings yet
Lab Programmes Adwaith
18 pages
HCLTech
No ratings yet
HCLTech
5 pages
Exercises On Connectors
No ratings yet
Exercises On Connectors
4 pages
1.lab Manual Class Xii 2020-21
0% (1)
1.lab Manual Class Xii 2020-21
6 pages
Practice Questions 2024
No ratings yet
Practice Questions 2024
8 pages
Wipro Data Analyst Interview Questions
No ratings yet
Wipro Data Analyst Interview Questions
29 pages
SET1 Ans
No ratings yet
SET1 Ans
7 pages
Kendriya Vidyalaya Sangathan: Kolkata Region First Preboard E Informatics Practices New (065) - Class Xii
No ratings yet
Kendriya Vidyalaya Sangathan: Kolkata Region First Preboard E Informatics Practices New (065) - Class Xii
15 pages
Marketing Principles
No ratings yet
Marketing Principles
54 pages
Accounting - Study Plan
No ratings yet
Accounting - Study Plan
1 page
Pratice Paper 6
No ratings yet
Pratice Paper 6
7 pages
Ip Sample Paper 6 Answer Key
No ratings yet
Ip Sample Paper 6 Answer Key
6 pages
12 Ip PP1 MS
No ratings yet
12 Ip PP1 MS
8 pages
12 22 23sp Informaticspractices
No ratings yet
12 22 23sp Informaticspractices
17 pages
12 Ip
No ratings yet
12 Ip
4 pages
Half Yearly Examination 2022-23 PT2: Class XII
No ratings yet
Half Yearly Examination 2022-23 PT2: Class XII
7 pages
HEALTHCARE
No ratings yet
HEALTHCARE
3 pages
12 Practice Building Tips
No ratings yet
12 Practice Building Tips
7 pages
Grade 2, Module 2 (45 Pages)
No ratings yet
Grade 2, Module 2 (45 Pages)
45 pages
SQL 1737456396
No ratings yet
SQL 1737456396
17 pages
Practical 12
No ratings yet
Practical 12
6 pages
STD Xii-Tee - Ip
No ratings yet
STD Xii-Tee - Ip
9 pages
Xii Ip CHN 02 QP
No ratings yet
Xii Ip CHN 02 QP
5 pages
Xii Ip Half Yearly
No ratings yet
Xii Ip Half Yearly
4 pages
Surfnews
No ratings yet
Surfnews
5 pages
Sample Paper-2
No ratings yet
Sample Paper-2
8 pages
Attachment 14940535 2 4 - S-GATE - Presentation
No ratings yet
Attachment 14940535 2 4 - S-GATE - Presentation
14 pages
Class 12 Ip Pre Board Set A
No ratings yet
Class 12 Ip Pre Board Set A
9 pages
Informatics Practices-Sahodaya QP New
No ratings yet
Informatics Practices-Sahodaya QP New
15 pages
Ip MS
No ratings yet
Ip MS
6 pages
QP-1PB-IP-2024 Set 1
No ratings yet
QP-1PB-IP-2024 Set 1
9 pages
12th - QPAPER - Half Yearly 2023
No ratings yet
12th - QPAPER - Half Yearly 2023
9 pages
Marketing Plan: de La Salle University - Dasmariñas
No ratings yet
Marketing Plan: de La Salle University - Dasmariñas
16 pages
Type VR Vacuum Circuit Breaker Interruptor Automático Al Vacío Tipo VR Disjoncteur Sous Vide Type VR
No ratings yet
Type VR Vacuum Circuit Breaker Interruptor Automático Al Vacío Tipo VR Disjoncteur Sous Vide Type VR
113 pages
Xii Ip QP
No ratings yet
Xii Ip QP
8 pages
CFE
No ratings yet
CFE
5 pages
Xii Ip CHN 03 QP
No ratings yet
Xii Ip CHN 03 QP
6 pages
Hpfs Instruments India LLP
No ratings yet
Hpfs Instruments India LLP
25 pages
ATO Tutorials
100% (1)
ATO Tutorials
36 pages
Ba01572cen 0320
No ratings yet
Ba01572cen 0320
16 pages
Q - Pratical Program 24 - 25
No ratings yet
Q - Pratical Program 24 - 25
6 pages
PB 1 IP Answer Key 2024
No ratings yet
PB 1 IP Answer Key 2024
6 pages
SAP Material Training
No ratings yet
SAP Material Training
37 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
5 pages
Set 2
No ratings yet
Set 2
2 pages
Ip Sample Paper 1
No ratings yet
Ip Sample Paper 1
6 pages
Pre Cal Circle
No ratings yet
Pre Cal Circle
16 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
7 pages
Management Education in India
No ratings yet
Management Education in India
22 pages
Menu Bela Terbaru 2023
No ratings yet
Menu Bela Terbaru 2023
10 pages
InformaticsPractices SQP
No ratings yet
InformaticsPractices SQP
9 pages
Maths - Matrices - Matrices Multiplication Symmetric - Skew-Symmetric - Assingment - 9 June 2020
100% (1)
Maths - Matrices - Matrices Multiplication Symmetric - Skew-Symmetric - Assingment - 9 June 2020
2 pages
Spark Fun
No ratings yet
Spark Fun
1 page
Sahil - Shamra - TCA NDA Form
No ratings yet
Sahil - Shamra - TCA NDA Form
2 pages
09 Dcio Ib Fictfee Eng 30082024
No ratings yet
09 Dcio Ib Fictfee Eng 30082024
2 pages
Datasheet SX95
No ratings yet
Datasheet SX95
1 page
HTML & SQL Programmes
No ratings yet
HTML & SQL Programmes
4 pages
HDCS DS
No ratings yet
HDCS DS
4 pages
Summer 2022
No ratings yet
Summer 2022
2 pages
Lesson Plan: Veer Surendra Sai University of Technology
No ratings yet
Lesson Plan: Veer Surendra Sai University of Technology
2 pages
Ans IP AISSCE Practical Exam 2023
No ratings yet
Ans IP AISSCE Practical Exam 2023
7 pages
Soap, Fatty Acids, and Synthetic Detergents: Janine Chupa, Steve Misner, Amit Sachdev, and George A. Smith
No ratings yet
Soap, Fatty Acids, and Synthetic Detergents: Janine Chupa, Steve Misner, Amit Sachdev, and George A. Smith
2 pages
TEDxYouth Programme
No ratings yet
TEDxYouth Programme
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.