Lecture 4
Lecture 4
Communications
• General communication skills:
– Oral and written
– Formal and informal
– Talk to people with different level of backgrounds
• English proficiency
• Get used to talking to people from different fields
2
Persistence
• Work only on topics that you are passionate about
• Work only on hypothesis that you believe in
• Don’t draw negative conclusions prematurely and
give up easily
– positive results may be hidden in negative results
– In many cases, negative results don’t completely reject
a hypothesis
Unknown
Good applications
Not interesting
Known for research
“entry point”
problems Impact/Usefulness
7
Optimizing “Research Return”:
Pick a Problem Best for You
High (Potential)
Impact
Your
Passion
Your Strength
Best problems for you
Find your passion: If you don’t have to work/study for money, what would you do?
Test of impact: If you are given $1M to fund a research project, what would you fund?
Find your strength/Avoid your weakness: What are you (not) good at?
8
How to Find a Problem?
• Application-driven (Find a nail, then make a hammer)
– Identify a need by people/users that cannot be satisfied well
currently (“complaints” about current data/information
management systems?)
– How difficult is it to solve the problem?
• No big technical challenges: do a startup
• Lots of big challenges: write a research proposal
– Identify one technical challenge as your topic
– Formulate/frame the problem appropriately so that you can solve
it
• Every time you reframe a problem, try to do all the three tests
again.
14
Frame a New Computation Task
• Define basic concepts
• Specify the input
• Specify the output
• Specify any preferences or constraints
15
From a new application to
a clearly defined research problem
• Try to picture a new system, thus clarify what new functionality is to be
provided and what benefit you’ll bring to a user
• Among all the system modules, which are easy to build and which are
challenging?
• Pick a challenge and try to formalize the challenge
– What exactly would be the input?
– What exactly would be the output?
Unknown
Known
Impact/Usefulness
17
“Short-Cut” for starting Research
• Scan most recently published papers to find papers that you like or can
understand
• Carefully study existing literature to figure our where exactly you can make
a new contribution (what do you want others to cite your work as?)
• The more specialized a hypothesis is, the more likely it’s new, but a narrow
hypothesis has lower impact than a general one, so try to generalize as
much as you can to increase impact
• But avoid over-generalize (must be supported by your experiments)
• Tuning hypotheses 20
Procedure of Hypothesis Testing
• Clearly define the hypothesis to be tested (include
any necessary conditions)
22
Design the Right Experiments
• Flawed experiment design is a common cause of rejection of
an IR paper (e.g., a poorly chosen baseline)
• The data should match the hypothesis
– A general claim like “method A is better than B” would need a
variety of representative data sets to prove