ºÚÁÏÍø

Research & Art

Best practices for AI use in research work

A practical collection of best practices for working with AI tools in research.
A banner that says "AI and Research Work"

Note: This page complements the Responsible use of Generative Artificial Intelligence in the research process guidelines with practical tips. The content is based on the many interactions with 1000+ researchers over the last three years of teaching "AI and Research Work" (Glerean, Silva) [][], the mandatory "Research Ethics for Doctoral Students" course (AI topics covered by Glerean, Solin, Rehbinder), and the CodeRefinery workshop on "Responsible Use of Generative AI in Assisted Coding" (Glerean, CodeRefinery) [][]. Do you want to expand these practical guidelines? Get in touch with Enrico Glerean and Aalto Data Agents (researchdata@aalto.fi). We are preparing a MOOC which will be openly available from September 2026. 

AI and research work

Artificial Intelligence (AI) tools, for example tools based on generative AI, are used throughout the research lifecycle: from literature search, to software coding, to editing manuscripts. Please review the Responsible use of Generative Artificial Intelligence in the research process guidelines if you did not already. This page helps you identify where "AI" can help you while being aware of the risks, within the principles of research integrity.

1. AI in your research: what is AI?

AI is a vague term, in the context of research AI can play three roles:

  • AI as a Topic: studying AI itself (e.g., human–chatbot interaction, new machine learning (ML) algorithms). Digitalisation and AI is  one of the key research areas at Aalto University.
  • AI as a Method: using ML or other AI methods as the analysis approach in your research (e.g., classification, prediction).
  • AI as a Tool: using software tools with an "AI" (often generative AI) component, to assist tasks that are not themselves about AI (e.g., proofreading, code debugging). 

For the majority of researchers, even those not studying AI as a topic or using it as a method, AI is indeed a tool. This guide focuses on (generative) AI as a tool.

2. Generative AI tools across the research lifecycle

Generative AI tools can assist at every stage of the research process:

Stage Examples of AI assistance
Planning Literature search, grant drafting, research question brainstorming
Data collection Survey design, data synthesis, simulations
Pre-processing Formatting, quality checks, data cleaning
Analysis Analysis code generation, qualitative annotations
Preservation Documentation, README generation, metadata
Sharing Manuscript drafting, press releases, socual media posts
Reuse Code documentation, dataset descriptions

Each use carries risk. There is no task that is always good or always bad with AI, it depends on the context and on the user of the AI tool. As a responsible researcher, your task is to evaluate the risks before deciding whether to proceed. If you are unsure, you can start a conversation with peers (e.g. with the data agents researchdata@aalto.fi) or just avoid using AI for that specific task. The figure below gives some examples on AI tools usage across the research lifecycle.

A visualisation of the table that is in the webpage: for each  step of the research lifecycle there are examples of AI use.

3. Avoid the three forms of research misconduct: how AI can enable fabrication, falsification, and plagiarism

Research misconduct in Finland (and broadly in academia) means fabrication, falsification, and plagiarism. If not used carefully, generative AI can be the ideal misconduct machine as it can very easily engage with any of the three malpractices. Familiarise with the risks as covered in the Responsible use of Generative Artificial Intelligence in the research process guidelines.

4. Evaluating risk: the expertise × output-risk matrix

Not all AI use is equal. If you are unsure, you can consider these two factors to determine how much caution is needed:

  1. Your expertise on the task you are delegating to AI
  2. The importance of the output: will it be submitted for peer review, or is it of low importance (e.g. a workshop webpage)?
  Low expertise High expertise
High risk (peer-review submission: text, code, figures, references) Use AI only to brainstorm: ask it what sources to read, get keyword suggestions, then go read the real sources.

Example: find relevant keywords or ask which statistical methods might fit your data, then go read about them yourself.
Use AI for simple delegated tasks you can fully review. Check everything as carefully as if you had retyped it yourself. This is where overconfidence is most dangerous.

Examples: generating short code snippets function-by-function. Text revision: Ask AI to mark suggested text changes in bold, so you can decide which ones to apply manually to your final text.
Low risk (event webpage, social media post, presentations) Use AI, but accept a non-zero chance of errors slipping through.

 Example: help with CSS for a workshop webpage or vibe-coding a small demo.
Delegate most of the work to AI. Verify that the output makes sense.

Example: generating documentation for scripts you wrote, or drafting an outline of a presentation from a transcription of your own recordings.

(matrix from Glerean, 2026, "AI and research work" in preparation)

Note 1: The sensitivity of your data increases the level of risks. With confidential or personal data you need to pay attention not only on how you use the tool, but also which tools you use.

Note 2: A special case that spans across levels is with generated software code when the proof can be formalised through automated unit tests, if the tests are well-defined and cover the relevant behaviour of the code. This can lower the risk level, but it shifts the quality requirements on the tests themselves.

Note 3: there might be other specific use cases that do not fall within this 2x2 matrix. You are the final responsible person who can evaluate if the use of AI is appropriate.

5. Choosing AI tools: data classification

The AI tool you use should match the sensitivity of the information you are sharing with it. 
Our guidelines (Recommendation 3) require using Aalto AI Assistant for anything beyond
 fully public data. The table below maps Aalto's four information classification levels
 to the appropriate AI tool choice, putting that requirement into practice.

Level Examples of data AI tool
Public Wikipedia content, published papers with CC license, public data Any AI tool is acceptable
Internal Meeting notes, expense reports, internal university pages Prefer approved institutional tools
Confidential Participant data, unpublished findings Use only tools with contractual data protection guarantees (like ai.aalto.fi), or run AI models locally (e.g. on Triton cluster)
Secret Data where a breach causes serious harm (e.g. medical records) No cloud AI tools, only local tools inside a Trusted Research Environment like SECDATA

Remember: when using non-approved tools, also consider the interaction itself as data: whatever you type into an AI system may be used for training or could be accessed by others. As a good practical example treat interactions with Meta AI, Grok (xAI), and DeepSeek as fully public, regardless of the data classification of what you share and regardless of what they promise when it comes to privacy. These tools either operate under permissive privacy policies that allow use of your inputs for model training, or are provided by organisations whose data practices cannot be independently verified.

And even with public data, disclosing unpublished research ideas to any AI service carries a non-zero risk of being scooped or of the idea being exposed to other users / public internet.

6. Disclosure of the use of AI

The Aalto Universiy guidelines on responsible use of AI in research are rooted in the ALLEA European Code of Conduct's four principles: reliability, honesty, respect, and accountability. In practice, these mean the following for your manuscript preparation:

What to declare?

As an example, here are (please check the recommendations of your publisher before submitting):

  • No declaration needed: fixing typos and grammar only
  • Declaration required: any synthesis of text, generation of code, creation of figures, suggestion of analysis approaches, or drafting of any section
  • AI-generated images: acceptable only for illustrating a pipeline or method; never as result figures or quantitative plots

Disclosure template

When declaring AI use, include a statement such as ():

> Title of section: Declaration of generative AI and AI-assisted technologies in the manuscript preparation process

> Statement: During the preparation of this work, the author(s) used [NAME OF TOOL / SERVICE] in order to [REASON]. After using this tool/service, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the published article.

7. Practical tips

Here a series of practical tips and use cases with AI tools and research work, in no specific order. 

8. Selected Q&A

Here some of the questions that have often come up in the research ethics course or in our "AI in research work" workshops.

9. Video lectures on the topic of AI and research

This section is only visible if you are logged in. The section contains videos from the Research Ethics Course for Doctoral students, by  Arno Solin (Ethics in AI Research), Enrico Glerean (Generative AI and Research Integrity), and Maria Rehbinder (AI Act). We are preparing a mooc with these videos and much more. Please get in touch if you want to contribute to the MOOC.

Conclusions

In the current era of Artificial Intelligence expanding in all aspects of our lives, responsible researchers are not those who avoid AI completely, nor those who delegate everything to it. Responsible researchers understand how these systems work, what they are delegating. They can verify the output, consider the data and legal risks, disclose AI use honestly, and preserve the core human skills needed for research: reading, reasoning, documentation, communication, and accountability. 

Responsible use of Generative Artificial Intelligence in the research process

These guidelines focus on one particular type of AI used in the research process: generative AI. The goal of these guidelines, originally provided by the European Research Area forum, is to prevent misuse and to ensure that generative AI plays a positive role as part of research practices.
The key principles for the responsible use of generative AI in research are:
• Reliability in ensuring the quality of research, reflected in the design, methodology, analysis and use of resources. This includes aspects related to verifying and reproducing the information produced by the AI for research. It also involves being aware of possible equality and non-discrimination issues in relation to bias and inaccuracies.
• Honesty in developing, carrying out, reviewing, reporting and communicating on research transparently, fairly, thoroughly and impartially. This principle includes disclosing that generative AI has been used.
• Respect for colleagues, research participants, research subjects, society, ecosystems, cultural heritage and the environment. Responsible use of generative AI should consider the limitations of the technology, its environmental impact16 and its societal effects (bias, diversity, non-discrimination, fairness and prevention of harm). This includes the proper management of information, respect for privacy, confidentiality and intellectual property rights, and proper citation.
• Accountability for the research from idea to publication, for its management and organisation, for training, supervision and mentoring, and for its wider societal impacts. This includes responsibility for all output that a researcher produces, underpinned by the notion of human agency and oversight.

Services
Cute white AI robot diagram showing data inputs, processing icons and green output symbols for tasks and tools

Coding assistants in Aalto

Coding assistants are AI-based tools that help you to write programming code, review it and make development processes faster.

Services
Digital illustration of clouds, arrows, and a microphone representing data flow and cloud computing.

Speech-to-Text

Speech-to-Text is a multilingual, AI-powered transcription service.

Services
Local AI llm gateway banner

Local generative Artificial Intelligence LLM gateway for research and teaching

A safer way for Aalto researchers, teachers and students to build tools with large language models within Aalto’s infrastructure.

News
Black circles and squares connected by dotted lines on a blue background.

Artificial intelligence and copyright

AI systems, AI models and copyright questions related to them.

Services
  • Updated:
  • Published:
Share
URL copied!