Should we develop an MCP Server for OpenRefine?

b2m · April 10, 2025, 6:37am

At the end of last year, Anthropic released a standard for how LLMs can interact with tools via a common protocol.

Name of the protocol: Model Context Protocol (MCP)
Announcement by Anthropic: https://www.anthropic.com/news/model-context-protocol
Project website: https://modelcontextprotocol.io/
List of implementations: https://github.com/modelcontextprotocol/servers

As most of the functions in OpenRefine can be controlled via API calls, developing an MCP server for OpenRefine that maps these API calls to the protocol does not seem that complicated.
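To make the mapping idea concrete, here is a minimal sketch in Python of how tool names could be translated into OpenRefine command URLs. It does not use any MCP library and sends no requests. The tool names (`list_projects`, `get_rows`), the `ToolSpec` type, and `build_request` are hypothetical, and the base address and the `/command/core/...` paths are assumptions that should be checked against the OpenRefine API documentation.

```python
from dataclasses import dataclass
from urllib.parse import urlencode

# Assumption: OpenRefine is running locally on its usual default port.
BASE_URL = "http://127.0.0.1:3333"


@dataclass(frozen=True)
class ToolSpec:
    """Hypothetical description of one tool exposed to the LLM."""
    command: str      # OpenRefine command name (assumed, verify against the docs)
    method: str       # HTTP method used for the command
    description: str  # text an LLM would see when choosing a tool


# Hypothetical tool table: one entry per OpenRefine command we want to expose.
TOOLS = {
    "list_projects": ToolSpec("get-all-project-metadata", "GET",
                              "List all projects and their metadata"),
    "get_rows": ToolSpec("get-rows", "GET",
                         "Fetch a window of rows from a project"),
}


def build_request(tool_name, arguments):
    """Translate a tool call into (HTTP method, URL) without sending it."""
    spec = TOOLS[tool_name]
    url = f"{BASE_URL}/command/core/{spec.command}"
    if arguments:
        # Tool arguments become query parameters of the command URL.
        url += "?" + urlencode(arguments)
    return spec.method, url


if __name__ == "__main__":
    print(build_request("list_projects", {}))
    print(build_request("get_rows", {"project": "123", "start": 0, "limit": 5}))
```

A real server would additionally have to register these tools with an MCP SDK, send the requests, and handle details such as CSRF tokens for commands that modify data.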

What do you think?

Rory · April 10, 2025, 3:34pm

I'm not always up to date on LLM activity, but I think this sounds like a good idea. Would it be possible to build this as an extension?
Relatedly, I've been spending some time trying to improve documentation for Butterfly, the web framework OpenRefine uses. Together, these two things seem like a good opportunity to motivate some work to standardize the OpenRefine API.

b2m · April 11, 2025, 6:18am

I think so... I guess it would also be possible to run the MCP server completely independently of OpenRefine, similar to the OpenRefine clients. But an extension would make it easier for people to install and use it, as they would not have to maintain a separate tech stack.

I am not familiar with the Butterfly framework (yet). But better documenting and standardizing the OpenRefine API was something we identified as a desideratum at the last BarCamp.

One of the problems we might face is that a lot of functionality in OpenRefine is a command combined with an expression. This will either blow up the MCP server API or hand the burden of generating a suitable expression to the LLM.
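A small Python sketch of that trade-off: one generic tool where the LLM has to write the expression itself, versus several narrow tools that hide the expression but multiply the API surface. All function names are hypothetical, and the shape of the operation JSON (`core/text-transform` with `engineConfig`, `onError`, and so on) is an assumption modelled on what OpenRefine exports in its operation history, so it should be verified before use.

```python
def text_transform(column, expression):
    """Build an operation dict for a text transform (assumed JSON shape)."""
    return {
        "op": "core/text-transform",
        "engineConfig": {"facets": [], "mode": "row-based"},
        "columnName": column,
        "expression": expression,
        "onError": "keep-original",
        "repeat": False,
        "repeatCount": 10,
    }


# Option A: one generic tool. The API stays small, but the LLM must
# produce a valid expression (here GREL) on its own.
def transform_column(column, expression):
    return text_transform(column, expression)


# Option B: many narrow tools. The LLM never sees an expression, but
# every common transformation needs its own tool.
def trim_whitespace(column):
    return text_transform(column, "grel:value.trim()")


def to_uppercase(column):
    return text_transform(column, "grel:value.toUppercase()")


if __name__ == "__main__":
    # Both options end up producing the same operation.
    generic = transform_column("name", "grel:value.trim()")
    narrow = trim_whitespace("name")
    print(generic == narrow)
```

A mix of both is also conceivable: a handful of narrow tools for the most common cases plus the generic tool as a fallback.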

But the growing support for other languages as expression providers will hopefully make it easier for standard LLMs to generate suitable expressions, e.g. in JavaScript or Python 3.

closedLoop · May 12, 2025, 7:24pm

Has there been any more discussion about this? I'd be happy to help move this forward if others are interested.

Rory · May 13, 2025, 6:25pm

Hi @closedLoop, welcome to the forum! I haven't seen any other discussion around this. I don't know much about the protocol itself and don't regularly work with LLMs so I don't feel well-suited to be the one driving this forward, but I'd welcome a proposal from you or anyone else interested in keeping this moving.
