openrefine/googlerefine - reconcile two datasets












0















i'm in a situation with two json files: file A contains different datas, one of them contains numeric id. File B contains all the possible numeric ids linked with a vat number.In the rdf final file i'd like to replace the column of ids of the first file or add another column with the associated vat number. Any suggest is welcome, thank you



EDIT: file A structure (it's an array of data structured like this). The field 'suppliers' contains always a single value which is the id param i mentioned before



{
"coupon_number": 25422,
"url": "xxx",
"title": "Lorem ipsum dolor sit amet, duo ei accusam aliquando rationibus, sed id dolor sensibus delicatissimi.",
"suppliers": [
3043
],
}


file B structure (another array)



{
"id": 3043,
"vatNumber": "03918590401",
}


I need to link 'suppliers' with the vat number or replace it with the vat number










share|improve this question

























  • Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

    – Ettore Rizza
    Nov 23 '18 at 11:53













  • hope this makes things simplier, thank you

    – heartlex
    Nov 23 '18 at 12:07











  • i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

    – heartlex
    Nov 23 '18 at 16:43
















0















i'm in a situation with two json files: file A contains different datas, one of them contains numeric id. File B contains all the possible numeric ids linked with a vat number.In the rdf final file i'd like to replace the column of ids of the first file or add another column with the associated vat number. Any suggest is welcome, thank you



EDIT: file A structure (it's an array of data structured like this). The field 'suppliers' contains always a single value which is the id param i mentioned before



{
"coupon_number": 25422,
"url": "xxx",
"title": "Lorem ipsum dolor sit amet, duo ei accusam aliquando rationibus, sed id dolor sensibus delicatissimi.",
"suppliers": [
3043
],
}


file B structure (another array)



{
"id": 3043,
"vatNumber": "03918590401",
}


I need to link 'suppliers' with the vat number or replace it with the vat number










share|improve this question

























  • Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

    – Ettore Rizza
    Nov 23 '18 at 11:53













  • hope this makes things simplier, thank you

    – heartlex
    Nov 23 '18 at 12:07











  • i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

    – heartlex
    Nov 23 '18 at 16:43














0












0








0








i'm in a situation with two json files: file A contains different datas, one of them contains numeric id. File B contains all the possible numeric ids linked with a vat number.In the rdf final file i'd like to replace the column of ids of the first file or add another column with the associated vat number. Any suggest is welcome, thank you



EDIT: file A structure (it's an array of data structured like this). The field 'suppliers' contains always a single value which is the id param i mentioned before



{
"coupon_number": 25422,
"url": "xxx",
"title": "Lorem ipsum dolor sit amet, duo ei accusam aliquando rationibus, sed id dolor sensibus delicatissimi.",
"suppliers": [
3043
],
}


file B structure (another array)



{
"id": 3043,
"vatNumber": "03918590401",
}


I need to link 'suppliers' with the vat number or replace it with the vat number










share|improve this question
















i'm in a situation with two json files: file A contains different datas, one of them contains numeric id. File B contains all the possible numeric ids linked with a vat number.In the rdf final file i'd like to replace the column of ids of the first file or add another column with the associated vat number. Any suggest is welcome, thank you



EDIT: file A structure (it's an array of data structured like this). The field 'suppliers' contains always a single value which is the id param i mentioned before



{
"coupon_number": 25422,
"url": "xxx",
"title": "Lorem ipsum dolor sit amet, duo ei accusam aliquando rationibus, sed id dolor sensibus delicatissimi.",
"suppliers": [
3043
],
}


file B structure (another array)



{
"id": 3043,
"vatNumber": "03918590401",
}


I need to link 'suppliers' with the vat number or replace it with the vat number







openrefine google-refine






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 23 '18 at 12:06







heartlex

















asked Nov 23 '18 at 11:38









heartlexheartlex

318




318













  • Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

    – Ettore Rizza
    Nov 23 '18 at 11:53













  • hope this makes things simplier, thank you

    – heartlex
    Nov 23 '18 at 12:07











  • i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

    – heartlex
    Nov 23 '18 at 16:43



















  • Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

    – Ettore Rizza
    Nov 23 '18 at 11:53













  • hope this makes things simplier, thank you

    – heartlex
    Nov 23 '18 at 12:07











  • i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

    – heartlex
    Nov 23 '18 at 16:43

















Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

– Ettore Rizza
Nov 23 '18 at 11:53







Hello. Could you post a little sample of your data (by editing your question) ?This is always easier to do some tests and provide a good answer. Thanx!

– Ettore Rizza
Nov 23 '18 at 11:53















hope this makes things simplier, thank you

– heartlex
Nov 23 '18 at 12:07





hope this makes things simplier, thank you

– heartlex
Nov 23 '18 at 12:07













i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

– heartlex
Nov 23 '18 at 16:43





i tried this tutorial, but i get errors blog.ouseful.info/2011/05/06/…

– heartlex
Nov 23 '18 at 16:43












1 Answer
1






active

oldest

votes


















0














Basically, you have to create two projects based on your Json files, then perform a kind of Vlookup between them.



You mentioned a blog post that explains how to make a join in OpenRefine using the cell.cross() function, but this method is not used much anymore. Most users have downloaded the Vib-Bit plugin (the first one that can be downloaded from this page) that allow you to join them visually.



Just unzip the plugin into the webappsextensions folder of your OpenRefine directory, restart OpenRefine, and check "Edit Columns" -> "Add column(s) from another project".



Important detail: before making a join, transform the common columns containing numbers (in green) into strings (in black).



The following screencast shows the operations.



enter image description here






share|improve this answer
























  • thank you very much, i'm going to test it

    – heartlex
    Nov 26 '18 at 8:24






  • 1





    great, it works fine. thank you very much

    – heartlex
    Nov 26 '18 at 9:33











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53445998%2fopenrefine-googlerefine-reconcile-two-datasets%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









0














Basically, you have to create two projects based on your Json files, then perform a kind of Vlookup between them.



You mentioned a blog post that explains how to make a join in OpenRefine using the cell.cross() function, but this method is not used much anymore. Most users have downloaded the Vib-Bit plugin (the first one that can be downloaded from this page) that allow you to join them visually.



Just unzip the plugin into the webappsextensions folder of your OpenRefine directory, restart OpenRefine, and check "Edit Columns" -> "Add column(s) from another project".



Important detail: before making a join, transform the common columns containing numbers (in green) into strings (in black).



The following screencast shows the operations.



enter image description here






share|improve this answer
























  • thank you very much, i'm going to test it

    – heartlex
    Nov 26 '18 at 8:24






  • 1





    great, it works fine. thank you very much

    – heartlex
    Nov 26 '18 at 9:33
















0














Basically, you have to create two projects based on your Json files, then perform a kind of Vlookup between them.



You mentioned a blog post that explains how to make a join in OpenRefine using the cell.cross() function, but this method is not used much anymore. Most users have downloaded the Vib-Bit plugin (the first one that can be downloaded from this page) that allow you to join them visually.



Just unzip the plugin into the webappsextensions folder of your OpenRefine directory, restart OpenRefine, and check "Edit Columns" -> "Add column(s) from another project".



Important detail: before making a join, transform the common columns containing numbers (in green) into strings (in black).



The following screencast shows the operations.



enter image description here






share|improve this answer
























  • thank you very much, i'm going to test it

    – heartlex
    Nov 26 '18 at 8:24






  • 1





    great, it works fine. thank you very much

    – heartlex
    Nov 26 '18 at 9:33














0












0








0







Basically, you have to create two projects based on your Json files, then perform a kind of Vlookup between them.



You mentioned a blog post that explains how to make a join in OpenRefine using the cell.cross() function, but this method is not used much anymore. Most users have downloaded the Vib-Bit plugin (the first one that can be downloaded from this page) that allow you to join them visually.



Just unzip the plugin into the webappsextensions folder of your OpenRefine directory, restart OpenRefine, and check "Edit Columns" -> "Add column(s) from another project".



Important detail: before making a join, transform the common columns containing numbers (in green) into strings (in black).



The following screencast shows the operations.



enter image description here






share|improve this answer













Basically, you have to create two projects based on your Json files, then perform a kind of Vlookup between them.



You mentioned a blog post that explains how to make a join in OpenRefine using the cell.cross() function, but this method is not used much anymore. Most users have downloaded the Vib-Bit plugin (the first one that can be downloaded from this page) that allow you to join them visually.



Just unzip the plugin into the webappsextensions folder of your OpenRefine directory, restart OpenRefine, and check "Edit Columns" -> "Add column(s) from another project".



Important detail: before making a join, transform the common columns containing numbers (in green) into strings (in black).



The following screencast shows the operations.



enter image description here







share|improve this answer












share|improve this answer



share|improve this answer










answered Nov 23 '18 at 18:56









Ettore RizzaEttore Rizza

1,9802617




1,9802617













  • thank you very much, i'm going to test it

    – heartlex
    Nov 26 '18 at 8:24






  • 1





    great, it works fine. thank you very much

    – heartlex
    Nov 26 '18 at 9:33



















  • thank you very much, i'm going to test it

    – heartlex
    Nov 26 '18 at 8:24






  • 1





    great, it works fine. thank you very much

    – heartlex
    Nov 26 '18 at 9:33

















thank you very much, i'm going to test it

– heartlex
Nov 26 '18 at 8:24





thank you very much, i'm going to test it

– heartlex
Nov 26 '18 at 8:24




1




1





great, it works fine. thank you very much

– heartlex
Nov 26 '18 at 9:33





great, it works fine. thank you very much

– heartlex
Nov 26 '18 at 9:33


















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53445998%2fopenrefine-googlerefine-reconcile-two-datasets%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

404 Error Contact Form 7 ajax form submitting

How to know if a Active Directory user can login interactively

TypeError: fit_transform() missing 1 required positional argument: 'X'