Ask anyone who works with data and they'll tell you that combining Excel files or merging workbooks is part of their daily work.
A simple example: Let’s say you want to create a sales report and you have data for four different zones in four different files.
The very first thing you need to do is combine those files into one single workbook, and only then can you go on to create your report.
The point is: You need a method for merging these files. Say “YES” in the comment section if you want to know the best method for this.
Today in this post, I’m going to share with you the best way to merge data from multiple Excel files into a SINGLE workbook.
But, here's the kicker.
This post will teach you something you can use on real-world data problems, so make sure to read the entire post.
The Best Possible Way for Combining Excel Files by Merging data into ONE Workbook - POWER QUERY
Power Query is the best way to merge or combine data from multiple Excel files into a single file. You need to store all the files in a single folder and then use that folder to load data from those files into the Power Query editor. It also lets you transform that data while combining it.
It works something like this:
Note: For combining data from different Excel files, your data should be structured in the same way. That means every file should have the same columns with the same column headers.
To merge files, you can use the following steps:
- First of all, extract all the files from the sample folder and save that folder on the desktop (or wherever you want to save it).
- Now, the next thing is to open a new Excel workbook and open Power Query.
- For this, go to Data Tab ➜ Get & Transform Data ➜ Get Data ➜ From File ➜ From Folder.
- Here you need to locate the folder where you have files.
- In the end, click OK, and once you do, you’ll get a window listing all the files from the folder, just like below.
- Now, you need to combine data from these files, and for this click on “Combine & Edit”.
- From here, the next thing is to select the table that holds the data in all the workbooks, and yes, you’ll get a preview of it at the side of the window.
- Once you select the table, click OK. At this point, you have merged data from all the files into your Power Query editor and, if you look closely, you can see a new column with the names of the workbooks the data was extracted from.
- So, right-click on the column header and select “Replace Values”.
- Here in “Value to Replace” enter the text “.xlsx” and leave “Replace With” blank (the idea here is to remove the file extension from the name of the workbook).
- After that, double-click on the header and select “Rename” to enter a name for the column, i.e. Zone.
- At this point, your merged data is ready and all you need is to load it into your new workbook. So, go to the Home Tab and click on “Close & Load”.
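If you are curious what those clicks amount to in the formula language, here is a rough hand-written sketch in Power Query M. It is not the code Excel generates for you (that is longer and uses helper queries). The folder path and the table name "Table1" are assumptions, so swap in your own.

```powerquery
let
    // Hypothetical folder path: point this at wherever you saved the sample files.
    FolderPath = "C:\Users\you\Desktop\SalesFiles",
    // Hypothetical table name: use the table name shared by all your workbooks.
    TableName = "Table1",

    Source = Folder.Files(FolderPath),
    // Keep only Excel workbooks, in case the folder holds other files.
    ExcelFiles = Table.SelectRows(Source, each Text.Lower([Extension]) = ".xlsx"),
    // Pull the named table out of each workbook.
    WithData = Table.AddColumn(
        ExcelFiles,
        "Data",
        each Excel.Workbook([Content]){[Item = TableName, Kind = "Table"]}[Data]
    ),
    // Keep the file name next to its data.
    Kept = Table.SelectColumns(WithData, {"Name", "Data"}),
    // Stack everything into one table; column names are read from the first file.
    Expanded = Table.ExpandTableColumn(Kept, "Data", Table.ColumnNames(Kept{0}[Data])),
    // Strip the extension, so "north.xlsx" becomes "north".
    NoExtension = Table.ReplaceValue(Expanded, ".xlsx", "", Replacer.ReplaceText, {"Name"}),
    // Rename the file-name column to Zone.
    Renamed = Table.RenameColumns(NoExtension, {{"Name", "Zone"}})
in
    Renamed
```

You can paste something like this into Home Tab ➜ Advanced Editor in a blank query. Note that this sketch assumes your data tables don't already have a column called "Name", since that would clash with the file-name column.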
One more thing:
The column headers need to have the same names in every file, but here's the kicker: The order of the columns doesn’t matter. If column1 in north.xlsx is column2 in west.xlsx, Power Query will still match them, as long as the column names are the same.
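A tiny made-up example shows this matching by name. The two tables here are invented stand-ins for north.xlsx and west.xlsx, not the real sample data.

```powerquery
let
    // Made-up sample row standing in for north.xlsx.
    North = #table({"Product", "Amount"}, {{"Pen", 100}}),
    // Same column names, opposite order, standing in for west.xlsx.
    West = #table({"Amount", "Product"}, {{250, "Ink"}}),
    // Table.Combine lines columns up by name, not by position.
    Combined = Table.Combine({North, West})
in
    // Result has columns Product, Amount with rows (Pen, 100) and (Ink, 250).
    Combined
```

If a column name is spelled differently in one file, it ends up as a separate column filled with nulls for the other files, which is why the names have to match.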
Also, while combining files using Power Query you can use the worksheet name instead of the table name. Here you have "SalesData" as the worksheet name in all the files.
Select it, click on "Combine & Edit", and follow the same steps as in the method above.
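In formula terms, picking a worksheet rather than a table looks roughly like this for a single file. The path is a hypothetical one; only the "SalesData" sheet name comes from the sample files.

```powerquery
let
    // Hypothetical path to one of the sample workbooks.
    Source = Excel.Workbook(
        File.Contents("C:\Users\you\Desktop\SalesFiles\north.xlsx"),
        true  // treat the first row of each sheet as column headers
    ),
    // Pick the worksheet by name instead of a table.
    SalesData = Source{[Item = "SalesData", Kind = "Sheet"]}[Data]
in
    SalesData
```

The only real change from the table version is Kind = "Sheet" in place of Kind = "Table".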
Why Power Query is the Best Way to Merge Data into a Single File?
Merge Data from Multiple Workbooks When you don’t have the Same Name for Worksheets and data in Tables
This is the hard truth…
…that in some situations, the worksheets won’t share the same name and the data won’t always be in tables.
So, what should you do in that case?
…in this case, you need to know how to combine data from all the files, and I don’t want to miss sharing this with you.
...so without any further ado, let's get started.
- First of all, open the “From Folder” dialog box to locate the folder where you have all the files.
- Now in this dialog box, locate the folder and click OK.
- After that, click on “Edit” to edit the table.
- At this point, you will have a table like below in your Power Query editor.
- Next, select the first two columns of the table and click on “Remove Other Columns” from the right-click menu.
From here, we need to add a custom column to fetch data from the worksheets of the workbooks.
- For this, go to Add Column Tab and click on the “Custom Column” button. This will open the “Custom Column” dialog box.
- In the dialog box, enter =Excel.Workbook([Content]) and click OK.
…at this point you have a new column in the table, but next you need to extract the data from it.
- Now, click the expand button in the header of that newly added custom column and click OK to expand all the data into the table.
- Here you have the newly expanded table with some new columns.
- Now from this new table, delete all the columns except the third and fourth.
- So, click the expand button for the column “Custom.Data” and click OK.
The moment you click OK, you’ll get all the data from all the files into a single table…
…but you need to make some changes to it to make it PERFECT.
If you notice, the column headings are sitting in the data itself...
...so you need to add the column headings.
- To do this, double-click on each header and type a name, or right-click and select “Rename”.
Next, you need to exclude the heading rows that are in the data table.
- Now open any column’s filter option, unselect the heading name that appears in the column data, and click OK.
Now the data is ready to load into the worksheet, so go to the Home Tab and click on “Close & Load”.
Congratulations! You have just combined data from different workbooks (with different worksheet names and without any tables).
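For reference, the whole second method can be sketched as one hand-written M query. Treat it as an approximation of the steps above, not the exact code Excel records. The folder path and the headings "Date" and "Amount" are assumptions for illustration, so use your own.

```powerquery
let
    // Hypothetical folder path: replace with your own.
    Source = Folder.Files("C:\Users\you\Desktop\SalesFiles"),
    // Keep the first two columns: the file contents and the file name.
    Kept = Table.SelectColumns(Source, {"Content", "Name"}),
    // The custom column from the steps above: one row per sheet in each file.
    WithCustom = Table.AddColumn(Kept, "Custom", each Excel.Workbook([Content])),
    // Expand it; the new names are prefixed so they don't clash with "Name".
    Expanded = Table.ExpandTableColumn(
        WithCustom,
        "Custom",
        {"Name", "Data"},
        {"Custom.Name", "Custom.Data"}
    ),
    // Keep only the sheet name and the nested data.
    SheetAndData = Table.SelectColumns(Expanded, {"Custom.Name", "Custom.Data"}),
    // Raw sheets arrive with generic names (Column1, Column2, ...).
    DataColumns = Table.ColumnNames(SheetAndData{0}[Custom.Data]),
    Flat = Table.ExpandTableColumn(SheetAndData, "Custom.Data", DataColumns),
    // Hypothetical headings: rename as many columns as your data has.
    Renamed = Table.RenameColumns(Flat, {{"Column1", "Date"}, {"Column2", "Amount"}}),
    // Drop the heading rows that came through as data.
    NoHeaderRows = Table.SelectRows(Renamed, each [Date] <> "Date")
in
    NoHeaderRows
```

One thing to watch: if a workbook contains a table as well as a worksheet, Excel.Workbook may list both, and the same rows can show up twice. In that case, filter on the Kind column before expanding.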
This is also important:
At this point, you have merged the data into one table.
But there’s one thing you need to do…
…and that’s applying some formatting to it and making sure that it won’t go away when you update your data.
Here’s what you need to do…
- First of all, select the column where you have dates (as it is formatted as a number right now) and format it as a date.
- After that, widen all the columns to fit the data you have in them.
- Here you can also format amount and price as “Currency”.
But the next thing is to make this formatting stick.
- For this, go to the “Design Tab” and open Properties.
- Untick “Adjust Column Width” and tick “Preserve Cell Formatting”.
- Yes, that’s it.
Now you have a query in your workbook which can combine data from multiple files...
...and merge it into a single workbook...
...even if the worksheet name is not the same or if you don’t have tables.
And yes, you have also made the formatting stick.
In the end,
As I said, POWER QUERY is the real deal, and if you frequently combine data from multiple files then you must use this method…
…as it’s a ONE-TIME setup.
The most important thing is that when you use Power Query you can even clean the data from those files as well.
Which method do you use to combine data from multiple files?
Make sure to share your views with me in the comment section, I'd love to hear from you. And please, don’t forget to share this post with your friends, I am sure they will appreciate it.
About the Author
Puneet has been using Excel since his college days. He has helped thousands of people understand the power of spreadsheets and learn Microsoft Excel. You can find him online, tweeting about Excel, on a running track, or sometimes hiking up a mountain.