This repository was archived by the owner on Dec 12, 2024. It is now read-only.

Description
The current batch loader CAN correctly process CSV files that contain encoded text (for example, ﺎﺴﺘﻣﺍﺭﺓ ﺶﻛﻭﻯ, resulting in a valid JSON file that is ingested correctly by GW ScholarSpace's rake task. However, typical usage is that metadata is developed in Microsoft Excel, and the metadata file is saved from Excel as a CSV file. Saving as CSV results in a CSV file that garbles the encoded text.
Adding the ability to use an Excel-format metadata spreadsheet would avoid this loss of encoding information.
This should resolve #18 and #23 . This may also provide guidance for a (related, but not identical) solution to issues where https://github.com/gwu-libraries/etd-loader receives text from ProQuest that includes special characters.