I am trying to use the content form the new SRD files (5.2, 5.2.1) in some projects I am working on but having the data like creatures, spells, equipment, magic items, classes, etc. only in PDF format makes it almost impossible to use without having to re-write everything by hand. I have tried my hand at parsing the PDF and parsing a txt file generated from the PDF, but with PDF artifacts, Unicode characters, broken spacing, words, new lines, etc. it makes it impossible to get it 100% accurate. I mean WotC/D&DBeyond has to have the information in other formats to be able to generate the PDF, I just do not know why we do not have access to that to help create fan content much easier in the current time.
Is there a way that the SRD data can be published/provided to the community in JSON, CSV, spreadsheets, or some other format that can be used in technical capacities so that building databases, web pages, etc. of this information doesn't take weeks of copy-paste and formatting? Because this forces you to essentially re-write the entire SRD line-by-line for accuracy.
Alternatively, has anyone successfully built a parser for the new SRD's that they would be willing to share to solve this limitation?
Rollback Post to RevisionRollBack
To post a comment, please login or register a new account.
I am trying to use the content form the new SRD files (5.2, 5.2.1) in some projects I am working on but having the data like creatures, spells, equipment, magic items, classes, etc. only in PDF format makes it almost impossible to use without having to re-write everything by hand. I have tried my hand at parsing the PDF and parsing a txt file generated from the PDF, but with PDF artifacts, Unicode characters, broken spacing, words, new lines, etc. it makes it impossible to get it 100% accurate. I mean WotC/D&DBeyond has to have the information in other formats to be able to generate the PDF, I just do not know why we do not have access to that to help create fan content much easier in the current time.
Is there a way that the SRD data can be published/provided to the community in JSON, CSV, spreadsheets, or some other format that can be used in technical capacities so that building databases, web pages, etc. of this information doesn't take weeks of copy-paste and formatting? Because this forces you to essentially re-write the entire SRD line-by-line for accuracy.
Alternatively, has anyone successfully built a parser for the new SRD's that they would be willing to share to solve this limitation?