@matrix Should have learned more models designed for preprocessing data. It is remarkable how little most of trade school academia attempts to work on the actual design bottlenecks in industry.
@matrix All 2.5 million rows of data are correct except for the one where the title has a single comma in it and the other one that somehow has a newline character followed by an EOF.
And also about 20 of them have NULL where a value is required.
And a few are out of range of allowed values, but you can't snap them to the minimum or maximum, and you can't delete them. Figure it out!
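(For what it's worth, one way out that neither snaps nor deletes is to keep every row exactly as it is and attach a list of problems to it, so someone downstream can decide. A rough sketch only: the field names `title` and `score` and the 0 to 100 range are made up for illustration, not taken from any real dataset.)

```python
from typing import Any, Optional

# Hypothetical schema: which fields are required, and the allowed range for "score".
REQUIRED = ("title", "score")
SCORE_MIN, SCORE_MAX = 0, 100


def flag_row(row: dict[str, Optional[Any]]) -> dict[str, Any]:
    """Return a copy of the row with an 'issues' list; original values are untouched."""
    issues = []
    for field in REQUIRED:
        if row.get(field) is None:
            issues.append(f"{field}: missing required value")
    score = row.get("score")
    if score is not None and not (SCORE_MIN <= score <= SCORE_MAX):
        issues.append(f"score: {score} outside [{SCORE_MIN}, {SCORE_MAX}]")
    return {**row, "issues": issues}


rows = [
    {"title": "ok", "score": 42},
    {"title": None, "score": 7},          # NULL where a value is required
    {"title": "too big", "score": 250},   # out of range, kept as-is
]
flagged = [flag_row(r) for r in rows]
```

Nothing gets clamped or dropped; the bad rows are just labeled, which at least makes the "figure it out" part somebody's explicit decision.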
Oh and here's another dataset you need to merge that first one with; the columns are kind of sort of the same but there are a few extra columns on each dataset that the other one doesn't have. We don't want to lose any data, so combine it all together.
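(If "combine it all together" means stacking the rows rather than joining on a key, which the request doesn't actually say, the no-data-lost version is a union of the columns with blanks where a dataset lacks one. A minimal sketch with plain dicts; `merge_keep_all` and the sample columns are hypothetical names for illustration.)

```python
def merge_keep_all(a, b):
    """Stack two lists of dict rows over the union of their columns.

    Columns missing from a row are filled with None, so nothing is dropped.
    """
    columns = []
    for row in a + b:
        for key in row:
            if key not in columns:
                columns.append(key)  # preserve first-seen column order
    merged = [{col: row.get(col) for col in columns} for row in a + b]
    return columns, merged


# Hypothetical sample data: "year" exists only in the first set, "rating" only in the second.
first = [{"id": 1, "title": "A", "year": 2001}]
second = [{"id": 2, "title": "B", "rating": 4.5}]

columns, merged = merge_keep_all(first, second)
```

Here `columns` comes out as `id, title, year, rating`, and each row carries `None` for the column its source never had. Whether those `None`s are then allowed is, of course, the NULL problem all over again.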
Also can you make sure it's accessible over the network? You know, just set up the permissions and stuff for everyone on the team. Easy.