Which dataset should be used for Photon's media bias action?
Photon has a feature on links that pulls data from Media Bias Fact Check to show the source's media bias. However, I've seen controversy around lemmy.world's bot over this, and I'm not sure if this is the best place to get the data from.
Should I use a different dataset, like AllSides'?
TBH, I think you'd get the same controversy no matter what bias / fact checker you integrate. I've had MBFC embedded for close to a year now, and while I won't say it's perfect, the consistent "controversy" common to it and lemmy.world's bot is blown wildly out of proportion and basically boils down to people getting upset about:
Having already blazed this trail, my advice is, if you have more than one source and it's practical, add both for extra coverage and make it optional so the people who would shriek about it can just turn it off.
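A rough sketch of what I mean, in TypeScript. Every name here (`BiasSettings`, `lookupBias`, the dataset shape) is made up for illustration and is not Photon's, MBFC's, or AllSides' actual code or API; it also assumes you already have each dataset loaded as a hostname-to-label map. The entries in the usage example are placeholders, not real ratings.

```typescript
// Hypothetical types: none of these names come from Photon, MBFC, or AllSides.
type BiasSource = "mbfc" | "allsides";

interface BiasRating {
  source: BiasSource;
  bias: string; // label as given by that source; vocabularies differ per source
}

interface BiasSettings {
  enabled: boolean;      // master toggle: when false, nothing is shown
  sources: BiasSource[]; // which datasets the user wants to see
}

// Assumed shape: hostname -> bias label, one map per dataset.
type BiasDataset = Record<string, string>;

function lookupBias(
  url: string,
  settings: BiasSettings,
  datasets: Partial<Record<BiasSource, BiasDataset>>,
): BiasRating[] {
  // Users who don't want the feature get no ratings at all.
  if (!settings.enabled) return [];

  // Normalize "www.example.com" to "example.com" before the lookup.
  const host = new URL(url).hostname.replace(/^www\./, "");

  const ratings: BiasRating[] = [];
  for (const source of settings.sources) {
    const bias = datasets[source]?.[host];
    // A source with no entry for this host is skipped, not guessed.
    if (bias !== undefined) ratings.push({ source, bias });
  }
  return ratings;
}

// Usage with placeholder data only.
const datasets = {
  mbfc: { "example.com": "center" },
  allsides: { "example.com": "lean left" },
};
const shown = lookupBias(
  "https://www.example.com/story",
  { enabled: true, sources: ["mbfc", "allsides"] },
  datasets,
);
console.log(shown);
```

Returning one rating per source, instead of merging them into a single label, lets the UI show where the sources disagree, and the `enabled` flag covers the "just turn it off" case.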
Based.
The whole website bias rating is flawed by design. Each publication has multiple authors, who have different inherent biases and personal leanings that can influence the writing, even unconsciously. And while corporate influence does exist and politics are unavoidable, labelling everything a site publishes based on a small dataset just makes the situation worse.
I was afraid that Photon would go the same route as Tesseract (originally a fork of Photon), which went all in on flawed media bias checking and ruined a good project, but having the button for people who care about it is a good compromise.
As for MBFC specifically, https://slrpnk.net/comment/10200142 highlights the issues with it quite well.
https://spinscore.io is a relatively new tool, but shows some promising results so far.
Sadly there are no other options, as none of the alternatives (ground.news and the like) offer free API access for bias checks.