• FishFace@piefed.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    28 days ago

    The interpretation here depends on the idea of a word-vector. This is a component of language models which treat each individual word in a language as a vector in a pretty high-dimensional space (how high is up to the model author). The way this is usually described is that if you look at the word pairs “man - woman”, “boy - girl”, “king - queen” and so on, they should differ by a similar vector in word-vector-space, and that vector should correspond to the concept of “male” (or “female” depending on which way round you do it). If you have a word vector model, you should then be able to take the dot product of this gender concept-vector with a word like “actress” or “actor”, and see if it has learnt that “actress” is female and “actor” is kinda male but kinda gender neutral due to changing usage.

    So what this diagram is showing is a measure of similarity between various word vectors. Those vectors are (the vector of) a slur minus a related word. The idea is to see if subtracting “Mexican” from “spic” leaves you with an underlying concept of “slur” that corresponds to these other vectors - just like with gender and man, woman; boy, girl, etc.

    The confusion matrix is actually pretty interesting IMO. There is pretty high similarity between all of the “racial slur - race” vectors, and much less between “cunt - woman” and “fag - homosexual” and the others. So it’s showing that there isn’t that good a concept - in this word vector model at any rate - of “slur” in general, but you could argue pretty strongly that racial slur does exist in that way.

  • ranzispa@mander.xyz
    link
    fedilink
    English
    arrow-up
    1
    ·
    21 days ago

    This is all good, but give us the insult now: I just got a Jewish Asiatic mexican lesbian with African origins to deal with right now.

    • oneser@lemmy.zip
      link
      fedilink
      English
      arrow-up
      0
      ·
      29 days ago

      I think the x-axis labels are wrong. Cosine similarity is used to compare vectors in maths. 1 would mean the vectors are going in the same direction and 0 would mean they are going 90° to each other and -1 is opposite.

      • VoodooAardvark@lemmy.zip
        link
        fedilink
        English
        arrow-up
        0
        ·
        29 days ago

        While it seems you’re on to something with the x-axis, I do not believe that was the question. My interpretation, that I share, is wtf am I looking at? Haha