cross-posted from: https://lemmy.ml/post/15471632

Codeberg was asking about this. The linked toot by a commenter points to :

SEqlite

These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.

They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.

  • Miaou@jlai.lu
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    They have already access to SO’s CC content, why would they get it from the fediverse?

    • lambalicious@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      They already have it.

      I said alternative to SO. As in, likely, a place to post new content (answers, comments). Nothing can really be done with the content OAI already got their hands on other than firing off a few well-placed EMP bombs.

      • Miaou@jlai.lu
        link
        fedilink
        English
        arrow-up
        0
        ·
        2 months ago

        Yes, but you mentioned importing old content is problematic, and I don’t see why?