Spacc BBS Spacc BBS
    • Categorie
    • Recenti
    • Tag
    • Popolare
    • Mondo
    • Utenti
    • Gruppi
    • Registrati
    • Accedi
    La nuova BBS è in fase Alpha. I post precedenti al 22 luglio 2024 potrebbero non essere trasferibili, ma rimarranno disponibili per la lettura su /old/.

    Can anyone tell me what is #NodeBB and why is it scraping and republishing fediverse content without consent?

    Pianificato Fissato Bloccato Spostato Uncategorized
    nodebb
    41 Post 10 Autori 221 Visualizzazioni
    Caricamento altri post
    • Da Vecchi a Nuovi
    • Da Nuovi a Vecchi
    • Più Voti
    Rispondi
    • Topic risposta
    Effettua l'accesso per rispondere
    Questa discussione è stata eliminata. Solo gli utenti con diritti di gestione possono vederla.
    • dentangle@chaos.socialD Questo utente è esterno a questo forum
      dentangle@chaos.social @alex@anarres.family
      ultima modifica di

      @alex yes, it appears to be a forum that has recently patched in fediverse support without understanding or respecting our conventions.

      1 Risposta Ultima Risposta Rispondi Cita 0
      • gargron@mastodon.socialG Questo utente è esterno a questo forum
        gargron@mastodon.social @dentangle@chaos.social
        ultima modifica di

        @dentangle @jonny @onepict As I said, that page should have a noindex tag on it (if you know what that is), and I consider it an oversight that it doesn't. I've let the NodeBB folks know about it a few minutes ago. However, the existence of this page is completely normal. The equivalent page on mastodon.social is mastodon.social/@dentangle@chaos.social, and it is how I can talk to you despite not having an account on chaos.social.

        onepict@chaos.socialO dentangle@chaos.socialD 2 Risposte Ultima Risposta Rispondi Cita 0
        • onepict@chaos.socialO Questo utente è esterno a questo forum
          onepict@chaos.social @gargron@mastodon.social
          ultima modifica di

          @Gargron @dentangle @jonny I'm aware of backfilling and profiles existing on fediverse instances. So are other folks.

          My main issue is it being searchable on search engines. Plus mushing everything together without respecting the public/quiet public stuff.

          Thank you Eugen for making them aware.

          1 Risposta Ultima Risposta Rispondi Cita 0
          • dentangle@chaos.socialD Questo utente è esterno a questo forum
            dentangle@chaos.social @gargron@mastodon.social
            ultima modifica di

            @Gargron @jonny @onepict Thanks. Yes, I understand. I do hope it is merely an "oversight" as you put it.

            Given the number of times we've had to slap down attempts to make the fediverse searchable it's astonishing that a fediverse developer wouldn't take more care. Mistake or not, it's a huge breach of trust.

            thisismissem@hachyderm.ioT 1 Risposta Ultima Risposta Rispondi Cita 0
            • thisismissem@hachyderm.ioT Questo utente è esterno a questo forum
              thisismissem@hachyderm.io @dentangle@chaos.social
              ultima modifica di

              @dentangle @Gargron @jonny @onepict so at a protocol level "quiet public" doesn't really exist, all that happens in mastodon is that as:Public gets moved from `to`to `cc`, so they're effectively the same audience being addressed.

              So NodeBB is actually right, at a protocol level, to treat public and "quiet public" as the same.

              Though it sounds like steps will be taken to prevent indexing & display (when unauthenticated) of remote content outside of the context of a thread (you can't exactly mark sections of a page as noindex)

              dentangle@chaos.socialD 1 Risposta Ultima Risposta Rispondi Cita 0
              • dentangle@chaos.socialD Questo utente è esterno a questo forum
                dentangle@chaos.social @thisismissem@hachyderm.io
                ultima modifica di

                @thisismissem @Gargron @jonny @onepict

                The problem, as Gargron identified, appears to be the lack of a "noindex" tag, which in Fediverse terms is like running an SMTP open relay - a misconfiguration rather than a fault in protocol - but which should not be the default in any software, and which will get you instablocked by the entire Internet.

                thisismissem@hachyderm.ioT 1 Risposta Ultima Risposta Rispondi Cita 0
                • thisismissem@hachyderm.ioT Questo utente è esterno a questo forum
                  thisismissem@hachyderm.io @dentangle@chaos.social
                  ultima modifica di

                  @dentangle @Gargron @jonny @onepict right, best practice is to not make remote content directly viewable without authentication (but it may still appear in thread/reply views without authentication)

                  dentangle@chaos.socialD 1 Risposta Ultima Risposta Rispondi Cita 0
                  • dentangle@chaos.socialD Questo utente è esterno a questo forum
                    dentangle@chaos.social @thisismissem@hachyderm.io
                    ultima modifica di

                    @thisismissem @Gargron @jonny @onepict yes, where "best practice" == "if I don't want my instance defederated by the majority of the fediverse"

                    thisismissem@hachyderm.ioT 1 Risposta Ultima Risposta Rispondi Cita 0
                    • thisismissem@hachyderm.ioT Questo utente è esterno a questo forum
                      thisismissem@hachyderm.io @dentangle@chaos.social
                      ultima modifica di

                      @dentangle @Gargron @jonny @onepict the source of that best practice is more around rehosting random content and consequently having liability for that content.

                      dentangle@chaos.socialD 1 Risposta Ultima Risposta Rispondi Cita 0
                      • dentangle@chaos.socialD Questo utente è esterno a questo forum
                        dentangle@chaos.social @thisismissem@hachyderm.io
                        ultima modifica di

                        @thisismissem @Gargron @jonny @onepict

                        That may be the case for some instance admins, but most users are not admins.

                        The bigger issue is that feeding fediverse toots into search engines violates conventions and the expectations of most users. That's what causes fedi-riots every time some bright spark does it.

                        1 Risposta Ultima Risposta Rispondi Cita 0
                        • dentangle@chaos.socialD Questo utente è esterno a questo forum
                          dentangle@chaos.social @dentangle@chaos.social
                          ultima modifica di

                          Hi @julian

                          I know you're very busy sitting on panels at #Fedicon and talking about how to make the fediverse better. Great.

                          Unfortunately you are still running a scraper that is feeding search engines.

                          You've been posting from the con (tip: we use alt text on pictures here on the fediverse), so I know you're online.

                          You're following me, so you'll have seen my question. @Gargron has spoken to you too I believe.

                          A day later, no acknowledgement or apology or fix or promise of a fix. Why?

                          julian@community.nodebb.orgJ 1 Risposta Ultima Risposta Rispondi Cita 0
                          • julian@community.nodebb.orgJ Questo utente è esterno a questo forum
                            julian@community.nodebb.org @dentangle@chaos.social
                            ultima modifica di

                            Hi dentangle@chaos.social, I haven't been at a laptop this entire day since 7am this morning.

                            Around then I added a change to the link tags sent for remote profiles so that they point to the canonical source (your actual profile).

                            I'll likely just put in a redirect to your profile so it won't be accessible.

                            1 Risposta Ultima Risposta Rispondi Cita 0
                            • julian@community.nodebb.orgJ Questo utente è esterno a questo forum
                              julian@community.nodebb.org
                              ultima modifica di

                              dentangle@chaos.social I appreciate your civility so far while I work through what needs to be done about this.

                              1 Risposta Ultima Risposta Rispondi Cita 0
                              • deadsuperhero@social.wedistribute.orgD Questo utente è esterno a questo forum
                                deadsuperhero@social.wedistribute.org @dentangle@chaos.social
                                ultima modifica di

                                @dentangle@chaos.social Quick question, what makes you think this is a scraper? NodeBB is forum software that implements ActivityPub and federates using the protocol.

                                dentangle@chaos.socialD 1 Risposta Ultima Risposta Rispondi Cita 0
                                • dentangle@chaos.socialD Questo utente è esterno a questo forum
                                  dentangle@chaos.social @deadsuperhero@social.wedistribute.org
                                  ultima modifica di

                                  @deadsuperhero It doesn't matter where the data is coming from, the effect is the same. Scraping done over AP is still scraping. The data (retrieved over AP in this case) is being republished without a "noindex" tag so it is being fed into search engines, including posts on your peertube server.

                                  1 Risposta Ultima Risposta Rispondi Cita 0
                                  • dentangle@chaos.socialD Questo utente è esterno a questo forum
                                    dentangle@chaos.social
                                    ultima modifica di

                                    @julian Thank you for your response and taking this seriously.

                                    Please keep everyone informed. Feeding fediverse data to search engines (even accidentally, as this appears to be) is a breach of trust. How you handle this now is likely to be remembered by the fediverse for a long time.

                                    julian@community.nodebb.orgJ 1 Risposta Ultima Risposta Rispondi Cita 0
                                    • julian@community.nodebb.orgJ Questo utente è esterno a questo forum
                                      julian@community.nodebb.org @dentangle@chaos.social
                                      ultima modifica di

                                      dentangle@chaos.social the noindex tag has been added to all remote profiles.

                                      1 Risposta Ultima Risposta Rispondi Cita 0
                                      • Primo post
                                        Ultimo post