• 55 Posts
  • 712 Comments
Joined 1 year ago
Cake day: December 18th, 2023










  • Only few search engines index the Web at scale. Third parties who want to develop downstream applications based on web search fully depend on the terms and conditions of the few vendors. The public availability of the large-scale Common Crawl does not alleviate the situation, as it is often cheaper to crawl and index only a smaller collection focused on a downstream application scenario than to build and maintain an index for a general collection the size of the Common Crawl. Our goal is to improve this situation by developing the Open Web Index.

    The Open Web Index is a publicly funded basic infrastructure from which downstream applications will be able to select and compile custom indexes in a simple and transparent way. Our goal is to establish the Open Web Index along with associated data products as a new open web information intermediary.

    https://downloads.webis.de/publications/papers/hendriksen_2024.pdf

    This paper seems to give a good, quick overview.

    It looks to be the usual EU tech project. Doing more to achieve less in a desperate, hopeless attempt to make up for the stupidity and greed of European elites.





  • A user is typically a natural person. A username identifies that person. Any information that is directly or indirectly linked to that username is thus personal data of that person. The GDPR explicitly gives “online identifier” as an example of an identifier. I did link to the official repository, which hosts translations in all European languages. Each translation can be reached with one click, so it cannot be a language issue. I do not understand what the problem could be.

    The personal data in the OP (consent options) are linked to a person via a cookie stored in their browser. I do not understand how one could make sense of the case without understanding what personal data is.

    There also appears to be some confusion between GDPR and copyright. I do not know where these strange ideas come from.



  • There is nothing here that should need explaining to an adult with an IT job. The claim that personal data under the GDPR and PII under US law are the same thing is so fundamentally nonsensical that I can only take it as a joke. Sure, normally I would explain it, but when someone goes on like that about being a professional, it has to be a joke.

    In case anyone really is lurking here: maybe you could explain to them what you think happens when one agrees to be tracked for ads. That ought to be funny. Do they send a drone swarm with 4K cameras to your location? What’s a TC-string? Something that goes up your butt?