341 items found

Filter Results
  • Dataset

    Social Network dataset - LiveJournal

    LiveJournal is a free on-line blogging community where users declare friendship each other. LiveJournal also allows users form a group which other members can then join. We...
    • HTML
      The resource: 'LiveJournal social network ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Call Data Record District of Pisa 2013 October

    The dataset contains mobile phone records collected in the provinces of Pisa, Lucca, Livorno and Firenze in October 2013. It contains about 60 mln of Call Data Records (CDR),...
  • Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • Dataset

    Official administrative information of Tuscany

    The data contains the spatial partitioning of Tuscany and some statistical information published by the Italian Statistical Bureau.
    • LOD
      The resource: 'Linked Open Data' is not accessible as guest user. You must login to access it!
  • Method

    A hybrid approach for PPI

    We propose a new framework that can exploit topological and biological information to predict protein-protein interactions. The algorithm relies on the underlying hypothesis...
  • Dataset

    German Credit

    In the german credit dataset each one of the 1,000 persons is classified as a good or bad creditor according to attributes like age, sex, checking_account, credit_amount,...
    • CSV
      The resource: 'German Credit' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter Dumps

    The dataset consists of the 10% of the daily stream of tweets produced on Twitter filtered into 3 subsets: English, Italian, geo-referenced. The tweets are a random sample of...
  • Dataset

    Open data from NervousNet

    This dataset contains anonymized proximity information sent by 154 mobile phones (both Android and iPhone) via phone apps. These information are sent by bluetooth beacons every...
    • ZIP
      The resource: 'open data from NervousNet' is not accessible as guest user. You must login to access it!
  • Dataset

    Car sharing dataset

    The dataset comprises pickup and drop-off times and locations of vehicles in 10 European cities for one of the major free-floating car sharing operator. For nine of these...
  • Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • Dataset

    Broad Twitter Corpus

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
    • JSON
      The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
  • Method

    Gene-specific regularization for COPD partial-correlation estimation

    We introduce a gene-specific regularization factor when computing the Partial Correlation score to make the indeterminate regression feasible. We decided to slightly modify...
  • Dataset

    Estonian public sector electronic services and service providers and consumers

    The dataset contains records of electronic services (aka X-Road services), service providers and consumers harvested in April 2014 from RIHA (https://riha.eesti.ee). The data...
  • Dataset

    Twitter fake followers

    Fake followers are fake accounts massively created to follow a target account and that can be bought from online markets. In other words, their goal is that of increasing the...
  • Dataset

    Disease Twitter Dataset

    This Twitter dataset covers two recent outbreaks: Ebola and Zika. About 60 million tweets were collected through a query-based access to the Twitter Streaming API, covering...
  • Dataset

    e-MID interbank transactions

    This dataset is an edgelist containing daily interbank transactions as registered in the electronic Market for Interbank Deposits (e-MID), in the period 2010--2014. e-MID is...
  • Method

    EpiCID: A framework for discovering interactions between SNPs

    Epistatic interactions (EIs) of gene loci often determine complex trait phenotypes. EIs may indicate the underlying molecular mechanisms of multifactorial traits and diseases....
  • Dataset

    GeoLife - GPS trajectories dataset

    This (link to a) GPS trajectory dataset was collected in (Microsoft Research Asia) Geolife project by 182 users in a period of over three years (from April 2007 to August 2012)....
    • ZIP
      The resource: 'GeoLife Download page' is not accessible as guest user. You must login to access it!
  • Dataset

    Russell 3000 stock prices

    This dataset contains the price and volume of the 3000 stocks belonging to the Russell 3000 Index, roughly corresponding to the 3000 more capitalized stocks. Traded volume and...
  • Dataset

    .ee Web archive

    .ee Web archive consisting of snapshots from 2015