Pinned repositories

  1. engine

    engine is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.

    Scala 55 32

  2. gitbase

    SQL interface to Git repositories, written in Go.

    Go 1.1k 38

  3. ml

    sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees

    Python 59 21

  4. enry

    A faster file programming language detector

    Go 191 24

  5. go-git

    A highly extensible Git implementation in pure Go.

    Go 2.7k 254

  6. guide

    Aiming to be a fully transparent company. All information about source{d} and what it's like to work here.

    JavaScript 83 35

  • sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees

    Python 59 21 Updated Jun 24, 2018
  • An extensible MySQL server implementation in Go.

    Go 53 11 Apache-2.0 Updated Jun 22, 2018
  • source{d} blog

    HTML 18 25 MIT Updated Jun 22, 2018
  • Python 6 6 GPL-3.0 Updated Jun 22, 2018
  • gitbase playground

    Go 6 5 GPL-3.0 Updated Jun 22, 2018
  • SQL interface to Git repositories, written in Go.

    Go 1,070 38 Apache-2.0 Updated Jun 22, 2018
  • engine is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.

    Scala 55 32 Apache-2.0 Updated Jun 22, 2018
  • Advanced similarity and duplicate source code detection at scale

    Python 20 12 GPL-3.0 Updated Jun 22, 2018
  • Calculates Word Mover's Distance Insanely Fast

    Python 172 31 3 issues need help Updated Jun 22, 2018
  • Find similar code in Git repositories

    Scala 11 5 GPL-3.0 Updated Jun 21, 2018
  • Aiming to be a fully transparent company. All information about source{d} and what it's like to work here.

    JavaScript 83 35 Updated Jun 21, 2018
  • Queue is a generic interface to abstract the details of implementation of queue systems.

    Go 18 8 Apache-2.0 Updated Jun 21, 2018
  • A highly extensible Git implementation in pure Go.

    Go 2,714 254 Apache-2.0 22 issues need help Updated Jun 21, 2018
  • Just a prototype, nothing very relevant.

    Go 2 Updated Jun 21, 2018
  • A limited go-billy filesystem implementation based on siva.

    Go 6 5 Apache-2.0 Updated Jun 21, 2018
  • Rovers is a service to retrieve repository URLs from multiple repository hosting providers.

    HTML 8 10 GPL-3.0 2 issues need help Updated Jun 21, 2018
  • Cross-Company's Projects Brand Assets

    2 1 Updated Jun 21, 2018
  • The missing interface filesystem abstraction for Go

    Go 64 15 Apache-2.0 Updated Jun 20, 2018
  • Applications for Kubernetes

    Smarty 3 9 Apache-2.0 Updated Jun 20, 2018
  • source{d} datasets ("big code") for source code analysis and machine learning on source code

    Jupyter Notebook 27 10 Updated Jun 19, 2018
  • Kallax is a PostgreSQL typesafe ORM for the Go language.

    Go 621 34 MIT 1 issue needs help Updated Jun 18, 2018
  • borges collects and stores Git repositories.

    Go 29 16 GPL-3.0 Updated Jun 18, 2018
  • Cool links & research papers related to Machine Learning applied to source code (MLonCode)

    2,931 395 CC-BY-SA-4.0 Updated Jun 15, 2018
  • Tracking events, CfPs, abstracts, slides, and all other even related things

    9 4 Apache-2.0 Updated Jun 14, 2018
  • landing for source{d}

    CSS 9 MIT Updated Jun 13, 2018
  • Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall

    Python 29 MIT Updated Jun 13, 2018
  • Machine learning models for MLonCode trained using the source{d} stack

    Python 7 3 1 issue needs help Updated Jun 11, 2018
  • Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA

    Jupyter Notebook 304 58 Updated Jun 11, 2018
  • Shell 3 9 Updated Jun 11, 2018
  • Log is a generic logging library based on logrus

    Go 9 6 Apache-2.0 Updated Jun 11, 2018