"We present a novel streaming algorithm for evaluating XPath expressions that use backward axes
(parent and ancestor) and forward axes in a single document-order traversal of an XML document.
Other streaming XPath processors, such as YFilter, XTrie, and TurboXPath handle only forward axes.
We show through experiments that our algorithm significantly outperforms (by more than a factor of
two) a traditional non-streaming XPath engine. Furthermore, since our algorithm only retains relevant
portions of the input document in memory, it scales better than traditional XPath engines. It can process
large documents; we have successfully tested documents over 1GB in size. On the other hand, the
traditional XPath engine degrades considerably in performance for documents over 100 MB in size and
fails to complete for documents of size over 200 MB."
"Our project (titled xstream)
concentrated on evaluation of XPath over XML streams.
This research area contains multiple challenges resulting
from both the richness of the language and the
requirement of having only a single pass over the data.
We modified and extended one of the known algorithms,
TurboXPath [4], a tree-based IBM algorithm. We also
provide extensive comparative analysis between
TurboXPath and XSQ [5], currently the most advanced of
finite automata (FA)-based algorithms."
"Figaro is a probabilistic programming language that supports development of very rich probabilistic models and provides reasoning algorithms that can be applied to models to draw useful conclusions from evidence. Both model representation and reasoning algorithm development can be challenging tasks.
Figaro makes it possible to express probabilistic models using the power of programming languages, giving the modeler the expressive tools to create a wide variety of models. Figaro comes with a number of built-in reasoning algorithms that can be applied automatically to new models. In addition, Figaro models are data structures in the Scala programming language, which is interoperable with Java, and can be constructed, manipulated, and used directly within any Scala or Java program."
"LZ4 is a very fast lossless compression algorithm, providing compression speed at 300 MB/s per core, scalable with multi-cores CPU. It also features an extremely fast decoder, with speeds up and beyond 1GB/s per core, typically reaching RAM speed limits on multi-core systems."
"LibLZF is a very small data compression library. It consists of only two .c and two .h files and is very easy to incorporate into your own programs. The compression algorithm is very, very fast, yet still written in portable C."
Assessing the effect of different programming languages and programming styles on programmer productivity is of critical interest. In his paper, Gilles Dubochet, describes how he investigated two aspects of programming style using eye movement tracking. He found that it is, on average, 30% faster to comprehend algorithms that use for-comprehensions and maps, as in Scala, rather than those with the iterative while-loops of Java.
"Apache Hama is a pure BSP (Bulk Synchronous Parallel) computing framework on top of HDFS (Hadoop Distributed File System) for massive scientific computations such as matrix, graph and network algorithms.
Today, many practical data processing applications require a more flexible programming abstraction model that is compatible to run on highly scalable and massive data systems (e.g., HDFS, HBase, etc). A message passing paradigm beyond Map-Reduce framework would increase its flexibility in its communication capability. Bulk Synchronous Parallel (BSP) model fills the bill appropriately. Some of its significant advantages over MapReduce and MPI are:
* Supports message passing paradigm style of application development
* Provides a flexible, simple, and easy-to-use small APIs
* Enables to perform better than MPI for communication-intensive applications
* Guarantees impossibility of deadlocks or collisions in the communication mechanisms"
"Joone is a FREE Neural Network framework to create, train and test artificial neural networks. The aim is to create a powerful environment both for enthusiastic and professional users, based on the newest Java technologies.
Joone is composed by a central engine that is the fulcrum of all applications that are developed with Joone. Joone's neural networks can be built on a local machine, be trained on a distributed environment and run on whatever device.
Everyone can write new modules to implement new algorithms or new architectures starting from the simple components distributed with the core engine. The main idea is to create the basis to promote a zillion of AI applications that revolve around the core framework."
A new "Least Time" load-balancing algorithm
Full-featured TCP load balancing
High availability and failover between NGINX Plus instances
A new statistics dashboard and improved monitoring
Support for SSL authentication of email traffic (IMAP, POP3, and SMTP)
"pHash is an open source software library released under the GPLv3 license that implements several perceptual hashing algorithms, and provides a C-like API to use those functions in your own programs. pHash itself is written in C++."
...investigadores de la Universidad Carlos III de Madrid (UC3M) han desarrollado el algoritmo SoSACO, que acelera la búsqueda de caminos entre dos nodos pertenecientes al grafo que representa a una red social...
Habría que analizar si es aplicable a los datos de navrel este algoritmo.