As the title suggests, this article covers how raw text can be converted into structured data. Before going any further, a word on the terminology used here is needed. In simple terms, raw text can be thought of as a set of tokens grouped into sentences, carrying information expressed in a natural language such as English. Put simply, raw text...
Author: Yuli Vasiliev
Training Models in spaCy
spaCy v2.0 features statistical neural network models for named entity recognition, dependency parsing, and similarity prediction. One of the most exciting opportunities is that you cannot only use pre-trained, ready-for-use models, but you can also train a model with your own training examples, thus tuning it for your particular use case. spaCy’s Model Architecture Before starting to train a model,...
Intelligent Text Processing with spaCy
The task of extracting meaning from text data can be quite challenging. The problem is that you cannot rely on the meaning of individual words in a sentence or phrase, since the order of words may invert the whole point. Moreover, the same words may have different meaning, depending on the context. To address this problem, NLP has linguistic features...
The Magic of Word Vectors
In a nutshell, word vectors represent semantic meanings of natural language words, encoded in numbers. This looks a bit different from traditional representations of words, but this is what makes human language understandable to machines. Visually, a vector space of words can be imagined as a cloud in which words with similar meanings are located nearby. It is important to...
Using Machine Learning Algorithms to Analyze Chatbot Data
As a chatbot developer, you must tap into a set of emerging technologies. Machine learning is definitely one of them. Broadly speaking, machine learning can be used in a pretty wide variety of fields to recognize the patterns in your data to provide visual insights and statistical analysis, helping you interpret the data in your data set. Turning back to...
Machine Learning with R from a SQL Environment
Using R with Oracle Database Setting Up Your Working Environment Before you can start using Oracle R Also, to follow along with the article examples, you will need a SQL tool to interact with your Oracle database, such as Oracle SQL Developer or Oracle SQL Plus. Deploying R Scripts to Oracle Database Using the following code, you store an R...
Setting up Oracle R Technologies in Oracle Database 12.2
By Yuli Vasiliev Before you can start using Oracle R technologies, you need to install the software components enabling them. In particular, you will need to install: Oracle R Distribution and Oracle R Enterprise. The latter in turn consists of two components: a server and a client. The server is installed on the machine running the Oracle Database server that...
Developing Machine Learning Apps with Oracle R Technologies
By Yuli Vasiliev Oracle R Technologies provides a comprehensive platform for data analysts and data scientists, making it easy to perform statistical data analysis and generate advanced graphics. If you are using an Oracle database to store your data and are interested in advanced analytics, then employing Oracle R Technologies should be your natural choice. The key component of Oracle...
Building and Scoring R Models in Oracle Database
By Yuli Vasiliev Oracle R Enterprise comes with the packages that include functions for building and scoring regression models, neural network models, and Oracle Data Mining models. Regardless of the package they belong, all those functions have a similar syntax, requiring you to write just a couple lines of R code to build a model and then score it on...
Incorporating R Analytics Into Oracle Database
By Yuli Vasiliev One of the most important features Oracle R Enterprise offers is the ability to store R scripts in the database and then invoke them through either an R interface or a SQL interface. The latter becomes especially important when it comes to the production stage, since SQL remains the primary interface in production database applications. This article...