Description :
We propose the development of a data analytics and visualization platform which facilitates the capture and analysis of unstructured text-based data. The text captured and analyzed is in the format of a response to a question, which the platform is able to semi-autonomously structure. The quantity of data clustered can be further optimized through enhancement of synonym dictionaries, suffix stemming and other tweaks and algorithms. We would then like to visualize this data in a format which allows businesses to quickly understand key insights and trends identifiable within the data, which they can act upon to improve their product and service offerings.
We have completed preliminary research around data clustering platforms and believe that by developing add-on modules on top of preexisting platforms we can create a new, innovative, and we believe - disruptive new business.
The analytics platform will be built around existing open source as well as commercial text data clustering software. We propose today the delivery of an initial proof of concept for this project:
1. Cloud-based infrastructure set-up of Logic3G clustering software
2. Development of front-end web-page infrastructure that captures responses to questions
3. Development of transcoding module that converts this unstructured data into machine readible format(s) for Logic3G and
other clustering software
4. Set-up and/or development of a simple visualization front-end which allows a viewer the ability to see a graphical
representation of the clustered data
The service this platform provides can be marketed as a service to enterprises whom have large electronic email opt-in databases and/or large Twitter/Facebook fan pages and whom are interested in conducting market research utilizing broad, public openended question and responses.
This same platform - has the potential to also be massively disruptive to how people consume and react to media. We'll talk more about this later in the proposal.