INTRODUCTION Structured data vs unstructured data: structureddata is involved ofclearly characterized data types whose pattern makes them effectivelysearchable; while unstructured data “everything else” contains data whichis not easily searchable such as social media postings.Unstructured data versusstructured data does not signify any genuine clash between the two. Clientsselect either not founded on their information structure, but rather on theapplications that utilization them: social databases for organized, and mostsome other sort of use for unstructured data. However, there is agrowing strain between the simplicity of investigation on structured dataversus additionally difficult examination on unstructured data.
Structured dataexamination is a develop procedure and innovation. Unstructured data analyticsis a beginning industry with a great deal of new speculation into R&D,however isn’t a develop innovation. The structured data versus unstructureddata issue inside companies is choosing in the event that they ought to putresources into investigation for unstructured data, and on the off chance thatit is conceivable to total the two into better business knowledge. What is structured data ?Thestructured data depends upon the creation of data model :- which tells the typeof business data which will be recorded and how it will be stored andprocessed. It also includes which field of data is stored and how the data willbe stored which is called data type and it includes Numeric, textual, name,address, etc and also the restrictions on the data input. Structured data has abenefit that it can be easily stored, processed and analysed. Structured data is oftenmanaged using Structured Query Language (SQL) – which is a programming languagecreated for management and query of data Whatis unstructured data? Unstructured data is not arrangedin fixed pre defined way and it’s the data which have no fixed data model 1.
Unstructureddata cant be stored in a table without preprocessing2. Examples: social media sites(tweets, blogs,posts, etc.), call centre data, email, surveys with open questions. Unstructured data has stronginfluence of three V’s:-Volume:- Unstructured data usuallyrequires more storage than structured data.Variety:-Unstructured datapreviously was generated by untapped data sources, which can reveal personalinformation of customers.Velocity:-The unstructured data isincreasing at more pace than the structured data.Figure representing 3V’s is below:- Figure 1 Sourceinfodiagram.
com How prevalent are unstructured data? Most of thebusiness data is unstructured data. It grows much more faster than thestructured data. 1. Moredata storage is required for pictures and videos which is also called as “RichContent” 2. Thedata which is produced by objects that are formerly not connected, likewatches, cars, robots, etc are very important for the growth of data.
Unstructured data sources become transcendent reason for customer insights.3. Thestructured data when combined with unstructured data sources help to obtain amore complete picture of the needs and what customers want.
4. Unstructureddata is more subjective, while the structured data tends to provide answers to”what” questions while unstructured data usually provides the answer to “why”questions. Theuniverse of computing has developed from a little, moderately unsophisticatedworld in the mid 1960’s to an environment of enormous size and modernity.Everything from the day by day life of people to our national financialprofitability has been significantly and emphatically influenced by thedevelopment of the utilization of the computer. Furthermore, this developmentcan be measured in two ways :- structured systems andunstructured systems DIFFERENCE BETWEEN AND STRUCTURED ANDUNSTRUCTURED DATA STRUCTURED DATA UNSTRUCTURED DATA Structured systems are those systems where the activity of processing data and output is predetermined and highly composed. Structured systems are designed, built and operated by the IT department. ATM transactions, manufacturing inventory control systems are all forms of structured systems.
The rules in structured system are little complex. By contrast, unstructured systems are those systems which have very less form or structure. Unstructured systems include email, reports, contracts, and other communications. A person who performs a communications activity in an unstructured system has wide latitude to structure the message in whatever form is desired. The rules of unstructured systems are fewer and less complex. Figure2:- Great benefits can be achieved frombridging the gap between structured and unstructured systems The structured andunstructured data system has grown in parallel but separately. So, both hasseparate environment and different from each other in ways such as:-1.
Structural2. Organisational3. Functionaland technical There could behuge number of possibilities if both of the systems are connected in aneffective way.
The new type of systems can be built with the enhancement toexisting systems. There could be more amazing benefits which could be achievedif all the technical, structural, functional and organisational barriers can beremoved. A NEW PERSPECTIVE OF DATABusiness intelligencefaces certain limitations because it is totallybased on the numbers. The most distinctive and necessary way to reduce the gapbetween structured and unstructured data is to merge the text and numeric data,which can lead to better and higher information and insight which was not attainablepreviously.
There are numerous wayswith which the merger of numeric and textual data can be used to make moreinnovative results. An example is to create an unstructured contact file, whichhas access to every communication which the customer had previously with theorganisation including letters and emails. So, this file will have all usefulsources such as communication, date of contact, with whom person contacted,nature of the contact and many more. USESFOR THE UNSTRUCTUED CONTACT FILE The most powerful use ofcontact file of customer in terms of increasing a CRM system to create abroader view of a customer, enables us to attain these important objectives :-One of the most powerfuluses of the customer contact file is in terms of supplementing a CRM system tocreate the broad view of the customer, enabling to accomplish these important objectives: 1. CrossSelling:- If one understands a lot about the customer in one arena, the chancesto sell to the same customer in another arena will materialize.2. Prospecting:-Better one knows or understands a customer, the better one can qualify salesprospect list.3.
Anticipation:-By understanding more about the customer, we can meet the future needs. One of the essentialfundamentals of CRM is that it is substantially simpler to offer into a establishedclient than get another client. This long haul relationship is set up in viewof coordinated learning about the client, including: · Age · Occupation · Net worth · Marital status · Education · Children · Income · Address The idea behind makingthe 360 degree perspective of the client is to unite information from a widerange of places in request to coordinate the information and accomplish agenuinely strong and far reaching perspective of the client. Figure3 However, there arechallenges to integrating all this data, such as:1.
Datafinding in first place.2. Datamaintainence using different technologies3. Mergingthe gathered data4. Maintainingcustomer’s profile up to date5. Managementof volume of collected data Unstructured contact file CUSTOMER ID · name · age · gender · address · phone · occupation · Income Independent from anyoneelse the information accumulated as a major aspect of this procedure isprofitable. In any case, to make a genuine 360 degree view of the client, oneshould upgrade this information with the rich vein of unstructured clientcorrespondences data.
At exactly that point will you have the completeviewpoint. Rather than simply knowing odd actualities about the client, theorganization can recognize what the client has been stating what communicationhave happened. So as to accomplish the 360 degree perspective of the client,bunches of different kinds of data are coordinated together. Figure4 BUILDINGTHE UNSTRUCTURED CONTACT FILE There are variousmethods to accomplish build of an unstructured file. Using an example of email,the easiest and common way is to index the un-structured the contact file andleave email from where they are located originally.
With the use of thistechnique , an index is created for every communication, which contains few itemssuch as :- • Communication date• With whom thecommunication is directed• Customer’s name andidentification• Email’s location Whenever anycorporation wants to figure out if there is any communication, the index isused. If it seems that the communication is relevant, the corporation can seethe storage location of the email and also can read the email. Alternately, theactual email sent with the index and there is no requirement of further search.This approach requires more system resources , it does reduces the requiredwork finding a specific email. USES OF UNSTRUCTURED CONTENT IN OTHERAPPLICATIONS Themost important use of unstructured data is in litigation support.
For instance:- if a company is sued by someone. The first thing which that company shouldknow is that what contact it had with that person. With whom he/she was workingwith and with whom her/she contacted. In this case, the ability of viewingunstructured data is invaluable. There isanother use of mixing structured and unstructured data to increase the businessintelligence and reports.
While it is through reports and businessinsight that applications pass on their discoveries to the end client, there isan incredible impediment to reports and business insight since they essentiallydepend on structured frameworks for their data. Structured applications aregreat at:1. Summariescreation 2.
Summaryof data break down into different categories.3. Drilldown creation4. Drillacross creation Figure5How Semi-Structured Data Fits with Structuredand Unstructured DataSemi-structured datakeeps internal markings that acknowledge separate data elements, that empowersinformation grouping and chain of commands. The two reports and databases willbe semi-structured. This information just represents around 5-10% of the semi-structured/structured/unstructureddata pie, but also has basic business use cases.Email is an very basiccase of a semi-structured data type. Although further developed examination toolsare important for string chase, close dedupe, and idea seeking; email’s localmetadata empowers grouping and catchphrase looking with no extra tools.
Semi-structured Dataexamples :-· Markup language XMLIt is a semi structuredlanguage. XML is an arrangement of report encoding rules that characterizes ahuman-and machine-decipherable format. Its value is that its tag-drivenstructure is profoundly flexible, and coders can adjust it to universalizeinformation structure, storage, and transport on the Web. Open standard JSON JSON is another semi-structured data tradearrange.
Java is understood in the name yet other C-like programming languagesrecognize it. Its structure comprises of name/value matches (ex question), anda requested value list (ex cluster). Since the structure is exchangeable among languages,JSON exceeds expectations at transmitting information between web applicationsand servers. NoSQL Semi-structured information is a vital part of various NoSQL databases. NoSQL databases distinction from relative databases since they do not separate theorganization from the info.
This settles on NoSQL a superior call tostore information that doesn’t effectively match into the record and tableformat, as an example,content with dynamical lengths.It likewise takes into thought less hard data trade between databases. Some a lot of up to this point NoSQL information bases like Couchbase &MongoDB to boot fusesemi-structured data by regionally put away them within the JSON format. Structured vs Unstructured Data: Next GenerationTools are Game ChangersThere are new tools whichare accessible to interrupt unstructured data. Most of these tools rely onmachine learning. Structured data examination may also use machine learning,the huge volume and a huge range of various kind of unstructured data needs it.Unstructured information examination with machine-learning insight enablesassociations to :- •Analyze digital correspondence for consistence:-Failed consistence cancost organizations a lot of dollars in lostbusiness and cost. Pattern recognition and email threading investigationprogramming seeks enormous measures of email and visit information forpotential noncompliance.
A current case incorporates Volkswagen’s burdens, whomay have maintained a strategic distance from a tremendous fines andreputational hits by utilizing examination to screen correspondences forsuspicious messages. •Track high-volume client conversations in social media:- Content analytics and opinion investigationgives experts a chance to audit positive and negative results of advertisingefforts, or even distinguish online dangers. This level of analytics issignificantly more modern straightforward keyword search, which can just reportbasics like how frequently notices said the organization name during newcampaign.
New investigation likewise incorporate setting: was the say positiveor negative? Were notices responding to each other? What was the tone ofresponses to official declarations? The automotive business for instance isintensely engaged with examining online networking, since auto purchasersfrequently swing to different notices to measure their auto buying experience.Experts utilize a mix of text mining and assessment analysis to trackauto-related client posts on social media sites (Twitter). • Gain new advertising intelligence:- Machine-learning examination instrumentsrapidly work enormous measures of archives to investigate client behaviour. Anoteworthy magazine distributer connected content mining to countless articles,examining each different production by the prevalence of major subtopics.
Atthat point they broadened analytics over all their substance properties to seewhich general themes got the most consideration by client statistic. The analyticskept running crosswise over a huge number of bits of substance over allproductions, and cross-referenced interesting issue comes about by segments.The outcome was a rich instruction in which topics were most fascinating toparticular clients, and which marketing messages reverberated most firmly withthem.