뉴스레터

이메일로 Hortonworks의 새 업데이트를 받으세요.

한 달에 한 번 빅 데이터와 관련한 최신 인사이트, 동향, 분석 정보, 지식을 받아 보세요.

AVAILABLE NEWSLETTERS:

Sign up for the Developers Newsletter

한 달에 한 번 빅 데이터와 관련한 최신 인사이트, 동향, 분석 정보, 지식을 받아 보세요.

CTA

시작하기

클라우드

시작할 준비가 되셨습니까?

Sandbox 다운로드

어떤 도움이 필요하십니까?

* 저는 언제든지 구독을 해지할 수 있다는 점을 이해합니다. 또한 저는 Hortonworks이 개인정보 보호정책에 추가된 정보를 확인하였습니다.
닫기닫기 버튼
이전 슬라이드
Empower Your Enterprise Cybersecurity Strategy With Open Collaboration and Community
December 19, 2018
What Is Data Fabric, and What Value Does It Offer Organizations?
다음 슬라이드

How a Big Data Fabric Can Transform Your Data Architecture

작성자:
Jonathan Hassell

Over the last couple of years, big data fabric technology has emerged as a strategic way for companies to get the most value from their data investments. As data lakes proliferate, they also become more difficult to manage. A big data fabric might be the answer for many enterprises struggling to manage their vast stores of big data.

So what, exactly, is a data fabric? And how does it relate to the traditional data lake? Here’s a closer look at the function of a data fabric and what its advantages are, so you can decide if the technology could help your business.

Moving to Multiple Data Lakes

As big data evolves, organizations are tending toward having multiple data lakes instead of a single one. These additional data lakes are built for a number of reasons: they may serve backup or disaster-recovery purposes for an existing production data lake, or perhaps they may replicate the contents of one data lake to another geographic location.

Regardless of why data lake proliferation happens, it presents a challenge to any organization: How do you ensure consistent security governance and data management across all those data lakes?

You likely spent a lot of time building policies for governing and securing your first data lake. When you built another lake, you most likely wanted to ensure you had a way to consistently apply those original policies to that one, too. But the more your big data environment grows, the more difficult it becomes to govern, secure, and manage all those data lakes. It may even become necessary to build out brand-new policies that take the new size of your environment into account.

Another complicating factor is the emergence of the cloud. Many use cases relating to data science, artificial intelligence, machine learning, deep learning, and the like are well-suited to operating in the cloud, and many companies are moving their data off on-premise data centers to save on operating costs and improve availability. While the benefits of the cloud cannot be disputed, it further expands the big data environment and may introduce new management complications.

All of these scenarios require that you have a management layer and abstraction layer that fit across all of your data sources or lakes, whether they are in the cloud or on premises. The abstraction layer’s role is to ensure consistent security and data management across data lakes. A big data fabric can serve as that abstraction layer.

Uniting Them All With a Big Data Fabric

A data fabric weaves together and surrounds all of your data sources. It’s aware of all that exists now and automatically registers new data sources as they are added.

There are several characteristics that a good, enterprise-class data fabric should have:

  • The fabric should be aware of all of your data sources and know where all of your data clusters are. It should provide a system administrator with an easy visualization of where all of the data you have resides, what kinds of services are running in those places, what the statuses of those services are, and where all of your data clusters live. This is a basic requirement.
  • The data fabric should lend itself to building a number of applications on top of it. For example, if you want to move data from one cluster to another, there should be an application in the fabric that lets you do that. If you need to know where sensitive data is located in various clusters, there should be an application available that tells you that (for example, that cluster A, column B, contains social security numbers, telephone numbers, or similar personal information). If you know that, you’ll have the ability to apply a tag labeling that data as personally identifiable or sensitive, and therefore not available for business analysis.
  • The data fabric should also help you derive a security policy. If you have identified sensitive data in your various clusters, you should be able to restrict access to that data. For instance, you should be able to designate that only the human resources organization has access, or perhaps that the data can only be accessed from a certain geography if, for example, it pertains to European citizens. Or you might want to be able to designate that a security event be logged if data is accessed outside of normal business hours.

Keeping Your Big Data Environment Organized

Ultimately, a data fabric helps you evolve your organization into a multiple data lake environment in an organized and secure way. Data fabrics help your organization achieve consistency, security, and high availability while providing a seamless management layer that is aware of all your data all the time.

For more on data fabrics, read The Forrester Wave: Big Data Fabric report.

답변을 남기십시오

귀하의 이메일 주소는 공개되지 않을 것입니다. 필수 내용은 *로 표시되어 있습니다.