http://social.technet.microsoft.com/wiki/contents/articles/15989.sharepoint-2013-search-architecture-in-spc202.aspx

 

The presentation is broken down into 4 parts:

  • Feeding Chain (crawler and content processing)
  • Index Core (index components)
  • Query Chain
  • Analytics Component

A few steps back: SharePoint 2010 had a built-in search service, and FAST Search shipped alongside it as a separate search service. With SharePoint 2013, the different components are merged into a single cohesive search service.
In addition to SharePoint, the search component is also shared with Exchange. Search built on FAST technologies now spans everything from Outlook Web Access through eDiscovery to intranet search scenarios. There are immense investments in Internet-facing search as well, with features such as Product Catalog and Cross-Site Publishing. We now use search without even thinking about it in many areas that were not traditionally search-driven. It is much more than the original "Search Box".

Feeding Chain

  • Crawl Component: The crawler in SP2013 is much more specialized. It is extensible through BCS, uses a local disk cache, ships with OOB connectors, and stores its configuration in the Admin database. Big change: a new crawl mode called continuous crawl, which runs continuously thanks to parallel processing. Crawl components now scale individually; scaling is no longer interlocked between crawl components and crawl databases.
  • Content Processing Component: A stateless node that analyzes content for indexing. It uses "processing flows" and schema mappings, stores links and anchors in the Links database (used by analytics), is extensible via web service call-outs, and stores its configuration in the Admin database. Each CrawlerFlow processes one document at a time. Note: Security crawls should be faster now, due to the use of Update Groups.


Index Core

  • Index component: The index is a stateful component.
  • New concepts: Partitions and Replicas replace Columns and Rows. All nodes perform indexing. Journal shipping carries changes from the primary to the replicas. Each partition can have many replicas. Unlike SharePoint 2010 (but typical for FAST), the index is stored on local disk rather than in a database, which is how SharePoint 2010 stores it (in the Property database).
  • Improvements to index freshness: As content comes in, it goes straight to memory and is immediately searchable. It is eventually flushed to disk. In previous FAST products, documents had to be stored on disk before they became searchable.
  • Understanding index schemas: Crawled properties map to managed properties (crawled property -> managed property). Schema administration can now be done at the Site Collection level. "Searchable" and "Retrievable" are added to the managed property settings.
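
The crawled property -> managed property mapping can be pictured with a small sketch. This is only an illustration of the idea, not SharePoint's implementation: the ManagedProperty class, the map_item and retrievable_fields helpers, and the property names used with them are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ManagedProperty:
    """Hypothetical model of one managed property in the search schema."""
    name: str
    crawled_properties: List[str]  # crawled property names, in priority order
    searchable: bool = False       # value is included in the full-text index
    retrievable: bool = False      # value may be returned in search results


def map_item(crawled: Dict[str, str], schema: List[ManagedProperty]) -> Dict[str, str]:
    """Map the crawled properties of one item onto managed properties.

    The first crawled property in the priority list that has a value wins.
    """
    managed = {}
    for prop in schema:
        for crawled_name in prop.crawled_properties:
            if crawled_name in crawled:
                managed[prop.name] = crawled[crawled_name]
                break
    return managed


def retrievable_fields(managed: Dict[str, str], schema: List[ManagedProperty]) -> Dict[str, str]:
    """Keep only the managed properties that are flagged as retrievable."""
    allowed = {prop.name for prop in schema if prop.retrievable}
    return {name: value for name, value in managed.items() if name in allowed}
```

In this sketch a property can be searchable without being retrievable: its text would be matched by queries but its value would not be returned with a result.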


Query Chain

  • Web Front-End: REST/OData API, CSOM and SSOM, Portals and Publishing, Search Center, ContentWebPart, RefinerWebPart, Result Templates. The programming model lets you serve both the cloud and on-premises deployments. You can take advantage of these APIs to develop applications running on Surface devices, phones, and tablets.
  • Query Processing Component: A stateless node that processes query flows. It covers the query analyzer, linguistics/dictionaries, result sources, schema mappings, query rules, and query federation. Its configuration is stored in the Admin database.
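
As a rough illustration of the REST entry point, the sketch below builds a GET URL for the search REST endpoint (/_api/search/query) with the querytext, selectproperties, and rowlimit parameters. The build_search_url helper and the site URL are hypothetical, the quote-doubling rule is assumed from the OData string-literal convention, and authentication and sending the request are left out.

```python
from urllib.parse import quote


def build_search_url(site_url, query_text, select_properties=None, row_limit=None):
    """Build a GET URL for the SharePoint 2013 search REST endpoint.

    Hypothetical helper: it only assembles the URL. String values are
    wrapped in single quotes; an embedded single quote is doubled
    (assumed here, following the OData string-literal convention).
    """
    escaped = query_text.replace("'", "''")
    params = ["querytext='" + quote(escaped, safe="") + "'"]
    if select_properties:
        # Comma-separated list of managed properties to return.
        joined = ",".join(select_properties)
        params.append("selectproperties='" + quote(joined, safe=",") + "'")
    if row_limit is not None:
        params.append("rowlimit=" + str(int(row_limit)))
    return site_url.rstrip("/") + "/_api/search/query?" + "&".join(params)


if __name__ == "__main__":
    # Placeholder site URL, for illustration only.
    print(build_search_url("https://contoso.example/sites/docs",
                           "search architecture",
                           select_properties=["Title", "Path"],
                           row_limit=5))
```

Because the call is plain HTTP, the same URL can be requested from any client platform, which is the point made above about phones and tablets.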

Notes

  • Query federation: Federation between on-premises and cloud is possible.
  • The query router uses various Search Provider Flows (e.g. Best Bets, People Search, Exchange Search, Local SharePoint Search, Remote SharePoint Search, Personal Favorites, etc.). These are exposed to end users and administrators via Result Sources, Query Rules, and similar features.
  • Query Rules: Capture search intent. Composed of 3 top-level elements:
      • Query Conditions: Which queries should be handled.
      • Query Actions: What happens when a rule matches.
      • Publishing Options: Whether the rule is active and for how long.
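
The three elements of a query rule can be modeled in a few lines. This is a conceptual sketch only: the QueryRule class and apply_rules function are hypothetical and do not mirror the SharePoint object model.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, List, Optional


@dataclass
class QueryRule:
    """Hypothetical model of a query rule and its three top-level elements."""
    condition: Callable[[str], bool]   # Query Condition: which queries to handle
    action: Callable[[str], dict]      # Query Action: what happens on a match
    active: bool = True                # Publishing Options: is the rule active...
    start: Optional[date] = None       # ...and from when...
    end: Optional[date] = None         # ...until when

    def is_published(self, today: date) -> bool:
        """Return True if the rule is active and today is inside its window."""
        if not self.active:
            return False
        if self.start is not None and today < self.start:
            return False
        if self.end is not None and today > self.end:
            return False
        return True


def apply_rules(query: str, rules: List[QueryRule], today: date) -> List[dict]:
    """Run the action of every published rule whose condition matches the query."""
    return [rule.action(query)
            for rule in rules
            if rule.is_published(today) and rule.condition(query)]
```

For example, a rule whose condition checks for the word "holiday" and whose action returns a promoted result would fire only for matching queries, and only inside its publishing window.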

Analytics Service

  • Analytics Processing Component: Map/Reduce, learns by usage, Search Analytics, Usage Analytics, enriches the index by updating index items, usage reports stored in the Analytics database.

Making recommendations is easy. Recommendations are based on user behavior.
Built-in recommendations:
  • Event stream analysis
  • Item-to-item recommendations
  • Stored in the "recommendedfor" managed property. Up to 12 configurable, weighted events.
Note: For example, if I'm building an application in Java, can I take my events and feed them into the Analytics Engine? Yes, through the REST API or CSOM.
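
To make "weighted events" and "item-to-item recommendations" concrete, here is a minimal co-occurrence sketch. It is not the algorithm used by the Analytics Processing Component, which the session did not detail; the item_to_item function, the event names, and the weights are all assumptions chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations


def item_to_item(events, weights, top_n=3):
    """Derive item-to-item recommendations from a stream of usage events.

    events:  iterable of (user, item, event_type) tuples
    weights: dict mapping event_type to a numeric weight (hypothetical values)
    Returns a dict mapping each item to its most strongly related items.
    """
    # Step 1: accumulate a weighted score per user and item.
    user_items = defaultdict(lambda: defaultdict(float))
    for user, item, event_type in events:
        user_items[user][item] += weights.get(event_type, 0.0)

    # Step 2: items touched by the same user co-occur; the strength of a
    # pair is the weaker of the two weighted scores for that user.
    related = defaultdict(lambda: defaultdict(float))
    for items in user_items.values():
        for a, b in combinations(sorted(items), 2):
            score = min(items[a], items[b])
            if score > 0:
                related[a][b] += score
                related[b][a] += score

    # Step 3: keep the top_n related items, strongest first (ties by name).
    return {
        item: [other for other, _ in
               sorted(others.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]]
        for item, others in related.items()
    }
```

In this sketch a heavier event such as a purchase contributes more to a pair's strength than a view, which is the role the configurable weights play in the description above.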
