SSIS Incremental Loads

This article is reposted from http://sqlblog.com/blogs/andy_leonard/archive/2007/07/09/ssis-design-pattern-incremental-loads.aspx

 

Andy Leonard

Andy Leonard is CSO of Linchpin People and SQLPeople, an SSIS trainer, consultant, and developer; SQL Server database and data warehouse developer; community mentor; engineer; and farmer. He is a co-author of SQL Server 2012 Integration Services Design Patterns. His background includes web application architecture and development, VB, and ASP. Andy loves the SQL Server Community!

SSIS Design Pattern - Incremental Loads

Introduction
 
Loading data from a data source to SQL Server is a common task. It's used in Data Warehousing, but increasingly data is being staged in SQL Server for non-Business-Intelligence purposes.
 
Maintaining data integrity is key when loading data into any database. A common way of accomplishing this is to truncate the destination and reload from the source. While this method ensures data integrity, it also loads a lot of data that was just deleted.
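For contrast, a minimal sketch of the destructive (truncate-and-reload) approach, using the demo tables created later in this article - every row is deleted and reloaded on every run, even though most rows usually haven't changed:

TRUNCATE TABLE SSISIncrementalLoad_Dest.dbo.tblDest

INSERT INTO SSISIncrementalLoad_Dest.dbo.tblDest (ColID, ColA, ColB, ColC)
SELECT ColID, ColA, ColB, ColC
FROM SSISIncrementalLoad_Source.dbo.tblSource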
 
Incremental loads are faster and use fewer server resources. Only new or updated data is touched in an incremental load.
 
When To Use Incremental Loads
 
Use incremental loads whenever you need to load data from a data source to SQL Server.
 
Incremental loads are the same regardless of which database platform or ETL tool you use. You need to detect new and updated rows - and separate these from the unchanged rows.
 
Incremental Loads in Transact-SQL
 
I will start by demonstrating this with T-SQL:
 
0. (Optional, but recommended) Create two databases: a source and destination database for this demonstration:
 

CREATE DATABASE [SSISIncrementalLoad_Source]

CREATE DATABASE [SSISIncrementalLoad_Dest]

1. Create a source table named tblSource with the columns ColID, ColA, ColB, and ColC; make ColID the primary key:
 
USE  SSISIncrementalLoad_Source
GO
CREATE TABLE dbo.tblSource
 (ColID int NOT NULL
 ,ColA varchar(10) NULL
 ,ColB datetime NULL constraint df_ColB default (getDate())
 ,ColC int NULL
 ,constraint PK_tblSource primary key clustered (ColID))
 
2. Create a Destination table named tblDest with the columns ColID, ColA, ColB, ColC:
 
USE SSISIncrementalLoad_Dest
GO
CREATE TABLE dbo.tblDest
 (ColID int NOT NULL
 ,ColA varchar(10) NULL
 ,ColB datetime NULL
 ,ColC int NULL)
 
3. Let's load some test data into both tables for demonstration purposes:
 
USE SSISIncrementalLoad_Source
GO

-- insert an "unchanged" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(0, 'A', '1/1/2007 12:01 AM', -1)

-- insert a "changed" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(1, 'B', '1/1/2007 12:02 AM', -2)

-- insert a "new" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(2, 'N', '1/1/2007 12:03 AM', -3)

USE SSISIncrementalLoad_Dest
GO

-- insert an "unchanged" row
INSERT INTO dbo.tblDest (ColID,ColA,ColB,ColC)
VALUES(0, 'A', '1/1/2007 12:01 AM', -1)

-- insert a "changed" row
INSERT INTO dbo.tblDest (ColID,ColA,ColB,ColC)
VALUES(1, 'C', '1/1/2007 12:02 AM', -2)

4. You can view new rows with the following query:

SELECT s.ColID, s.ColA, s.ColB, s.ColC
FROM SSISIncrementalLoad_Source.dbo.tblSource s
LEFT JOIN SSISIncrementalLoad_Dest.dbo.tblDest d ON d.ColID = s.ColID
WHERE d.ColID IS NULL

This should return the "new" row - the one loaded earlier with ColID = 2 and ColA = 'N'. Why? The LEFT JOIN and WHERE clauses are the key. Left joins return all rows on the left side of the join clause (SSISIncrementalLoad_Source.dbo.tblSource in this case) whether there's a match on the right side of the join clause (SSISIncrementalLoad_Dest.dbo.tblDest in this case) or not. If there is no match on the right side, NULLs are returned. This is why the WHERE clause works: it goes after rows where the destination ColID is NULL. These rows have no match in the LEFT JOIN; therefore they must be new.

This is only an example. You occasionally find database schemas that are this easy to load. Occasionally. Most of the time you have to include several columns in the JOIN ON clause to isolate truly new rows. Sometimes you have to add conditions in the WHERE clause to refine the definition of truly new rows.

Incrementally load the row ("rows" in practice) with the following T-SQL statement:
INSERT INTO SSISIncrementalLoad_Dest.dbo.tblDest (ColID, ColA, ColB, ColC)
SELECT s.ColID, s.ColA, s.ColB, s.ColC
FROM SSISIncrementalLoad_Source.dbo.tblSource s
LEFT JOIN SSISIncrementalLoad_Dest.dbo.tblDest d ON d.ColID = s.ColID
WHERE d.ColID IS NULL
5. There are many ways by which people try to isolate changed rows. The only sure-fire way to accomplish it is to compare each field. View changed rows with the following T-SQL statement:
SELECT d.ColID, d.ColA, d.ColB, d.ColC
FROM SSISIncrementalLoad_Dest.dbo.tblDest d
INNER JOIN SSISIncrementalLoad_Source.dbo.tblSource s ON s.ColID = d.ColID
WHERE ((d.ColA != s.ColA)
 OR (d.ColB != s.ColB)
 OR (d.ColC != s.ColC))

This should return the "changed" row we loaded earlier with ColID = 1 and ColA = 'C'. Why? The INNER JOIN and WHERE clauses are to blame - again. The INNER JOIN goes after rows with matching ColIDs because of the JOIN ON clause. The WHERE clause refines the resultset, returning only rows where ColA, ColB, or ColC don't match and the ColIDs match. This is important: if there's a difference in any one or more of these columns (except ColID), we want to update the row.

Extract-Transform-Load (ETL) theory has a lot to say about when and how to update changed data. You will want to pick up a good book on the topic to learn more about the variations.

To update the data in our destination, use the following T-SQL: 
UPDATE d
SET d.ColA = s.ColA
 ,d.ColB = s.ColB
 ,d.ColC = s.ColC
FROM SSISIncrementalLoad_Dest.dbo.tblDest d
INNER JOIN SSISIncrementalLoad_Source.dbo.tblSource s ON s.ColID = d.ColID
WHERE ((d.ColA != s.ColA)
 OR (d.ColB != s.ColB)
 OR (d.ColC != s.ColC))
 
Incremental Loads in SSIS 
 
Let's take a look at how you can accomplish this in SSIS using the Lookup Transformation (for the join functionality) combined with the Conditional Split (for the WHERE clause conditions) transformations.
 
Before we begin, let's reset our database tables to their original state using the following query:

USE SSISIncrementalLoad_Source
GO
TRUNCATE TABLE dbo.tblSource

-- insert an "unchanged" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(0, 'A', '1/1/2007 12:01 AM', -1)

-- insert a "changed" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(1, 'B', '1/1/2007 12:02 AM', -2)

-- insert a "new" row
INSERT INTO dbo.tblSource (ColID,ColA,ColB,ColC)
VALUES(2, 'N', '1/1/2007 12:03 AM', -3)

USE SSISIncrementalLoad_Dest
GO
TRUNCATE TABLE dbo.tblDest

-- insert an "unchanged" row
INSERT INTO dbo.tblDest (ColID,ColA,ColB,ColC)
VALUES(0, 'A', '1/1/2007 12:01 AM', -1)

-- insert a "changed" row
INSERT INTO dbo.tblDest (ColID,ColA,ColB,ColC)
VALUES(1, 'C', '1/1/2007 12:02 AM', -2)
Next, create a new project using Business Intelligence Development Studio (BIDS). Name the project SSISIncrementalLoad:

Once the project loads, open Solution Explorer and rename Package1.dtsx to SSISIncrementalLoad.dtsx:

When prompted to rename the package object, click the Yes button. From the toolbox, drag a Data Flow onto the Control Flow canvas:

 

Double-click the Data Flow task to edit it. From the toolbox, drag and drop an OLE DB Source onto the Data Flow canvas: 

 

Double-click the OLE DB Source connection adapter to edit it:

 

Click the New button beside the OLE DB Connection Manager dropdown:

Click the New button here to create a new Data Connection:

Enter or select your server name. Connect to the SSISIncrementalLoad_Source database you created earlier. Click the OK button to return to the Connection Manager configuration dialog. Click the OK button to accept your newly created Data Connection as the Connection Manager you wish to define. Select "dbo.tblSource" from the Table dropdown:

 

Click the OK button to complete defining the OLE DB Source Adapter.

Drag and drop a Lookup Transformation from the toolbox onto the Data Flow canvas. Connect the OLE DB connection adapter to the Lookup transformation by clicking on the OLE DB Source and dragging the green arrow over the Lookup and dropping it. Right-click the Lookup transformation and click Edit (or double-click the Lookup transformation) to edit:

 

When the editor opens, click the New button beside the OLE DB Connection Manager dropdown (as you did earlier for the OLE DB Source Adapter). Define a new Data Connection - this time to the SSISIncrementalLoad_Dest database. After setting up the new Data Connection and Connection Manager, configure the Lookup transformation to connect to "dbo.tblDest":

 

Click the Columns tab. On the left side are the columns currently in the SSIS data flow pipeline (from SSISIncrementalLoad_Source.dbo.tblSource). On the right side are columns available from the Lookup destination you just configured (from SSISIncrementalLoad_Dest.dbo.tblDest). Follow these steps:

1. We'll need all the columns returned from the destination table, so check all the checkboxes beside the columns in the destination list. We need these columns for our WHERE clauses and for our JOIN ON clauses.

2. We do not want to map all the columns between the source and destination - we only want to map the column named ColID between the database tables. The mappings drawn between the Available Input Columns and Available Lookup Columns define the JOIN ON clause. Multi-select the mappings between ColA, ColB, and ColC by clicking on them while holding the Ctrl key. Right-click any of them and click "Delete Selected Mappings" to delete these columns from our JOIN ON clause.

3. Add the text "Dest_" to each column's Output Alias. These columns are being appended to the data flow pipeline. This is so we can distinguish between source and destination columns farther down the pipeline:

Next we need to modify our Lookup transformation behavior. By default, the Lookup operates as an INNER JOIN - but we need a LEFT (OUTER) JOIN. Click the "Configure Error Output" button to open the "Configure Error Output" screen. On the "Lookup Output" row, change the Error column from "Fail component" to "Ignore failure". This tells the Lookup transformation "If you don't find an INNER JOIN match in the destination table for the Source table's ColID value, don't fail." - which also effectively tells the Lookup "Don't act like an INNER JOIN, behave like a LEFT JOIN":

Click OK to complete the Lookup transformation configuration.

From the toolbox, drag and drop a Conditional Split Transformation onto the Data Flow canvas. Connect the Lookup to the Conditional Split as shown. Right-click the Conditional Split and click Edit to open the Conditional Split Editor:

 

Expand the NULL Functions folder in the upper right of the Conditional Split Transformation Editor. Expand the Columns folder in the upper left side of the Conditional Split Transformation Editor. Click in the "Output Name" column and enter "New Rows" as the name of the first output. From the NULL Functions folder, drag and drop the "ISNULL( <<expression>> )" function to the Condition column of the New Rows condition:

Next, drag Dest_ColID from the columns folder and drop it onto the "<<expression>>" text in the Condition column. "New Rows" should now be defined by the condition "ISNULL( [Dest_ColID] )". This defines the WHERE clause for new rows - setting it to "WHERE Dest_ColID Is NULL".

Type "Changed Rows" into a second Output Name column. Add the expression "(ColA != Dest_ColA) || (ColB != Dest_ColB) || (ColC != Dest_ColC)" to the Condition column for the Changed Rows output. This defines our WHERE clause for detecting changed rows - setting it to "WHERE ((Dest_ColA != ColA) OR (Dest_ColB != ColB) OR (Dest_ColC != ColC))". Note "||" is used to convey "OR" in SSIS Expressions:

 

Change the "Default output name" from "Conditional Split Default Output" to "Unchanged Rows":

Click the OK button to complete configuration of the Conditional Split transformation.

Drag and drop an OLE DB Destination connection adapter and an OLE DB Command transformation onto the Data Flow canvas. Click on the Conditional Split and connect it to the OLE DB Destination. A dialog will display prompting you to select a Conditional Split Output (those outputs you defined in the last step). Select the New Rows output:

Next connect the OLE DB Command transformation to the Conditional Split's "Changed Rows" output:

 

 Your Data Flow canvas should appear similar to the following:

Configure the OLE DB Destination by aiming at the SSISIncrementalLoad_Dest.dbo.tblDest table:

 

Click the Mappings item in the list to the left. Make sure the ColID, ColA, ColB, and ColC source columns are mapped to their matching destination columns (aren't you glad we prepended "Dest_" to the destination columns?):

 

Click the OK button to complete configuring the OLE DB Destination connection adapter.

Double-click the OLE DB Command to open the "Advanced Editor for OLE DB Command" dialog. Set the Connection Manager column to your SSISIncrementalLoad_Dest connection manager:

 

Click on the "Component Properties" tab. Click the elipsis (button with "...") beside the SQLCommand property:

 The String Value Editor displays. Enter the following parameterized T-SQL statement into the String Value textbox:

UPDATE dbo.tblDest
SET ColA = ?
 ,ColB = ?
 ,ColC = ?
WHERE ColID = ?

 

 The question marks in the previous parameterized T-SQL statement map by ordinal to columns named "Param_0" through "Param_3". Map them as shown below - effectively altering the UPDATE statement for each row to read:

UPDATE SSISIncrementalLoad_Dest.dbo.tblDest
SET ColA = SSISIncrementalLoad_Source.dbo.tblSource.ColA
 ,ColB = SSISIncrementalLoad_Source.dbo.tblSource.ColB
 ,ColC = SSISIncrementalLoad_Source.dbo.tblSource.ColC
WHERE ColID = SSISIncrementalLoad_Source.dbo.tblSource.ColID

Note the query is executed on a row-by-row basis. For performance with large amounts of data, you will want to employ set-based updates instead.
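A hedged sketch of the set-based alternative: land the "Changed Rows" output in a staging table with an OLE DB Destination instead of the OLE DB Command (the table name tblStageUpdates is illustrative, not part of this package), then run a single UPDATE from an Execute SQL Task after the Data Flow:

-- after the Data Flow loads dbo.tblStageUpdates from the "Changed Rows" output:
UPDATE d
SET d.ColA = u.ColA
 ,d.ColB = u.ColB
 ,d.ColC = u.ColC
FROM SSISIncrementalLoad_Dest.dbo.tblDest d
INNER JOIN SSISIncrementalLoad_Dest.dbo.tblStageUpdates u ON u.ColID = d.ColID

-- truncate the staging table so the next run starts empty
TRUNCATE TABLE SSISIncrementalLoad_Dest.dbo.tblStageUpdates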

 Click the OK button when mapping is completed.

Your Data Flow canvas should look like that pictured below:

 

If you execute the package with debugging (press F5), the package should succeed and appear as shown here:

 

Note one row takes the "New Rows" output from the Conditional Split, and one row takes the "Changed Rows" output from the Conditional Split transformation. Although not visible, our third source row doesn't change, and would be sent to the "Unchanged Rows" output - which is simply the default Conditional Split output renamed. Any row that doesn't meet any of the predefined conditions in the Conditional Split is sent to the default output.

That's all! Congratulations - you've built an incremental database load! :)

Get the code! (Free registration required)

:{> Andy

Published Monday, July 09, 2007 3:13 PM by andyleonard


Comments

 

Alberto Ferrari said:

Andy, maybe you are interested in taking a look at the TableDifference component I published at http://www.sqlbi.eu.

It is an all-in-one and completely free SSIS component that handles these kinds of situations without the need to cache data in the Lookup. Lookups are nice but - in real situations - they may quickly lead to out-of-memory situations (think of a hundred-million-row table... it simply cannot be cached in memory).

Beware that - for huge table comparison - you will need both TableDifference AND the FlowSync component that you can find at the same site.

I'll be glad to hear your comments about it.

Alberto

July 12, 2007 5:21 AM
 

andyleonard said:

Thanks Alberto! Checking it out now.

:{> Andy

July 13, 2007 9:30 PM
 

David R Buckingham said:

Thank you greatly Andy.  This couldn't have come at a better time as I just started using Integration Services for the first time on Friday to handle eight different data loads (all for a single client).  Four of the data loads are straight appends, but the other four are incremental.

This approach is vastly superior to loading the incremental data into a temporary table and then processing it against the destination table.  In fact, it proved to be more efficient than both set-based insert/updates and a cursor-based approach.  Yes, I tested both approaches prior to implementing yours.  Your approach was faster than the set-based insert/updates even though I tested it across the WAN, which surprised me greatly.

I also created a script to assist with the creation of the Conditional Split "Changed Rows" condition which follows (be sure your results aren't being truncated when you have a table with many columns):

--- BEGIN SCRIPT ---

DECLARE @Filter varchar(max)
SET @Filter = ''

-- ((ISNULL(<ColumnName>)?"":<ColumnName>)!=(ISNULL(Dest_<ColumnName>)?"":Dest_<ColumnName>)) ||
SELECT @Filter = @Filter + '((ISNULL(' + c.[name] + ')?"":' + c.[name] + ')!=(ISNULL(Dest_' + c.[name] + ')?"":Dest_' + c.[name] + ')) || '
FROM sys.tables t
INNER JOIN sys.columns c
  ON t.[object_id] = c.[object_id]
WHERE SCHEMA_NAME( t.[schema_id] ) = 'GroupHealth'
  AND t.[name] = 'ConsumerDetail'
  AND c.[is_identity] = 0
  AND c.[is_rowguidcol] = 0
ORDER BY c.[column_id]

SET @Filter = LEFT( @Filter, LEN( @Filter ) - 2 )
SELECT @Filter

--- END SCRIPT ---

Again, thanks greatly.  I now have 2 SSIS books on their way to me.  I am eager to learn as much as I can.

July 17, 2007 3:52 PM
 

Bill Mo said:

Hello, Andy! Thanks a lot for your incremental process! I'm doing an SSIS project!

July 17, 2007 9:47 PM
 

david boston said:

Thanks this worked a treat for my SSIS project.

July 20, 2007 5:01 AM
 

andyleonard said:

Hi David, Bill, and David,

  Thanks for the feedback!

:{> Andy

August 8, 2007 7:14 PM
 

saul said:

Hi Andy !!  Great work... I was scared because of this Incremental load... and you saved my weekend... now I can enjoy it .... :-)

September 7, 2007 5:56 PM
 

Steve Hall said:

Anyone had a problem with the insert and update commands locking each other out?

Didn't happen at first but does now.  Update gets blocked by the insert and it just hangs.

Steve

September 18, 2007 1:18 PM
 

andyleonard said:

Thanks Saul!

Steve, are you sure there's not something more happening on the server that's causing this?

If this is repeatable, please provide more information and I'll be happy to take a look at it.

SQL Server does a fair job of detecting and managing deadlocks when they occur. I haven't personally seen SQL Server "hang" since 1998 - and then it was due to a failing I/O controller.

:{> Andy

September 27, 2007 6:57 PM
 

Bill Mo said:

Hi, Andy! I have the same problem as Steve - it's blocking. When bulk insert and update happen, the update gets blocked by the insert and it just hangs! The insert's wait type is ASYNC_NETWORK_IO.

October 8, 2007 4:15 AM
 

Bobby said:

Thx 4 the trick with Fail -> Left Join ! I was thinking how to do it whole day :o)

October 18, 2007 1:23 AM
 

Andy Leonard said:

Introduction This post is part of a series of posts on ETL Instrumentation. In Part 1 we built a database

November 18, 2007 10:53 PM
 

Michael Ross said:

Steve,

This most certainly can be the case with larger datasets.  In my case, I ran into this issue with large FACT table loads.  Either consider dumping the contents of the insert into a temp table or SSIS RAW datafile and complete the insert in a separate dataflow task or modify the isolationlevel of the package.  Be warned, make sure you research the IsolationLevel property thoroughly before making such a change.

November 26, 2007 12:03 PM
 

Michael said:

What happens when a field is NULL in the destination or source when determining changed rows? Don't we need special checks to ensure that if a destination field is NULL, the source is also NULL? Otherwise a change has occurred and the record should be updated?

December 26, 2007 10:26 AM
 

andyleonard said:

Hi Michael,

  Excellent question! This post was intended to cover the principles of Incremental Loads, and not as a demonstration of production-ready code. </CheesyExcuse>

  There are a couple approaches to handling NULLs in the source or destination, each with advantages and disadvantages. In my opinion, the chief consideration is data integrity and the next-to-chief consideration is metadata integrity.

  A good NULL trap can be tricky because NULL == NULL should never evaluate to True. I know NULL == NULL can evaluate to True with certain settings, but these settings also have side-effects. And then there's maintenance to consider... basically, there's no free lunch.

  A relatively straightforward method involves identifying a value for the field that the field will never contain (i.e. -1, "(empty)", or even the string "NULL") and using that value as a substitute for NULL. In the SSIS expression language you can write a change-detection expression like:

(ISNULL(Dest_ColA) ? -1 : Dest_ColA) != (ISNULL(ColA) ? -1 : ColA)

  But again, if ColA is ever -1 this will evaluate as a change and fire an update. Why does this matter? Some systems include "number of updated rows" as a validation metric.
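  The same sentinel idea carries over to the T-SQL change-detection query from the article. A minimal sketch, with the same caveat that each sentinel must be a value the column never legitimately holds:

WHERE ISNULL(d.ColA, '') != ISNULL(s.ColA, '')
 OR ISNULL(d.ColB, '19000101') != ISNULL(s.ColB, '19000101')
 OR ISNULL(d.ColC, -1) != ISNULL(s.ColC, -1)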

:{> Andy

December 26, 2007 12:50 PM
 

Michael said:

Hi Andy,

Thanks for this great article!

Do you have any hints for implementing your design with an Oracle source? I am attempting to incrementally update from a table with 7 million rows and ~50 fields. The Lookup task failed when I attempted to use it like you described above due to a Duplicate Key error... cache is full. I googled this and found an article suggesting enabling restrictions and smaller cache amounts. However, it is now extremely slow. Do you have any experience/advice on tweaking the lookup task for my environment?

Is there value in attempting to port this solution to an Oracle to SQL environment?

Is there a way to speed things up/replace the lookup task by using a SQL Execution Task which calls a left outer join?

Is there major difference\impact in having multiple primary keys?

Thanks Again

December 26, 2007 1:47 PM
 

Andy Leonard said:

Now that our 5-month old son - Riley Cooper - is on the mend, I am hitting the speaking trail again!

January 6, 2008 6:16 PM
 

Jigs said:

Hi Andy, looks great and works great too, but if there are more records to update then it just hangs while doing the insert and update. So what should I do? Is there any workaround by which we can avoid the hanging of the SSIS package? Please suggest.

Thanks

Jigu

January 15, 2008 3:36 PM
 

andyleonard said:

Hi Bill and Jigu,

Although I mention set-based updates here I did not demonstrate the principle because I felt the post was already too long - my apologies.

I have since written more on Design Patterns. Part 3 of my series on ETL Instrumentation (http://sqlblog.com/blogs/andy_leonard/archive/2007/11/18/ssis-design-pattern-etl-instrumentation-part-3.aspx#SetBasedUpdates) demonstrates set-based updates.

I need to dedicate a post to set-based updates.

:{> Andy

January 16, 2008 7:10 AM
 

Jai said:

Hi Andy

Thanks - you were a great help in understanding data updates through an SSIS package.

April 5, 2008 6:16 PM
 

Kenneth said:

Hi Andy,

I have a hard time following your instructions. Can you send me your sample project?

Thank You

Kenneth

[email protected]

July 29, 2008 1:44 PM
 

andyleonard said:

Hi Kenneth,

  Sorry to hear you're having a hard time with my instructions.

  One of the last instructions is a link at the bottom of the page called "Get the code". It points to this URL: http://vsteamsystemcentral.com/dnn/Demos/IncrementalLoads/tabid/94/Default.aspx.

Hope this helps,

Andy

July 29, 2008 1:59 PM
 

EAD said:

Not sure - I posted the same question in a few places…. Maybe you gurus can explain.

In SSIS, the Fuzzy Grouping object creates some temp tables and does the fuzzy logic. I ran a trace to see how it does it; in one cursor it is taking a very long time to process 150000 records. The same executes fine in other test environments.  The cursor is simple and I can post it if needed. Any thoughts?

September 11, 2008 8:45 PM
 

LNelson said:

I have a similar package I am trying to create and this was a big help.  The new rows write properly; however, I am getting an error on the changed rows because the SQL table I am writing to has an auto-incremented identity spec column.  The changes won't write to the SQL table.  If I uncheck "keep identity" it writes new rows instead of updating existing ones.  What am I missing?

December 1, 2008 11:38 AM
 

FDA said:

Thanks a lot, Andy!! Very helpful!

December 17, 2008 3:48 AM
 

Rajesh said:

Hi Andy..

  That's a good alternative for a slowly changing dimension...!!

  Well done...

  What if the incremental load is based on more than one column...?

  And further, to increase the complications, what if any of the columns

  included in the lookup condition change as well....?

  Last one... what if the row is deleted from the source....?

January 6, 2009 3:23 AM
 

Ken ([email protected]) said:

It looks like your package handles new and updated rows.

I don't see the code handling the deleted rows in the source (assuming that there are some).

Here is my two cents.

in your lookup, you can split out the match and non-match rows.

Non-match means a new record, and you can do an insert directly after the lookup. You can eliminate the 'new row' condition in your 'conditional split'.

However, overall, your sample package is the best (as far as I have searched) sample on the net (I love it, honestly).

Keep up the great work and keep giving out sample packages.

Like most people, I do appreciate your effort.

Ken

January 7, 2009 8:10 PM
 

andyleonard said:

Hi Ken,

  Thanks for your kind words.

  I believe you're referring to functionality new to the SSIS 2008 Lookup Transformation - there is no Non-Match Rows output buffer in the SSIS 2005 Lookup Transformation.

:{> Andy

January 7, 2009 9:58 PM
 

RVS said:

Hi Andy,

Thanks a lot for this article. It proved to be a great help for me.

I was wondering if you can provide some solution to handle deleted rows from source table using lookup. I need this because I have to keep the historical data in the data warehouse.

Thanks in advance,

RVS

[email protected]

January 21, 2009 3:04 AM
 

Charlie Asbornsen said:

Andy, thanks for your help and effort.  This is definitely more elegant than staging over to one database and then doing ExecuteSQLs to execute incremental loads.

January 21, 2009 5:16 PM
 

Charlie Asbornsen said:

And re ranvijay's question, I would assume that when the row exists in the destination but not the source, the source RowID would show up as null, so you could do that as another split on the conditional.

January 21, 2009 5:18 PM
 

andyleonard said:

Hi RVS and Charlie,

  RVS, Charlie answered your question before I could get to it! I love this community!

  I need to write more on this very topic. New features in SQL Server 2008 change this and make the Deletes as simple as New and Updated rows.

  I didn't mention Deletes in this post because the main focus was to get folks thinking about leveraging the data flow instead of T-SQL-based solutions (Charlie, in regards to your first comment). There's nothing wrong with T-SQL. But a data flow is built to buffer (or "paginate") rows. It bites off small chunks, acts on them, and then takes another bite. This greatly reduces the need to swap to disk - and we all know the impact of disk I/O on SQL Server performance.

  Charlie is correct. The way to do Deletes is to swap the Source and Destinations in the Correlate / Filter stages.

  Typically, I stage Deletes and Updates in a staging table near the table to be Deleted / Updated. Immediately after the data flow, I add an Execute SQL Task to perform a correlated (inner joined) update or delete with the target table. I do this because my simplest option inside a data flow is row-based Updates / Deletes using the OLE DB Command transformation. A set-based Update / Delete is a lot faster.
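  A minimal sketch of that staged, set-based Delete (the staging table stgDeletes and its load are illustrative, not part of the demo package):

-- a Data Flow stages the ColIDs of rows to be removed into dbo.stgDeletes,
-- then an Execute SQL Task runs one correlated delete:
DELETE d
FROM SSISIncrementalLoad_Dest.dbo.tblDest d
INNER JOIN SSISIncrementalLoad_Dest.dbo.stgDeletes s ON s.ColID = d.ColID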

  I need to write more about that as well...

:{> Andy

January 21, 2009 5:29 PM
 

Charlie Asbornsen said:

Andy,

Looks like I have some rewriting to do on the next version of the ETL.  It's a good thing I enjoy working in SSIS!

I'm working on building a data warehouse and BI solution for a government customer, and a lot of their 1970's era upstream data sources don't have ANY kind of data validation.  In fact when we first installed in production we found out that they had some code fields in their data tables with a single quote for data!  It played merry hob with our insert statements until we figured out what was happening. Then I got to figure out how to do D-SQL whitelisting with VB scripting in SSIS :)

Of course since it's the government we'll probably have to wait until 3Q 2010 before we're allowed to upgrade to SQL 2008.  We were all gung ho about VS 2008 (which we were allowed to get) but imagine my chagrin when I found out that I couldn't use my beloved BI Studio without SQL 2008... :P  So I'll be using this for the next version... and possibly the version after that as well. Thanks a bunch!

January 21, 2009 5:41 PM
 

Charlie Asbornsen said:

Me again.

I think I made a mistake.  If a row already exists in the destination table and it no longer exists in the source table, I want it deleted (sent to the deletes staging table).  However, the lookup limits the row set in memory to items that are already in the source table, so it's not really functioning as an outer join.  It's perfect for determining inserts and updates, but I need to do something else to do deletes...

I'm going to try adding an additional OLE DB source and point that at the same table the lookup is checking... hmm, maybe try the Merge?  I'll see what happens and let you know.

January 22, 2009 12:41 PM
 

Charlie Asbornsen said:

Actually I think I need a second pass... grrr.

January 22, 2009 12:44 PM
 

Charles Asbornsen said:

Andy,

Please feel free to combine this with the previous reply.

What I wound up doing was creating a second data flow after the one that split the inserts and updates out.  The deletes flow populated a deleted rows staging table with the deleted row id, which then was joined to the ultimate destination table in a delete command in an Execute SQL task.  I wound up reversing the lookup, but used the same technique by using a conditional split on whether or not the new column from the lookup was null, and if it was, the output went to the "deleted records" path, which populated the staging table.

The reason I want to actually remove the data from the table as opposed to merely marking it as deleted is because the reason a row would disappear would be because it was a bad reference code in the first place.  My big datawarehouse ETL adds new reference codes to the reference tables (which it needs to create in the first place because the source reference codes are held in these five gigantic tables which do not lend themselves to generating NV lists) for unmatched codes in the data tables (remember there's no validation at the source).  

When the reconciliation stick finally gets swung and the customer replaces the junk code it disappears from my ETL and I remove it from my table.  It is different from a code that gets obsoleted; there's a reason to track those, but garbage just needs to be thrown out.

Thanks again, I would have been very annoyed with myself if I wound up doing row-based IUDs...

January 22, 2009 2:55 PM
 

andyleonard said:

Hi Charles,

  I wasn't clear in my earlier response but you figured it out anyway - apologies and kudos. You do need to do the Delete in another Data Flow Task.

  Excellent work!

:{> Andy

January 22, 2009 4:15 PM
 

Charles Asbornsen said:

Andy,

Is there a limit to how many comparisons you can make in the Conditional Split Transformation Editor?  I have a table with 20 columns, and I'm trying to do 19 comparisons.  It's telling me that one of the columns doesn't exist in the input column collection.  I can cut the expression and paste it back in and it picks a different column to complain about.  Error 0xC0010009... it says the expression cannot be parsed, may contain invalid elements or might not be well formed, and there may also be an out-of-memory error.

I've been looking at it for 1/2 an hour and all the columns it is variously complaining about are present in the input column collection, so I suspect it's a memory error.  Should I alias the column names to be shorter (ie the problem is in the text box) or is it a metadata problem?  I'm going home now but tomorrow I will see if splitting the staging table into 4 tables and splitting the conditions into 4 outputs (to be recombined later by an execute SQL command into the real staging table) does what I need.

Thanks!

Charlie

January 22, 2009 5:54 PM
 

RVS said:

Hi Andy and Charles,

I thank you for your comments. I still have a few doubts related to handling deleted rows. I have created a solution to handle all three cases (add, update and delete). I have taken two OLE DB Sources (one with the source table's data and another with the destination table's data), then I have SORTED them and MERGED them (with a FULL OUTER join) and finally used a CONDITIONAL SPLIT to filter new, updated and deleted data, and used the OLE DB Command to do the required action. I am getting the deleted rows by using the full outer join.

I am getting expected result with this solution but I think this is not performance efficient as it is using sort, merge etc. I wanted to use Lookup as suggested by Andy. But the solution which you both have given is not fully clear to me. Will it be possible for you to send me a sketch of the proposed solution or explain it a bit in detail?

Charles, regarding no. of comparisons, I don't think it is limited to 19 or 20 because I have used more than 35 comparisons and that is working fine. Please check if you have checked for null columns correctly.

Thanks once again,

RVS

([email protected])

January 23, 2009 6:57 AM
 

Charlie Asbornsen said:

Doh! Thanks Ranvijay.

January 23, 2009 10:01 AM
 

Charlie Asbornsen said:

Actually what was happening was that since the comparison expression was so long I moved it into WordPad to type it and then copy/pasted into the rather annoyingly non-resizable condition field in the conditional split transformation editor.  It turns out it doesn't like that.  Maybe there were invisible control characters in the string, so I needed to just bite the bullet and type in the textbox.  It works fine now.

It would be nice to have a text visualizer for that field.

Thanks!

January 23, 2009 1:51 PM
 

vidhya said:

This was an excellent article and Andy's illustration style is great.

Thank you

June 30, 2009 9:47 AM
 

Nostromo said:

Great tutorial!  I'm new to SSIS and I worked through it without a hitch.

Thanks!!!

July 10, 2009 10:23 AM
 

DVL said:

Hello,

Many thanks for the step-by-step guide.

It's nice to find a way to get your changed and new records in 2 separate outputs. But how would you get the deleted records? The only solution I found is to look up every PK in the source db table and check if it still exists. If it doesn't, it will set the deleted_flag to 1. Do you have any idea how to implement the deleted records into your solution? Mine is in a separate dataflow.

Greetings  

August 27, 2009 8:05 AM
 

CSu said:

Great article! I originally used sort, merge join (with left outer join) and conditional split transforms to perform incremental load. Unfortunately it did not work as expected. Your article has simplified my design and it is now working perfectly. Thanks for sharing. :)

October 26, 2009 7:26 AM
 

hasan said:

Dear Andy

your solution is great but I have a problem. The dimensions are not getting populated with the default data. Does this work with an Excel source? Because I have an Excel source.

December 29, 2009 7:31 AM
 

Mike said:

Hiya,

Just read the article, confirms my approach to incremental loading on a series of smallish facts.

I have used the "slowly changing dimension" element in the past to facilitate the same outcome, ie not using type2s (despite being a fact) - but it is much slower.

RVS, re: "I am getting expected result with this solution but I think this is not performance efficient as it is using sort, merge etc"; if the sort(s) are the main problem, you can do the sort on the database and tell SSIS that the set is sorted to avoid using two Sort dataflow tasks - not sure if that will give you sufficient gains? The Merge Join, as you say, will still be not great within SSIS.

Lastly - has anyone any experience of duplicated KEYS in the source table, that do not (yet) exist in the destination?

I am performing bulk-inserts after the update/insert evaluation. I have a minor concern: if I have a duplicated key in the source data, the FIRST record will correctly INSERT; does the lookup then add this key to memory, so that when the second key arrives it knows to update?

Because, although I do not constrain the destination table, it will cause problems within the data (mini cartesians - *shudder*).

Do I need to be aware of any settings or the like? I am about to do a test-case now - and see what happens...

January 24, 2010 5:35 PM
 

Mandar said:

Hi Andy,

I want to load data incrementally from a source (MySQL 5.2) to SQL Server 2008, using SSIS 2008, based on modified date. Somehow I am not able to do it as MySQL doesn't support parameters. Need some help on this.

-regards, mandar

March 15, 2010 6:40 AM
 

Ramdas said:

Thank you Andy for this tutorial. I am using SSIS 2008; the Lookup task interface has changed a little bit - when you click Edit on the Lookup task, the opening screen is laid out differently.

March 25, 2010 9:46 AM
 

KK said:

In my source, the record with ID 3 has duplicated keys, so I want the first record inserted and the second record updated in the destination table through SSIS.

Can anyone help me resolve this problem?

When I use SCD type 2, when it reads the record, the ID 3 record is not yet available in the target, so it treats the second record as an insert as well.

So that record is inserted two times. I don't want that; I want the first record of ID 3 inserted and the second record of ID 3 updated.

Is there any way to resolve this problem?

ID  Name      Date
1   Kiran     1/1/2010 12:00:00 AM
3   Rama      1/2/2010 12:00:00 AM
2   Dubai     1/2/2010 12:00:00 AM
3   Ramkumar  1/2/2010 12:00:00 AM

March 25, 2010 5:11 PM
 

Craig said:

I need to incrementally load data from Sybase to SQL.  There will be several hundred million rows.  Will this approach work OK with this scenario?

March 30, 2010 10:45 AM
 

andyleonard said:

Hi Craig,

  Maybe, but most likely not. This is one design pattern you can start with. I would test this, tweak it, and optimize like crazy to get as much performance out of your server as possible.

:{> Andy

March 30, 2010 10:52 AM
 

jpedroalmeida said:

Hi there from Portugal,

Andy, I am a starter in SSIS and I found this article very useful and straightforward in its explanation with text and images...

Thanks a lot!!

Cheers

April 25, 2010 11:02 AM
 

JohnnyReaction said:

Hi Andy

I amended your script to deal with different datatypes (saves a lot of debugging in the Conditional Split Transformation Editor):

/*
This script assists with the creation of the Conditional Split "Changed Rows" condition
-- be sure your results aren't being truncated when you have a table with many columns
*/

--- BEGIN SCRIPT ---

USE master
GO

DECLARE @Filter varchar(max)
SET @Filter = ''

SELECT @Filter = @Filter + '((ISNULL(' + c.[name] + ')?'+
CASE WHEN c.system_type_id IN (35,104,167,175,231,239,241) THEN '""'
     WHEN c.system_type_id IN (58,61) THEN '(DT_DBTIMESTAMP)"1900-01-01"'
     ELSE '0' END
+ ':' + c.[name] + ')!=(ISNULL(Dest_' + c.[name] + ')?' +
CASE WHEN c.system_type_id IN (35,104,167,175,231,239,241) THEN '""'
     WHEN c.system_type_id IN (58,61) THEN '(DT_DBTIMESTAMP)"1900-01-01"'
     ELSE '0' END
+':Dest_' + c.[name] + ')) || '
FROM sys.tables t
INNER JOIN sys.columns c
  ON t.[object_id] = c.[object_id]
WHERE SCHEMA_NAME( t.[schema_id] ) = 'dbo'
  AND t.[name] = 'DimUPRTable'
  AND c.[is_identity] = 0
  AND c.[is_rowguidcol] = 0
ORDER BY c.[column_id]

SET @Filter = LEFT(@Filter, (LEN(@Filter) - 2))
SELECT @Filter

--SELECT c.*
--FROM sys.tables t
--JOIN sys.columns c
--  ON t.[object_id] = c.[object_id]
--WHERE SCHEMA_NAME( t.[schema_id] ) = 'dbo'
--  AND t.[name] = 'DimUPRTable'
--  AND c.[is_identity] = 0
--  AND c.[is_rowguidcol] = 0
--ORDER BY c.[column_id]

--SELECT
--  schemas.name AS [Schema]
--  ,tables.name AS [Table]
--  ,columns.name AS [Column]
--  ,CASE WHEN columns.system_type_id = 34 THEN 'byte[]'
--        WHEN columns.system_type_id = 35 THEN 'string'
--        WHEN columns.system_type_id = 36 THEN 'System.Guid'
--        WHEN columns.system_type_id = 48 THEN 'byte'
--        WHEN columns.system_type_id = 52 THEN 'short'
--        WHEN columns.system_type_id = 56 THEN 'int'
--        WHEN columns.system_type_id = 58 THEN 'System.DateTime'
--        WHEN columns.system_type_id = 59 THEN 'float'
--        WHEN columns.system_type_id = 60 THEN 'decimal'
--        WHEN columns.system_type_id = 61 THEN 'System.DateTime'
--        WHEN columns.system_type_id = 62 THEN 'double'
--        WHEN columns.system_type_id = 98 THEN 'object'
--        WHEN columns.system_type_id = 99 THEN 'string'
--        WHEN columns.system_type_id = 104 THEN 'bool'
--        WHEN columns.system_type_id = 106 THEN 'decimal'
--        WHEN columns.system_type_id = 108 THEN 'decimal'
--        WHEN columns.system_type_id = 122 THEN 'decimal'
--        WHEN columns.system_type_id = 127 THEN 'long'
--        WHEN columns.system_type_id = 165 THEN 'byte[]'
--        WHEN columns.system_type_id = 167 THEN 'string'
--        WHEN columns.system_type_id = 173 THEN 'byte[]'
--        WHEN columns.system_type_id = 175 THEN 'string'
--        WHEN columns.system_type_id = 189 THEN 'long'
--        WHEN columns.system_type_id = 231 THEN 'string'
--        WHEN columns.system_type_id = 239 THEN 'string'
--        WHEN columns.system_type_id = 241 THEN 'string'
--   END AS [Type]
--  ,columns.is_nullable AS [Nullable]
--FROM sys.tables tables
--INNER JOIN sys.schemas schemas
--  ON (tables.schema_id = schemas.schema_id)
--INNER JOIN sys.columns columns
--  ON (columns.object_id = tables.object_id)
--WHERE tables.name <> 'sysdiagrams'
--  AND tables.name <> 'dtproperties'
--ORDER BY [Schema], [Table], [Column], [Type]

--- END SCRIPT ---

July 28, 2010 8:26 AM
 

Paul Klotka said:

Using T-SQL to do change detection.

I would not use a join to detect change because in the where clause you need to handle NULL values. For example if ColA in Source is NULL it doesn't matter what ColA is in the destination, the where clause will return false and not detect the change.

To get around this I use a union to detect change. Here is an example.

select ColId, ColA, ColB, ColC from Source
union
select ColId, ColA, ColB, ColC from Dest

This returns a distinct set of rows, including handling NULL values. All that is left is to determine if the ColId appears more than once in the set.

select ColId from (
  select ColId, ColA, ColB, ColC from Source
  union
  select ColId, ColA, ColB, ColC from Dest
) x
group by ColId
having count(*) > 1

Now I have a list of keys which changed. I can take this list and sort it to use in a merge join in SSIS or I can use it as a subquery to join back to the Source table. See below.

select s.ColId, s.ColA, s.ColB, s.ColC from Source s
inner join (
  select ColId from (
    select ColId, ColA, ColB, ColC from Source
    union
    select ColId, ColA, ColB, ColC from Dest
  ) x
  group by ColId
  having count(*) > 1
) y
on s.ColId = y.ColId

July 28, 2010 2:06 PM
 

Chhavi said:

Thanks for the good explanation and screenshots. I found this website to be extremely helpful and supportive.

Please let me know if I can learn something more from you and the rest of the guys visiting this website, so that we can become better in SSIS and SQL Server 2005 or 2008.

Please provide us similar articles so that we can go through them and practice.

Thanks again Andy.

Long Live Andy :)

August 18, 2010 3:59 PM
 

AP said:

This is an excellent article! Great job.

October 11, 2010 10:12 AM
 

TheAviator said:

Thank you very very much. Nowhere on the net have I found it explained in such detail and so clearly. Thanks again.

October 22, 2010 11:08 AM
 

V said:

My requirement is

Update: if a record exists in both tables, compare them, and update the value in the destination table if it is different.

Insert: if a record doesn't exist in the destination table, add the new record to the destination table.

Delete: if a record exists in the destination table but not in the source table, delete the record from the destination table.

The above code performs only Insert and Update; however, it doesn't Delete data from the destination table that has been deleted from the source data.

I would not like to Truncate/Delete ALL data from the destination table.

Please let me know how shall i do this.

Basically, it should perform Update, Insert, Delete in one single package (task)

December 20, 2010 6:39 PM
 

Dpostman said:

Probably not the most common scenario, but if your source and destinations are coming from sql server could you just select checksum(*) as a column from your source and destination tables and test it to determine if the row has been updated? I would think it would be a pretty safe alternative when you have a lot of columns.  

Or has anyone created a hashing formula in an expression?  (or would that be too slow to consider?)
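A minimal sketch of this checksum idea against the demo tables from the article - with the caveat that CHECKSUM can collide (different rows can hash to the same value), so it can miss changes; HASHBYTES is a stronger, slower alternative:

SELECT s.ColID
FROM SSISIncrementalLoad_Source.dbo.tblSource s
INNER JOIN SSISIncrementalLoad_Dest.dbo.tblDest d ON d.ColID = s.ColID
WHERE CHECKSUM(s.ColA, s.ColB, s.ColC) != CHECKSUM(d.ColA, d.ColB, d.ColC)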

January 17, 2011 5:16 PM
 

dbraver said:

The following new feature of 2008 R2 seems to do the same: http://technet.microsoft.com/en-us/library/bb510625.aspx.
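For reference, a minimal sketch of the T-SQL MERGE that link describes (SQL Server 2008+), expressed against the demo tables from the article:

MERGE SSISIncrementalLoad_Dest.dbo.tblDest AS d
USING SSISIncrementalLoad_Source.dbo.tblSource AS s
 ON d.ColID = s.ColID
WHEN MATCHED AND (d.ColA != s.ColA OR d.ColB != s.ColB OR d.ColC != s.ColC)
 THEN UPDATE SET ColA = s.ColA, ColB = s.ColB, ColC = s.ColC
WHEN NOT MATCHED BY TARGET
 THEN INSERT (ColID, ColA, ColB, ColC) VALUES (s.ColID, s.ColA, s.ColB, s.ColC);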

February 16, 2011 7:28 PM
 

Peter Schott said:

Merge may be able to handle this, but if you start working with millions of rows, it tends not to perform too well.  For smaller data sets, it's pretty effective, but I've seen that command sit for a while as Merge tries to calculate the updates/inserts.

April 20, 2011 4:52 PM
 

Samit Shah said:

Hi Andy,

It's a great article, very well explained. But I was wondering: I have around 20-30 tables in MySQL and I have to use an SSIS package for moving the data of these tables to SQL Server. Some of the tables have 40-50 million records. I have to do this load very frequently (might be daily). Is this the best approach for it, or can you suggest a better approach?

April 25, 2011 8:44 AM
 

andyleonard said:

Change Detection is a topic of many design patterns. Here I used a rather brute-force method for detecting updates, mainly to demonstrate the concept. There are more methods for detecting changed rows and I hope to blog about some soon.

Thanks for the comments!

:{>

April 25, 2011 7:33 PM
 

Kal said:

This works great!

Now the issue is, I have more than 100 tables which need an incremental load..

Do I have to build 100 packages, or is there an easy way out?

Please Help..

May 12, 2011 6:24 AM
 

Thiru-BI said:

I just want to load based on the date. I.e., if our table has a date column, we capture the maximum date during the load; when running our package again, it allows only the records which have a date greater than the already-captured date. By allowing only those records we get only the new records into the target.

Can you please give some idea of how to achieve this?

Thanks & Regards,

Thirunavukkarasu P

May 30, 2011 3:33 AM
 

Michael Baumanns said:

Hi Kal,

I actually have the same problem. Maybe you can use the Slowly Changing Dimension tool. This generates the update and insert commands for you.

May 30, 2011 9:50 AM
 

Romualdo said:

Hi Andy!

Thanks a lot for that topic.

I'm starting to use SSIS and I have some doubts:

a) If my destination database is null, I get an error. It doesn't insert the rows.

b) What can I do to delete rows in the destination database that do not exist in the source database?

c) If a source column comes with null, it doesn't update the destination. But I think I understand why, as I see your suggestion:

(ISNULL(Dest_ColA) ? -1 : Dest_ColA) != (ISNULL(ColA) ? -1 : ColA)

July 20, 2011 2:10 PM
 

Lavanya said:

Thank u a lot...nice explanation

July 26, 2011 9:00 AM
 

J Channin said:

Could you please explain further....

"Note the query is executed on a row-by-row basis. For performance with large amounts of data, you will want to employ set-based updates instead."

Thank you

August 4, 2011 3:03 PM
 

D Sharma said:

Experts, I have a question. I'm working on loading a very large table having existing data on the order of 150 million records, which will keep growing by 1 million records added on a daily basis. A few days back the ETL started failing even after running for 24 hrs. In the DFT, we have a source query pulling 1 million records which is LOOKed UP against the destination table having 150 million records to check for new records. It is failing as the LOOKUP cannot hold data for 150 million records. I have tried changing the LOOKUP to a Merge Join without success. Can you please suggest alternative designs to load the data into the large table successfully? Moreover, there is no way I can reduce the size of the destination table. I already have indexes on all required columns. Hope I'm clear in explaining the scenario.

August 17, 2011 10:47 AM
 

andyleonard said:

Hi D,

  You can limit the number of rows used by the Lookup by using a SQL query as the Lookup source instead of the entire table.
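  For example, a hedged sketch of a narrowed Lookup source query (the LoadDate filter is purely illustrative - substitute whatever predicate fits your data):

SELECT ColID, ColA, ColB, ColC
FROM dbo.tblDest
-- WHERE LoadDate >= DATEADD(dd, -7, GETDATE())  -- hypothetical row filter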

Hope this helps,

Andy

August 17, 2011 11:11 AM
 

D Sharma said:

Thanks for the reply, Andy. I have already mentioned that it's not possible for me to actually reduce the size of the LOOKUP query since I need to check existing rows which can be anywhere in the table. Something came to mind just now on which I would like expert comments: I'm thinking about splitting the target table into parts with a SQL query and using it to join with the source SQL query to find possible newer records, which would be joined with the various parts of the destination table one by one to get to the actual new records. Will try that at work tomorrow; it just came to mind now. Would appreciate a lot if you can suggest any better alternative.

August 17, 2011 12:03 PM
 

andyleonard said:

Hi D,

  I recommend identifying the rows in the large table before you reach the lookup, and staging the data you need to return from the lookup - along with the lookup-matching criteria - in another table. Truncate this table prior to loading it. Populate it. Then use it for the lookup operation. Kimball refers to this as "key staging".
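  A minimal sketch of that key-staging pattern (the staging table stgDestKeys and the key-range bounds are illustrative):

-- 1. empty the key-staging table
TRUNCATE TABLE dbo.stgDestKeys

-- 2. stage only the destination rows that could match this extract
INSERT INTO dbo.stgDestKeys (ColID, ColA, ColB, ColC)
SELECT d.ColID, d.ColA, d.ColB, d.ColC
FROM dbo.tblDest d
WHERE d.ColID BETWEEN @MinSourceColID AND @MaxSourceColID -- hypothetical bounds of the incoming extract

-- 3. point the Lookup at dbo.stgDestKeys instead of dbo.tblDest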

Hope this helps,

:{>

August 17, 2011 12:44 PM
 

dilip said:

Hi andy,

I tried this article in BIDS SSIS 2008 R2

but every time it has to update, it won't do any update; instead it inserts a new row. As given in the above example, when ColID = 1 it needs to update ColA, but instead it inserts a new ColA with the same ColID = 1, so it now has two rows with ColID = 1, one with ColA = 'B' and the other with ColA = 'C' :( Any idea where I went wrong.... I followed the same steps which have been given by you.

Thanks,

dilip

August 19, 2011 3:44 PM
 

Neha said:

Hi Andy, very nice article. I had another question though: how do you handle data deletes? I see that the new records are inserted and changed ones are updated. What about the records that were deleted?

Thanks

Neha

September 20, 2011 10:05 PM
 

Jessica said:

Hi Andy, Awesome posts.

I have one question, not sure if this is the right place. I am working on a data warehouse load that comprises multiple SSIS packages, and my challenge is to make it rerunnable. Each package calls a stored procedure which is rerunnable. I am trying to add something like a BatchId to each run. Any ideas on how this should be approached?

thanks!!!

September 27, 2011 4:23 PM
 

andyleonard said:

Hi Jessica,

  I wrote a post recently about designing an SSIS Framework. It can help. There are other potential gotchas with DW loads, so please hit the email link in the upper right of this page. We'll probably do more good taking this offline.

:{>

September 27, 2011 5:29 PM
 

srinivasan M said:

I am a beginner with SSIS packages. It is very useful.

Thanks a lot

October 4, 2011 1:12 AM
 

indy said:

Hi Andy,

I followed the article published today - level 4 http://www.sqlservercentral.com/articles/Stairway+Series/76390/.

I know you are going to publish an article to take care of deletes as well. But I need to implement this in my project by this week. Do you have the deletes article handy? Also, I need to work with 40+ tables and the SSIS package should refresh the destination database every 8 hours. How do I manage 40+ tables and a data refresh every 8 hours? Can you please suggest a better solution to achieve this?

October 12, 2011 1:32 PM
 

andyleonard said:

Hi Indy,

  Please email me at [email protected].

Thanks,

Andy

October 12, 2011 8:01 PM
 

Amu said:

This OLE DB Command will be slow for lakhs of records..

Loading the data into a stage table and updating it outside the data flow using an Execute SQL Task is one option....

Is any other option available to improve the performance of the package?

November 10, 2011 5:46 AM
 

andyleonard said:

Hi Amu,

  You are correct and thank you for pointing this out. I have written another series about SSIS Incremental Loads for SQLServerCentral.com. I cover your suggestion in Step 4 of the Stairway to Integration Services (http://www.sqlservercentral.com/articles/Stairway+Series/76390/).

:{>

November 10, 2011 9:16 AM
 

malli said:

Is there any chance:

if the source file is deleted, it needs to affect the destination file.

How can we use a condition for that in the conditional split?

The above SSIS package is good (I am working with it),

but the thing is it can't handle the deleted tables.

Can you help me with that?

November 10, 2011 12:25 PM
 

malli said:

Sorry, I mean:

is it able to handle the deleted rows (not columns, sorry for that) in the source table? It should affect the destination too.

By the way, I am using an Oracle database as the source.

November 10, 2011 2:53 PM
 

andyleonard said:

Hi Mali,

  Check out Step 5 of the Stairway to Integration Services - it talks about Deletes: http://www.sqlservercentral.com/articles/Integration+Services+(SSIS)/76395/

Hope this helps,

Andy

November 10, 2011 3:17 PM
 

Reddy said:

Hi Andy,

if a column has null values,

the conditional split says "The expression results must be Boolean for a Conditional Split"

my error is

[Conditional Split [127]] Error: The expression "(STATUS != L_STATUS) || (ORDER != L_ORDER)"on "output "Update" (167)" evaluated to NULL, but the "component "Conditional Split" (127)" requires a Boolean results. Modify the error row disposition on the output to treat this result as False (Ignore Failure) or to redirect this row to the error output (Redirect Row).  The expression results must be Boolean for a Conditional Split.  A NULL expression result is an error.

November 11, 2011 10:17 AM
 

Malli said:

Hi Andy,

Thanks a lot for that,

right now I am facing a problem: my records number in the millions,

so it is getting an out-of-memory error or something like that.

Is there any suggestion on that?

Thanks

Malli

November 11, 2011 5:23 PM
 

Reddy said:

I got it

Anyway, thanks for your post which you have posted recently.

Thanks,

Reddy

November 11, 2011 5:25 PM
 

malli said:

Hi Andy

The error is

Error: The system reports 89 percent memory load. There are 3477643264 bytes of physical memory with 357049128 bytes free. There are 2147352579 bytes of virtual memory with 97837956 bytes free. The paging file has 5452554249 bytes with 1323617112 bytes free.

November 11, 2011 5:27 PM
 

Hi Andy, said:

I am attempting to incrementally update from a table with 170 million rows and 78 fields. The Lookup task failed when I attempted to use it like you described above due to a Duplicate Key error... cache is full. However, it is now extremely slow. Do you have any advice on tweaking the lookup task for my environment?

my source is from Oracle

Is there a way to speed things up/replace the lookup task by using a SQL Execution Task which calls a left outer join?

Thanks Again

November 15, 2011 9:15 PM
 

andyleonard said:

You should consider using a query in the Lookup Transformation instead of selecting the table name from the dropdown. Selecting the table name essentially attempts to load the entire table into RAM before the data flow executes. Limiting the rows and columns returned will shrink the data volume.

You can also look into key-staging. There's mention of it here (http://msdn.microsoft.com/en-us/library/cc671624.aspx) in the section on Targeted Staging.

Another pattern to consider is Range-based lookups (http://blogs.msdn.com/b/mattm/archive/2008/11/25/lookup-pattern-range-lookups.aspx).

Hope this helps,

Andy

November 15, 2011 11:56 PM
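For example, instead of selecting the table from the dropdown, the Lookup's "Use results of an SQL query" option could use something like this against the article's demo destination (a sketch - return only the join key plus the columns you compare for changes, and add a WHERE clause if the destination can be restricted to a known range):

SELECT ColID, ColA, ColB, ColC
FROM dbo.tblDest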
 

John said:

Hi Andy,

This article has been very useful, as I have built a package that almost works... ;-)

It actually works for 4 out of 5 tables. The 5th table has a five-field composite primary key.

So in the Conditional Split

--New rows, I put the following:

Isnull(Dest_field1) && Isnull(Dest_field2) && Isnull(Dest_field3) && Isnull(Dest_field4) && Isnull(Dest_field5)

which are the 5 fields that make the primary key.

and in the -- Changed Rows, I put the following  

(Field1 == Dest_field1) && (Field2 == Dest_field2) && (Field3 == Dest_field3) && (Field4 == Dest_field4) && (Field5 == Dest_field5) && (TimeStamp != Dest_TimeStamp)

which to me is just simply logical.

The problem I'm having is that I get an insert Primary Key violation, which I'm not too sure how to troubleshoot since I only know the basics.

What do you think I'm doing wrong?

December 21, 2011 10:19 AM
 

andyleonard said:

Hi John,

  You want to use the Lookup Transformation to manage mapping your primary key fields. Remember, the Columns tab of the Lookup is akin to setting the JOIN ON clause. If you were joining this table in T-SQL, you would include all five fields.

Hope this helps,

Andy

December 21, 2011 10:39 AM
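In T-SQL terms, mapping all five fields on the Lookup's Columns tab is the equivalent of this join (Field1 through Field5 are John's placeholder names):

SELECT s.*
FROM SourceTable s
LEFT JOIN DestTable d
  ON d.Field1 = s.Field1
 AND d.Field2 = s.Field2
 AND d.Field3 = s.Field3
 AND d.Field4 = s.Field4
 AND d.Field5 = s.Field5

A row is "new" only when all five destination key fields come back NULL.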
 

John said:

Hi again Andy,

Thank you for the pointer... It actually helped me understand and fix my problem.

I had all columns selected for the join in the Lookup transformation. Joining on just the keys fixed it, and it helped me understand the Conditional Split conditions.

All I need now is to learn more about debugging and error handling which is still a big blur for me...

Thanks again! Great blog! :-)

December 21, 2011 12:02 PM
 

Kingdom said:

Thank you very much, Andy Leonard; this is very helpful.

Many thanks

February 1, 2012 9:15 AM
 

Samit Shah said:

Hi Andy,

Could you please advise how to do an incremental load for 300+ tables? It would be difficult to create a data flow for every table, and I have a requirement to use SSIS along with logging of how many records are inserted/updated. Could you please help me with the approach to take in SSIS?

Thanks and Regards,

Samit Shah

February 3, 2012 4:19 PM
 

andyleonard said:

Hi Samit,

  A couple of options come to mind: 1. Use .NET to generate and save the packages. 2. Use Biml (http://agilebi.com/blog/tag/biml/).

Hope this helps,

Andy

February 3, 2012 6:43 PM
 

Samit Shah said:

Hi Andy

Thanks for the help. I was able to create the packages using BIMLScript.

Thanks and Regards,

Samit Shah

February 9, 2012 6:11 PM
 

Hui Shi said:

Hi Andy, great article. However, I have two questions.

1. When you detect new rows, you LEFT OUTER JOIN the source table to the target table and find all records where the right-side columns are NULL. That's fine, but what if the source and target tables contain billions of records? Is that still applicable?

2. I know the other alternative might be extracting MAX(modified_date) from the target table and getting data from the source where the date is greater than MAX(modified_date), but what if there are no such audit columns on the source and target tables?

Thanks

Hui

March 19, 2012 2:24 PM
 

Peter Schott said:

Hui,

 In those cases, you are probably better off using some sort of max(Created/Modified Date) - with an index!  If you're replicating that table, it could potentially be added as a computed column on the replicated side with the index created there.  That would let you know what records are new/changed since the last run.  Pull those values into some form of staging table and compare against that - it will likely perform better that way than doing a direct join.

March 19, 2012 3:10 PM
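A sketch of the high-water-mark pull Peter describes, assuming both tables carry an indexed ModifiedDate column (which the article's demo tables do not; the column name is an assumption):

DECLARE @LastLoad datetime
SELECT @LastLoad = MAX(ModifiedDate)
FROM SSISIncrementalLoad_Dest.dbo.tblDest

SELECT s.ColID, s.ColA, s.ColB, s.ColC
FROM SSISIncrementalLoad_Source.dbo.tblSource s
WHERE s.ModifiedDate > @LastLoad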
 

Hui Shi said:

Thanks, Peter. Do you know any way to add audit columns as a computed column on the replicated side? In my case, our source table does not

contain such audit columns, and in order to add one it seems I need

to write a trigger so that a timestamp is written to each record whenever there is a transaction. Do you have any suggestions?

March 19, 2012 3:54 PM
 

Peter Schott said:

Hui, that's where you start running into issues. If you don't have something in your row that's tracking Created/Updated Dates, you'll need to add it and possibly a trigger as well.  Created Date is pretty straightforward to default.  Updated Date would need an AFTER UPDATE trigger to populate it. I think it could still be done on your replicated side, but in this case, you're better off adding it to your main source database. You'll likely want it at some point even if you don't think you'll need it.

If the data is strictly INSERT, you can get the max PK value (or some other unique value) and run against that.  You have the option of Service Broker as well, but that takes some work and probably more triggers to manage.  If you're on SQL 2008 Enterprise, CDC may be an option.

In our case, we have Created/Updated Dates (some NULL) and were able to create a computed column to show COALESCE(Updated, Created, '') as WarehouseLoadDate. We could then index that and find just the rows that were new/changed. From there we processed as appropriate.

March 19, 2012 4:58 PM
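A rough T-SQL sketch of the pieces Peter describes (table and column names are assumptions; note the computed column references only other columns, which keeps it deterministic and therefore indexable):

-- created date defaults on insert
ALTER TABLE dbo.MyTable ADD CreatedDate datetime NOT NULL
  CONSTRAINT df_MyTable_CreatedDate DEFAULT (GETDATE())
GO
ALTER TABLE dbo.MyTable ADD UpdatedDate datetime NULL
GO
-- AFTER UPDATE trigger maintains the updated date
CREATE TRIGGER trg_MyTable_Update ON dbo.MyTable
AFTER UPDATE
AS
UPDATE t
SET UpdatedDate = GETDATE()
FROM dbo.MyTable t
INNER JOIN inserted i ON i.ColID = t.ColID
GO
-- computed column plus index gives an indexed "last touched" value
ALTER TABLE dbo.MyTable ADD WarehouseLoadDate AS (COALESCE(UpdatedDate, CreatedDate))
GO
CREATE INDEX IX_MyTable_WarehouseLoadDate ON dbo.MyTable (WarehouseLoadDate)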
 

jim said:

hi andy

I have a project where I need to load data each time for only two months,

and every year delete the last month and then reload with the next month. I'm having trouble with that; it would be great if you could help me out with some ideas.

thanks

jim

July 16, 2012 2:47 AM
 

Eliana said:

Hi andy

Can I replace your source DB with an Excel file?

Thanks

Eliana

July 16, 2012 9:28 PM
 

andyleonard said:

Hi Eliana,

  Yep, but there are a limited number of SSIS data types available from Excel (8, if memory serves) and mapping them to SQL Server data types can be "tricky".

:{>

July 17, 2012 4:57 PM
 

Eliana said:

Hi Andy

Yep, I got a lot of headaches when I tried to import and format Excel files into the DB using SSIS. Do you know some tricks for that?

Now I'm trying to test your solution, but using 2 tables (1 source and 1 destination table) in the same DB.

I imported the Excel file into mytable_source (tmp), and I made a Lookup and Conditional Split from there to update and insert new records into mytable_dest. But I'm still having issues... mytable_dest has the same structure as mytable_source, even the identity ID, but I want to use other fields to determine whether the records are new or updated. No records are inserted or changed.

Any idea?

July 18, 2012 12:36 AM
 

andyleonard said:

Hi Eliana,

  You may want to take a look at another series of articles I wrote about Incremental Loads. It provides more detail and screenshots. You can find the series at http://www.sqlservercentral.com/stairway/72494/.

Hope this helps,

Andy

July 18, 2012 12:55 AM
 

Eliana said:

Hi Andy,

I have an issue and I don't know what I'm doing wrong!

1. I have a SOURCE table with records and an empty DEST table.

2. I want to use 2 fields for a lookup, OrderDate and OrderNo

3. I chose only these in the available lookup columns (join), and I used DEST_ as the Output Alias prefix.

4. In the Conditional Split I'm using the following conditions:

4.1 New Rows --> (OrderDate  != DEST_OrderDate  && OrderNo != DEST_OrderNo)

4.2 Change Rows --> (OrderDate  == DEST_OrderDate  && OrderNo == DEST_OrderNo)

but when I run it, no new rows are inserted.

What am I doing wrong?

Thanks

Eliana

July 24, 2012 10:05 PM
 

andyleonard said:

Hi Eliana,

  You detect new rows by checking for a NULL (using the SSIS IsNull function) on any column returned from the Destination in the Lookup transformation. You use the expression currently in your New Rows detection to detect Changed Rows.

Hope this helps,

:{>

July 25, 2012 10:08 AM
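With Eliana's aliases, that suggestion looks like this in the Conditional Split (a sketch; DEST_OrderNo can be any column returned by the Lookup, and the change test should compare a non-key column - shown here with a placeholder name, OrderStatus/DEST_OrderStatus):

New Rows     --> ISNULL(DEST_OrderNo)
Changed Rows --> !ISNULL(DEST_OrderNo) && (OrderStatus != DEST_OrderStatus)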
 

Eliana said:

Thanks for the help, it's working now

Regards,

Eliana

July 25, 2012 7:34 PM
 

andyleonard said:

Good job, Eliana!

:{>

July 25, 2012 8:07 PM
 

Eliana said:

Hi Andy,

I hope you can help me with this new issue

Now I have a solution that runs perfectly when each piece runs separately, but if I run everything together it gets stuck in the validation phase of my second task.

My solution has 3 stages:

1. Import the Excel file into my source table (53647 rows).

2. Insert/update my dest table from the source table (using Lookup and Conditional Split).

3. Truncate the source table.

The first stage is OK.

The second gets stuck at "SSIS.Pipeline: Validation phase is beginning," showing yellow in the data flow but doing nothing.

I set DelayValidation to True on each data flow, but I guess it is being ignored.

If I run each data flow separately, it runs OK, and faster.

What do I have to do to improve it?

Thanks

Eliana

July 25, 2012 10:42 PM
 

Sarika said:

Hello Andy,

Your article has been very helpful.

Thanks,

Sarika

July 30, 2012 3:26 PM
 

Yogesh M said:

Great article... Thanks, sir!

October 19, 2012 3:34 AM
 

Ron said:

hi Andy,

You said: You can limit the number of rows used by the Lookup by using a SQL query as the Lookup source instead of the entire table.

Is it possible to dynamically pass a value from the destination table to a WHERE clause in such a query? So I can, for example, find MAX(ID) in the destination and only get rows from the source that are greater than that (meaning only new rows)?

thanks

February 11, 2013 8:07 PM
 

andyleonard said:

Hi Ron,

  In SSIS 2008, yes you can. The Lookup Query is exposed in the Expressions for the Data Flow containing the lookup transformation.

Hope this helps,

:{>

February 22, 2013 1:09 PM
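A sketch of that approach: populate an SSIS variable such as [User::MaxID] with an Execute SQL Task, then set an expression on the Data Flow Task's exposed Lookup property (the exact property name depends on how the component is named, e.g. [Lookup].[SqlCommand]):

"SELECT ColID, ColA, ColB, ColC FROM dbo.tblSource WHERE ColID > " + (DT_WSTR, 12)@[User::MaxID]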
 

srinivas said:

Hi friends,

I have a small doubt about incremental loading; please clarify it for me.

Here we are using the OLE DB Command for updating records.

Is it possible to update records without using the OLE DB Command?

Is it possible to use an Execute SQL Task at the control flow level instead? I wrote the same query in an Execute SQL Task and mapped parameters to variables, like:

update table set name = ?,

                 sal = ?

         where id = ?

and I mapped the parameters,

but it does not update any records.

I want to achieve the same result using an Execute SQL Task. Please tell me what steps I need to follow.

March 24, 2013 10:27 AM
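With an OLE DB connection, the Execute SQL Task's Parameter Mapping tab must use the 0-based ordinals as parameter names, in the order the ? markers appear - a sketch, with assumed variable names:

-- SQLStatement:
UPDATE dbo.MyTable SET name = ?, sal = ? WHERE id = ?

-- Parameter Mapping tab:
-- Variable    Direction  Data Type  Parameter Name
-- User::Name  Input      VARCHAR    0
-- User::Sal   Input      LONG       1
-- User::Id    Input      LONG       2

Note this updates one row per execution; to update many rows in one pass, stage the changed rows and run a set-based UPDATE (see the sketch earlier in these comments).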
 

rohit said:

Thanks, but it is not too clear to me. Can you provide a video? That would help me a lot. Thanks in advance.

with regards

rohit

[email protected]

March 25, 2013 5:50 AM
 

Daniel Macho said:

I recommend you take a look at the Dimension Merge SCD component and the video set on YouTube (especially the 6th one):

http://dimensionmergescd.codeplex.com

April 3, 2013 10:45 AM
 

Akshay said:

New to SSIS? Andy Leonard is the name to remember. An article from July 2007 is still being appreciated and lauded, and it is kept updated on SQLServerCentral to date.

Thank you very much, Andy, for sharing your knowledge.

April 13, 2013 2:27 PM
 

Denis Goch said:

Andy,

Thank you very much for this contribution. I searched a lot for something like this, and none of the things I found worked like yours does. I was trying to import an XML file into SQL Server 2008, and your example worked perfectly. Thanks again.

April 15, 2013 9:38 PM
 

Austine said:

Thank you so much. I am a beginner at this, hence my question: can I use the same approach to do an incremental load on my fact table? What if I don't want to use the Conditional Split but just want to insert and update in the destination?

April 17, 2013 4:20 PM
 

Pratik said:

Hi Andy,

How can I handle updates and deletes with this SSIS package?

thanks.

May 30, 2013 1:03 PM
 

Saint said:

Thanks, Andy. Your article gave me a head start on the job.

July 1, 2013 2:58 AM
