Module: Sequel::Model::Associations::DatasetMethods
In: lib/sequel/model/associations.rb
Eager loading makes it so that you can load all associated records for a set of objects in a single query, instead of a separate query for each object.
Two separate implementations are provided. eager should be used most of the time, as it loads associated records using one query per association. However, it does not allow you to filter or order based on columns in associated tables. eager_graph loads all records in a single query using JOINs, allowing you to filter or order based on columns in associated tables. However, eager_graph is usually slower than eager, especially if multiple one_to_many or many_to_many associations are joined.
You can cascade the eager loading (loading associations on associated objects) with no limit to the depth of the cascades. You do this by passing a hash to eager or eager_graph with the keys being associations of the current model and values being associations of the model associated with the current model via the key.
The arguments can be symbols or hashes with symbol keys (for cascaded eager loading). Examples:
Album.eager(:artist).all
Album.eager_graph(:artist).all
Album.eager(:artist, :genre).all
Album.eager_graph(:artist, :genre).all
Album.eager(:artist).eager(:genre).all
Album.eager_graph(:artist).eager(:genre).all
Artist.eager(albums: :tracks).all
Artist.eager_graph(albums: :tracks).all
Artist.eager(albums: {tracks: :genre}).all
Artist.eager_graph(albums: {tracks: :genre}).all
You can also pass a callback as a hash value in order to customize the dataset being eager loaded at query time, analogous to the way the :eager_block association option allows you to customize it at association definition time. For example, if you wanted artists with their albums since 1990:
Artist.eager(albums: proc{|ds| ds.where{year > 1990}})
Or if you needed albums and their artist's name only, using a single query:
Album.eager_graph(artist: proc{|ds| ds.select(:name)})
To cascade eager loading while using a callback, you substitute the cascaded associations with a single entry hash that has the proc callback as the key and the cascaded associations as the value. This will load artists with their albums since 1990, and also the tracks on those albums and the genre for those tracks:
Artist.eager(albums: {proc{|ds| ds.where{year > 1990}}=>{tracks: :genre}})
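The three shapes a hash value can take (a cascade, a bare callback, or a single-entry hash with a callback key) can be told apart with a few checks. The following is a minimal sketch in plain Ruby; `split_eager_value` is a hypothetical helper written for illustration, not a Sequel method, and the proc is a stand-in that does no filtering.

```ruby
# Hypothetical helper, not part of Sequel: split an eager-loading hash value
# into [callback, cascaded_associations].
def split_eager_value(value)
  if value.respond_to?(:call)
    # A bare proc: callback only, nothing cascaded.
    [value, nil]
  elsif value.is_a?(Hash) && value.length == 1 && value.keys.first.respond_to?(:call)
    # Single-entry hash with a proc key: callback plus cascaded associations.
    value.first
  else
    # Plain symbol, array, or hash: cascaded associations only.
    [nil, value]
  end
end

# Stand-in callback; a real one would return a filtered dataset.
since_1990 = proc { |ds| ds }

callback, cascade = split_eager_value({since_1990 => {tracks: :genre}})
# callback is the proc, cascade is {tracks: :genre}
```

A plain `{tracks: :genre}` value falls through to the last branch because its only key is a symbol, which does not respond to `call`.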
If the dataset is being eagerly loaded, default to calling all instead of each.
# File lib/sequel/model/associations.rb, line 2830
def as_hash(key_column=nil, value_column=nil, opts=OPTS)
  if (@opts[:eager_graph] || @opts[:eager]) && !opts.has_key?(:all)
    opts = Hash[opts]
    opts[:all] = true
  end
  super
end
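The listing relies on two Ruby behaviors: `Hash[opts]` returns an unfrozen copy, and a bare `super` forwards the current values of the method's parameters, including a reassigned `opts`. The sketch below reproduces the pattern with hypothetical stand-in classes (`BaseDataset` and `EagerAwareDataset` are not Sequel classes); the parent method only reports which path would be taken.

```ruby
OPTS = {}.freeze

# Stand-in parent: reports whether the :all option reached it.
class BaseDataset
  def initialize(opts = OPTS)
    @opts = opts
  end

  def as_hash(key_column = nil, value_column = nil, opts = OPTS)
    opts[:all] ? :used_all : :used_each
  end
end

class EagerAwareDataset < BaseDataset
  def as_hash(key_column = nil, value_column = nil, opts = OPTS)
    if (@opts[:eager_graph] || @opts[:eager]) && !opts.has_key?(:all)
      opts = Hash[opts] # copy, so the caller's (possibly frozen) hash is untouched
      opts[:all] = true
    end
    super # bare super forwards the reassigned opts
  end
end

eager_ds = EagerAwareDataset.new({eager: {albums: nil}})
plain_ds = EagerAwareDataset.new

eager_ds.as_hash(:id)                     # parent sees :all => true
plain_ds.as_hash(:id)                     # parent sees no :all option
eager_ds.as_hash(:id, nil, {all: false})  # an explicit :all is left alone
```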
Adds one or more INNER JOINs to the existing dataset using the keys and conditions specified by the given association. The following methods also exist for specifying a different type of JOIN:
association_full_join :: FULL JOIN
association_inner_join :: INNER JOIN
association_left_join :: LEFT JOIN
association_right_join :: RIGHT JOIN
# File lib/sequel/model/associations.rb, line 2672
def association_join(*associations)
  association_inner_join(*associations)
end
If the expression is in the form x = y where y is a Sequel::Model instance, array of Sequel::Model instances, or a Sequel::Model dataset, assume x is an association symbol and look up the association reflection via the dataset's model. From there, return the appropriate SQL based on the type of association and the values of the foreign/primary keys of y. For most association types, this is a simple transformation, but for many_to_many associations this creates a subquery to the join table.
# File lib/sequel/model/associations.rb, line 2683
def complex_expression_sql_append(sql, op, args)
  r = args[1]
  if (((op == :'=' || op == :'!=') and r.is_a?(Sequel::Model)) ||
      (multiple = ((op == :IN || op == :'NOT IN') and ((is_ds = r.is_a?(Sequel::Dataset)) or r.all?{|x| x.is_a?(Sequel::Model)}))))
    l = args[0]
    if ar = model.association_reflections[l]
      if multiple
        klass = ar.associated_class
        if is_ds
          if r.respond_to?(:model)
            unless r.model <= klass
              # A dataset for a different model class, could be a valid regular query
              return super
            end
          else
            # Not a model dataset, could be a valid regular query
            return super
          end
        else
          unless r.all?{|x| x.is_a?(klass)}
            raise Sequel::Error, "invalid association class for one object for association #{l.inspect} used in dataset filter for model #{model.inspect}, expected class #{klass.inspect}"
          end
        end
      elsif !r.is_a?(ar.associated_class)
        raise Sequel::Error, "invalid association class #{r.class.inspect} for association #{l.inspect} used in dataset filter for model #{model.inspect}, expected class #{ar.associated_class.inspect}"
      end

      if exp = association_filter_expression(op, ar, r)
        literal_append(sql, exp)
      else
        raise Sequel::Error, "invalid association type #{ar[:type].inspect} for association #{l.inspect} used in dataset filter for model #{model.inspect}"
      end
    elsif multiple && (is_ds || r.empty?)
      # Not a query designed for this support, could be a valid regular query
      super
    else
      raise Sequel::Error, "invalid association #{l.inspect} used in dataset filter for model #{model.inspect}"
    end
  else
    super
  end
end
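To illustrate the "simple transformation" for the non-many_to_many cases, here is a minimal plain-Ruby sketch. `Reflection` and `association_condition` are hypothetical stand-ins (Sequel's real reflections carry far more information), and the object being filtered on is represented as a plain hash of column values. The many_to_many subquery case is deliberately left out.

```ruby
# Hypothetical stand-in for an association reflection.
Reflection = Struct.new(:type, :key, :primary_key)

# Build the plain column condition that an `association => object` filter
# stands for. `object` is a hash of column values in this sketch.
def association_condition(reflection, object)
  case reflection.type
  when :many_to_one
    # e.g. albums.artist_id = <the artist's primary key value>
    {reflection.key => object[reflection.primary_key]}
  when :one_to_many, :one_to_one
    # e.g. artists.id = <the album's foreign key value>
    {reflection.primary_key => object[reflection.key]}
  else
    raise ArgumentError, "unsupported association type #{reflection.type.inspect}"
  end
end

# Album many_to_one :artist, filtered by an artist whose id is 7.
artist_reflection = Reflection.new(:many_to_one, :artist_id, :id)
condition = association_condition(artist_reflection, {id: 7, name: "Example"})
# condition is {artist_id: 7}
```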
The preferred eager loading method. Loads all associated records using one query for each association.
The basic idea for how it works is that the dataset is first loaded normally. Then it goes through all associations that have been specified via eager. It loads each of those associations separately, then associates them back to the original dataset via primary/foreign keys. Due to the necessity of all objects being present, you need to use all to use eager loading, as it can't work with each.
This implementation avoids the complexity of extracting an object graph out of a single dataset, by building the object graph out of multiple datasets, one for each association. By using a separate dataset for each association, it avoids problems such as aliasing conflicts and creating cartesian product result sets if multiple one_to_many or many_to_many eager associations are requested.
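The load-then-associate steps described above can be sketched in plain Ruby. This is an illustration of the general technique under stated assumptions, not Sequel's code: `FakeDB` is a hypothetical in-memory stand-in that counts "queries", and rows are plain hashes rather than model objects.

```ruby
# Hypothetical in-memory stand-in for a database; counts queries issued.
class FakeDB
  attr_reader :query_count

  def initialize(tables)
    @tables = tables
    @query_count = 0
  end

  # Roughly: SELECT * FROM table
  def all(table)
    @query_count += 1
    @tables.fetch(table).map(&:dup)
  end

  # Roughly: SELECT * FROM table WHERE column IN (values)
  def where_in(table, column, values)
    @query_count += 1
    @tables.fetch(table).select { |row| values.include?(row[column]) }.map(&:dup)
  end
end

def load_artists_with_albums(db)
  artists = db.all(:artists)                      # query 1: the current dataset
  ids = artists.map { |artist| artist[:id] }
  albums = db.where_in(:albums, :artist_id, ids)  # query 2: the whole association
  by_artist = albums.group_by { |album| album[:artist_id] }
  # Associate the albums back to their artists via the foreign key.
  artists.each { |artist| artist[:albums] = by_artist.fetch(artist[:id], []) }
end

tables = {
  artists: [{id: 1, name: "A"}, {id: 2, name: "B"}],
  albums: [{id: 10, artist_id: 1}, {id: 11, artist_id: 1}, {id: 12, artist_id: 2}]
}
db = FakeDB.new(tables)
artists = load_artists_with_albums(db)
```

The second step needs the keys of every parent row, which is why all rows must be present before the association can be loaded.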
One limitation of using this method is that you cannot filter the current dataset based on values of columns in an associated table, since the associations are loaded in separate queries. To do that you need to load all associations in the same query, and extract an object graph from the results of that query. If you need to filter based on columns in associated tables, look at eager_graph or join the tables you need to filter on manually.
Each association's order, if defined, is respected. If the association uses a block or has an :eager_block argument, it is used.
# File lib/sequel/model/associations.rb, line 2751
def eager(*associations)
  opts = @opts[:eager]
  association_opts = eager_options_for_associations(associations)
  opts = opts ? Hash[opts].merge!(association_opts) : association_opts
  clone(:eager=>opts.freeze)
end
The secondary eager loading method. Loads all associations in a single query. This method should only be used if you need to filter or order based on columns in associated tables, or if you have benchmarked it against eager and determined it is faster.
This method uses Dataset#graph to create appropriate aliases for columns in all the tables. Then it uses the graph's metadata to build the associations from the single hash, and finally replaces the array of hashes with an array of model objects inside all.
Be very careful when using this with multiple one_to_many or many_to_many associations, as you can create large cartesian products. If you must graph multiple one_to_many and many_to_many associations, make sure your filters are narrow if the datasets are large.
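The cartesian product is easy to see with small numbers. The sketch below simulates the joined result in plain Ruby for one artist with two one_to_many associations; the `tags` association and the column names are hypothetical, chosen only for illustration.

```ruby
albums = [{id: 10}, {id: 11}, {id: 12}]        # 3 albums for one artist
tags   = [{id: 1}, {id: 2}, {id: 3}, {id: 4}]  # 4 tags for the same artist

# Joining both tables to the artist pairs every album with every tag.
joined_rows = albums.product(tags).map do |album, tag|
  {artist_id: 1, albums_id: album[:id], tags_id: tag[:id]}
end

row_count = joined_rows.length # 3 * 4 rows for a single artist

# Rebuilding the object graph means de-duplicating each association.
album_ids = joined_rows.map { |row| row[:albums_id] }.uniq
tag_ids   = joined_rows.map { |row| row[:tags_id] }.uniq
```

Seven distinct associated records are spread over twelve rows here, and the row count grows multiplicatively with each additional to-many association.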
Each association's order, if defined, is respected. eager_graph probably won't work correctly on a limited dataset, unless you are only graphing many_to_one, one_to_one, and one_through_one associations.
Does not use the block defined for the association, since it does a single query for all objects. You can use the :graph_* association options to modify the SQL query.
Like eager, you need to call all on the dataset for the eager loading to work. If you just call each, it will yield plain hashes, each containing all columns from all the tables.
# File lib/sequel/model/associations.rb, line 2779
def eager_graph(*associations)
  eager_graph_with_options(associations)
end
Run eager_graph with some options specific to just this call. Unlike eager_graph, this takes the associations as a single argument instead of multiple arguments.
Options:
:join_type :: Override the join type specified in the association
:limit_strategy :: Use a strategy for handling limits on associations. Appropriate :limit_strategy values are:
This can also be a hash with association name symbol keys and one of the above values, to use different strategies per association. The default is the :ruby strategy. Choosing a different strategy can make your code significantly slower in some cases (perhaps even the majority of cases), so you should only use this if you have benchmarked that it is faster for your use cases.
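With the :ruby strategy the SQL is left alone and any association limit is applied to the loaded records afterwards. As a rough sketch of what that kind of array slicing could look like (`slice_associated` is a hypothetical helper, not Sequel's implementation):

```ruby
# Hypothetical helper: apply an association's limit and offset to the
# records already loaded for one parent object.
def slice_associated(records, limit, offset = nil)
  offset ||= 0
  return records[offset..-1] || [] if limit.nil? # offset only
  records[offset, limit] || []
end

tracks = [:t1, :t2, :t3, :t4, :t5]
slice_associated(tracks, 2, 1) # two records, skipping the first
slice_associated(tracks, 2)    # the first two records
```

Every associated row is still fetched from the database in this approach; only the in-memory arrays are trimmed.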
# File lib/sequel/model/associations.rb, line 2803
def eager_graph_with_options(associations, opts=OPTS)
  opts = opts.dup unless opts.frozen?
  associations = [associations] unless associations.is_a?(Array)
  ds = if eg = @opts[:eager_graph]
    eg = eg.dup
    [:requirements, :reflections, :reciprocals, :limits].each{|k| eg[k] = eg[k].dup}
    eg[:local] = opts
    ds = clone(:eager_graph=>eg)
    ds.eager_graph_associations(ds, model, ds.opts[:eager_graph][:master], [], *associations)
  else
    # Each of the following have a symbol key for the table alias, with the following values:
    # :reciprocals :: the reciprocal value to use for this association
    # :reflections :: AssociationReflection instance related to this association
    # :requirements :: array of requirements for this association
    # :limits :: Any limit/offset array slicing that need to be handled in ruby land after loading
    opts = {:requirements=>{}, :master=>alias_symbol(first_source), :reflections=>{}, :reciprocals=>{}, :limits=>{}, :local=>opts, :cartesian_product_number=>0, :row_proc=>row_proc}
    ds = clone(:eager_graph=>opts)
    ds = ds.eager_graph_associations(ds, model, ds.opts[:eager_graph][:master], [], *associations).naked
  end

  ds.opts[:eager_graph].freeze
  ds.opts[:eager_graph].each_value{|v| v.freeze if v.is_a?(Hash)}
  ds
end
If the dataset is being eagerly loaded, default to calling all instead of each.
# File lib/sequel/model/associations.rb, line 2840
def to_hash_groups(key_column, value_column=nil, opts=OPTS)
  if (@opts[:eager_graph] || @opts[:eager]) && !opts.has_key?(:all)
    opts = Hash[opts]
    opts[:all] = true
  end
  super
end
Do not attempt to split the result set into associations, just return results as simple objects. This is useful if you want to use eager_graph as a shortcut to have all of the joins and aliasing set up, but want to do something else with the dataset.
# File lib/sequel/model/associations.rb, line 2852
def ungraphed
  ds = super.clone(:eager_graph=>nil)
  if (eg = @opts[:eager_graph]) && (rp = eg[:row_proc])
    ds = ds.with_row_proc(rp)
  end
  ds
end
Call graph on the association with the correct arguments, update the eager_graph data structure, and recurse into eager_graph_associations if any associations are passed in (these would be dependencies of the current association).
Arguments:
ds :: Current dataset
model :: Current Model
ta :: table_alias used for the parent association
requirements :: an array, used as a stack for requirements
r :: association reflection for the current association, or an SQL::AliasedExpression with the reflection as the expression and the alias base as the alias
*associations :: any associations dependent on this one
# File lib/sequel/model/associations.rb, line 2875
def eager_graph_association(ds, model, ta, requirements, r, *associations)
  if r.is_a?(SQL::AliasedExpression)
    alias_base = r.alias
    r = r.expression
  else
    alias_base = r[:graph_alias_base]
  end
  assoc_table_alias = ds.unused_table_alias(alias_base)
  loader = r[:eager_grapher]
  if !associations.empty?
    if associations.first.respond_to?(:call)
      callback = associations.first
      associations = {}
    elsif associations.length == 1 && (assocs = associations.first).is_a?(Hash) && assocs.length == 1 && (pr_assoc = assocs.to_a.first) && pr_assoc.first.respond_to?(:call)
      callback, assoc = pr_assoc
      associations = assoc.is_a?(Array) ? assoc : [assoc]
    end
  end
  local_opts = ds.opts[:eager_graph][:local]
  limit_strategy = r.eager_graph_limit_strategy(local_opts[:limit_strategy])

  if r[:conditions] && !Sequel.condition_specifier?(r[:conditions]) && !r[:orig_opts].has_key?(:graph_conditions) && !r[:orig_opts].has_key?(:graph_only_conditions) && !r.has_key?(:graph_block)
    raise Error, "Cannot eager_graph association when :conditions specified and not a hash or an array of pairs. Specify :graph_conditions, :graph_only_conditions, or :graph_block for the association. Model: #{r[:model]}, association: #{r[:name]}"
  end

  ds = loader.call(:self=>ds, :table_alias=>assoc_table_alias, :implicit_qualifier=>(ta == ds.opts[:eager_graph][:master]) ? first_source : qualifier_from_alias_symbol(ta, first_source), :callback=>callback, :join_type=>local_opts[:join_type], :join_only=>local_opts[:join_only], :limit_strategy=>limit_strategy, :from_self_alias=>ds.opts[:eager_graph][:master])
  if r[:order_eager_graph] && (order = r.fetch(:graph_order, r[:order]))
    ds = ds.order_append(*qualified_expression(order, assoc_table_alias))
  end
  eager_graph = ds.opts[:eager_graph]
  eager_graph[:requirements][assoc_table_alias] = requirements.dup
  eager_graph[:reflections][assoc_table_alias] = r
  if limit_strategy == :ruby
    eager_graph[:limits][assoc_table_alias] = r.limit_and_offset
  end
  eager_graph[:cartesian_product_number] += r[:cartesian_product_number] || 2
  ds = ds.eager_graph_associations(ds, r.associated_class, assoc_table_alias, requirements + [assoc_table_alias], *associations) unless associations.empty?
  ds
end
Check the associations are valid for the given model. Call eager_graph_association on each association.
Arguments:
ds :: Current dataset
model :: Current Model
ta :: table_alias used for the parent association
requirements :: an array, used as a stack for requirements
*associations :: the associations to add to the graph
# File lib/sequel/model/associations.rb, line 2924
def eager_graph_associations(ds, model, ta, requirements, *associations)
  return ds if associations.empty?
  associations.flatten.each do |association|
    ds = case association
    when Symbol, SQL::AliasedExpression
      ds.eager_graph_association(ds, model, ta, requirements, eager_graph_check_association(model, association))
    when Hash
      association.each do |assoc, assoc_assocs|
        ds = ds.eager_graph_association(ds, model, ta, requirements, eager_graph_check_association(model, assoc), assoc_assocs)
      end
      ds
    else
      raise(Sequel::Error, 'Associations must be in the form of a symbol or hash')
    end
  end
  ds
end
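The mutual recursion between the two methods, with requirements acting as a stack of parents, can be illustrated without a database. The sketch below is a simplified plain-Ruby walker: `walk_associations` is hypothetical, it records association names where the real code records table aliases, and it ignores callbacks and aliased expressions.

```ruby
# Hypothetical walker, not Sequel's code: list each association together with
# the stack of parent associations it depends on.
def walk_associations(associations, requirements = [], out = [])
  list = associations.is_a?(Array) ? associations : [associations]
  list.flatten.each do |association|
    case association
    when Symbol
      out << [association, requirements]
    when Hash
      association.each do |assoc, nested|
        out << [assoc, requirements]
        # Recurse with the current association pushed onto the stack.
        walk_associations(nested, requirements + [assoc], out) unless nested.nil?
      end
    else
      raise ArgumentError, 'Associations must be in the form of a symbol or hash'
    end
  end
  out
end

graph = walk_associations([:artist, {albums: {tracks: :genre}}])
# graph pairs each association with its parents, e.g. [:genre, [:albums, :tracks]]
```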
Replace the array of plain hashes with an array of model objects with all eager_graphed associations set in the associations cache for each object.
# File lib/sequel/model/associations.rb, line 2944
def eager_graph_build_associations(hashes)
  hashes.replace(EagerGraphLoader.new(self).load(hashes))
end