Fixes#1105 by doronp · Pull Request #1194 · moby/swarmkit

doronp · 2016-07-20T05:48:57Z

service command should work with id prefix and with name as in network command
see:
#1097
#1096

doronp · 2016-07-20T05:49:38Z

ping @aaronlehmann @dongluochen @aluzzardi

codecov-io · 2016-07-20T06:00:54Z

Current coverage is 55.07%

Merging #1194 into master will increase coverage by <.01%

@@             master      #1194   diff @@
==========================================
  Files            77         77          
  Lines         12190      12188     -2   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
- Hits           6713       6712     -1   
  Misses         4559       4559          
+ Partials        918        917     -1

Powered by Codecov. Last updated by e0ffaf6...a654a6e

stevvooe · 2016-07-20T18:52:15Z

cmd/swarmctl/service/common.go

-				},
-			},
-		)
+		service, err := getServiceByName(ctx, c, input)


It's probably better to lookup by prefix, then let the caller know the name is ambiguous.

@stevvooe See #1097 In Networks we want to have the same behaviour?

Any time you are trying to resolve a set to single fuzzy item, this is the behavior to use. In psuedocode, it looks like this:

results := find(query) switch len(results) { case 0: not found case 1: found default: ambiguous -> error }

This reduces round trips and ensures that you get the right result. If you don't handle the ambiguous case, you end up with a potential security exploit.

stevvooe · 2016-07-20T22:30:26Z

@doronp Thanks for the bugfixes, but do you mind taking the time to give your PRs and commit messages descriptive titles?

Signed-off-by: Doron Podoleanu <[email protected]>

doronp · 2016-07-21T11:13:22Z

@stevvooe Done.

stevvooe · 2016-07-21T21:45:34Z

@doronp Thanks! It makes it a lot easier when we're going through all these issues and commits.

doronp · 2016-07-27T12:33:29Z

Are we good to go with this and PR #1097?

stevvooe · 2016-07-27T19:31:22Z

@doronp No. You didn't change the code based on the feedback.

doronp · 2016-07-28T06:44:38Z

@stevvooe You must mean your comment about the result length. Can you please explain in which case The code will return an arbitrary result instead of stating ambiguous? I re-read it and Each of the private methods (getServiceByPrefixedID; getServiceByName) each returns a result IFF it has one value.

stevvooe · 2016-07-28T19:31:04Z

@doronp Do the prefix lookup first, as it is least likely to fail. The way the code is written now, it incurs an extra request to get services by prefix. If you query prefix and it is unique, it should only take a single request.

There are three cases:

Query by prefix and have exact match that is unique. Return the single match.
Query by prefix and have exact match that is also prefix of another record. Return the record that matches exactly.
Query by prefix and have multiple matches sharing the query as the prefix. This case is ambiguous.

We can have a similar set of cases for query by ID Prefix. I am not sure, but there should be a single index that has both names and ids, queryable by prefix, such that this only takes 1 RPC to do in total.

doronp · 2016-08-03T12:56:16Z

@stevvooe

I think we should try name before short ID: full ID -> name -> short ID.

see
PR #1097 (same issue for networks)

I think @aluzzardi @dongluochen @aaronlehmann and ourselves agree that services and networks should behave the same.

What you and @aluzzardi suggest is contradicting. Can we please have a resolution for both issues?
Personally I think that the order suggested by @aluzzardi Is also the order of events in plain docker when querying for a container so it makes sense to me.

stevvooe · 2016-08-03T22:30:03Z

@doronp Yes, they should all behave the same. No, they are not contradictory. I am proposing a matching algorithm that is correct, safe and efficient. This PR is not.

Both git and docker implement an algorithm similar to this and it needs to be done here, as well. Please do it or we cannot merge this PR.

doronp · 2016-08-04T06:22:01Z

@stevvooe
I suspected we misunderstand each other on some basic terminology.

You wish to have:

Do the prefix lookup first

And @aluzzardi

I think we should try name before short ID: full ID -> name -> short ID.

I may have miss understood but isn't short ID==prefix?

ezrasilvera · 2016-08-10T07:17:49Z

@stevvooe Just to clarify
(1) when you speak about prefix you mean just ID prefix or also name prefix (which is currently not supported in any command AFAIK)
(2) If you check prefix first (and it's just the ID prefix) it means that if the ID prefix of service A == name of service B you would never be able to access service B by name.

stevvooe · 2016-08-10T22:55:33Z

@ezrasilvera @doronp Here is the data model.

First, we have a function, given an object that gives you thinks that can look it up. The current function for most objects will returns a name and id:

fn(Object) -> []string

Examples:

fn(task1) -> {<id1>, <name1>}
fn(task2) {<id2>, <name2>}

We call this function "vectorization". We take the result and apply the entries in a common reverse index-space, with tuples of <Result, Object>, usually in a sorted data structure, such as a red-black tree or suffix/patricia/whatever trie.

So, when we have fn(task1) -> {<id1>, <name1>}, we create two entries in our index-space:

<id1> task1
<name1> task1

When we look these up in a dataset, we have no clue whether they are ids or names and we can't make assumptions about their relationships. We can only assume that IDs are unique. We can look at the entries as a table. Here is one with concrete values:

1 task1
2 task2
3 task3
3 task4 # has name 3
33 task5 # has name 33
4 task4
5 task5
6 task6 
6 task7 # has name 6
7 task7
7 task6 # has name 7
bar task2
foo task1
fooer task3

On the left, we have a lookup key and on the right we have the target object (or likely an identifier). We show this lexicographically and each tuple is unique.

With such a dataset, we can serve queries. The query function has the following signature:

fn(query) -> []Object

From our dataset, here are a few results:

fn("fo") -> {task1, task3}
fn("foo") -> {task1, task3}
fn("bar") -> {task2}
fn("3") -> {task3, task4, task5}
fn("6") -> {task6, task7}
fn("7") -> {task6, task7}

Now, we have our problem laid out correctly, we can handle various cases. Given the matched set based on prefix scan of the index, we choose an item or return an error indicating ambiguity.

We have the following rules:

Unique match for prefix or exact match on name or id succeeds.
Multiple matches without exact match returns ambiguous error.
Exact match under multiple items gets selected, favoring id.

The results applied to some queries follow:

Query	Result	Match	Reason
"fo"	`{task1, task3}`	Error	ambiguous
"foo"	`{task1, task3}`	`task1`	exact name match
"ba"	`{task2}`	`task2`	unique prefix match
"bar"	`{task2}`	`task2`	exact name match
"fo"	`{task1, task3}`	Error	ambiguous
"3"	`{task3, task4, task5}`	`task3`	id unique match
"6"	`{task6, task7}`	`task6`	id unique match
"7"	`{task6, task7}`	`task7`	id unique match

In this setup, you ensure that every object is accessible, which is guaranteed by the id uniqueness property. We get both name and id prefix matching, with resolved ambiguities. Impact of mutual name-id cycles (task6, task7 above) is minimized by favoring id over name on exact match.

I hope this clarifies the methodology.

dperny · 2016-08-29T22:10:02Z

closing in favor of #1279

feel free to reopen if this is in error.

GordonTheTurtle added status/0-triage dco/no labels Jul 20, 2016

doronp force-pushed the Fix-#1105 branch from 7b255fc to 7acc8a5 Compare July 20, 2016 05:53

GordonTheTurtle removed the dco/no label Jul 20, 2016

stevvooe reviewed Jul 20, 2016
View reviewed changes

doronp mentioned this pull request Jul 20, 2016

fix #1096 #1097

Closed

Fixes#1105 Make service commands work with id prefix as well as name

a654a6e

Signed-off-by: Doron Podoleanu <[email protected]>

doronp force-pushed the Fix-#1105 branch from 7acc8a5 to a654a6e Compare July 21, 2016 11:12

stevvooe mentioned this pull request Aug 19, 2016

cmd: fix name prefix matching #1279

Closed

dperny closed this Aug 29, 2016

AkihiroSuda mentioned this pull request Nov 30, 2016

api: allow NW name that is the prefix of a swarm NW ID moby/moby#27938

Merged

Conversation

doronp commented Jul 20, 2016

Uh oh!

doronp commented Jul 20, 2016

Uh oh!

codecov-io commented Jul 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Current coverage is 55.07%

Uh oh!

stevvooe Jul 20, 2016

Choose a reason for hiding this comment

Uh oh!

doronp Jul 20, 2016

Choose a reason for hiding this comment

Uh oh!

stevvooe Jul 20, 2016

Choose a reason for hiding this comment

Uh oh!

stevvooe commented Jul 20, 2016

Uh oh!

doronp commented Jul 21, 2016

Uh oh!

stevvooe commented Jul 21, 2016

Uh oh!

doronp commented Jul 27, 2016

Uh oh!

stevvooe commented Jul 27, 2016

Uh oh!

doronp commented Jul 28, 2016

Uh oh!

stevvooe commented Jul 28, 2016

Uh oh!

doronp commented Aug 3, 2016

Uh oh!

stevvooe commented Aug 3, 2016

Uh oh!

doronp commented Aug 4, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ezrasilvera commented Aug 10, 2016

Uh oh!

stevvooe commented Aug 10, 2016

Uh oh!

dperny commented Aug 29, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

codecov-io commented Jul 20, 2016 •

edited

Loading

doronp commented Aug 4, 2016 •

edited

Loading