Hi Praveen,
While creating the demo, I found and fixed a problem with input buffer management in my code, and now I get output from the decoder in 4-core mode.
But I have run into another issue: for some reason the decoder is much slower in 4-core mode than in 2-core mode. Could you please check the attached logs? You can use the timestamps to see the difference in decoding speed. Both logs are from decoding the same input, once with the 2-core decoder and once with the 4-core decoder. In both modes all the code is the same, and the multicore API (locks, barriers, memory management) is also the same.
Regards,
Andrey Lisnevich