Open XML Filter

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
13 messages Options
Reply | Threaded
Open this post in threaded view
|

Open XML Filter

Raphael Bircher-3
Hi all

Who is willing to work on the Open XML Filters? I have to explain you, why  
I'm asking.

In the past few month, I worked on a new project witch resulted in a NGO  
located in Switzerland called Vefko. The goal is, to offer other  
Association communication and collaboration tools.

We are now at the software evaluation, and Apache Openoffice is on the  
list. At the Moment it's mainly interesting in the headless mode. But the  
poor OOXML Filter it's something negative at AOO. I know, that this topic  
came op frequently. But I also know, that this is not an easy task.

So the big question, who is interested in something like this.

Regards, Raphael

--
My introduction https://youtu.be/Ln4vly5sxYU

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Dave Fisher
Hi -

If we can use Java then Apache POI has OOXML support.

Regards,
Dave

> On Oct 19, 2017, at 7:15 PM, Raphael Bircher <[hidden email]> wrote:
>
> Hi all
>
> Who is willing to work on the Open XML Filters? I have to explain you, why I'm asking.
>
> In the past few month, I worked on a new project witch resulted in a NGO located in Switzerland called Vefko. The goal is, to offer other Association communication and collaboration tools.
>
> We are now at the software evaluation, and Apache Openoffice is on the list. At the Moment it's mainly interesting in the headless mode. But the poor OOXML Filter it's something negative at AOO. I know, that this topic came op frequently. But I also know, that this is not an easy task.
>
> So the big question, who is interested in something like this.
>
> Regards, Raphael
>
> --
> My introduction https://youtu.be/Ln4vly5sxYU
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


signature.asc (817 bytes) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Raphael Bircher-3
Hi Dave

Am .10.2017, 04:20 Uhr, schrieb Dave Fisher <[hidden email]>:

> Hi -
>
> If we can use Java then Apache POI has OOXML support.

You mean, use POI as a library in Apache OpenOffice?

Regards, Raphael


--
My introduction https://youtu.be/Ln4vly5sxYU

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Dave Fisher
Hi Raphael,

Sent from my iPhone

> On Oct 19, 2017, at 7:50 PM, Raphael Bircher <[hidden email]> wrote:
>
> Hi Dave
>
> Am .10.2017, 04:20 Uhr, schrieb Dave Fisher <[hidden email]>:
>
>> Hi -
>>
>> If we can use Java then Apache POI has OOXML support.
>
> You mean, use POI as a library in Apache OpenOffice?

Yes.

Regards,
Dave

>
> Regards, Raphael
>
>
> --
> My introduction https://youtu.be/Ln4vly5sxYU
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Peter Kovacs-3
During Apache POI is a cool idea. I will have a look at the weekend. I am in general interested in improving OOXML but I am still very slow in getting towards dev productivity. ( today is my certificate exam in ab initio, when I pass openoffice can move up my priority list again. YAY!)

Raphael if you are evaluating headless servers, have you tried to set one up ?
Can you do that, and check if the office process crashes at 100% cpu after transformation?
Or write a guide how to setup a server?. I did not manage and I would like to get a bug report in that area tested.


Am 20. Oktober 2017 04:59:33 MESZ schrieb Dave Fisher <[hidden email]>:

>Hi Raphael,
>
>Sent from my iPhone
>
>> On Oct 19, 2017, at 7:50 PM, Raphael Bircher
><[hidden email]> wrote:
>>
>> Hi Dave
>>
>> Am .10.2017, 04:20 Uhr, schrieb Dave Fisher <[hidden email]>:
>>
>>> Hi -
>>>
>>> If we can use Java then Apache POI has OOXML support.
>>
>> You mean, use POI as a library in Apache OpenOffice?
>
>Yes.
>
>Regards,
>Dave
>
>>
>> Regards, Raphael
>>
>>
>> --
>> My introduction https://youtu.be/Ln4vly5sxYU
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: [hidden email]
>For additional commands, e-mail: [hidden email]

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Raphael Bircher-3
Hi Peter

Am .10.2017, 06:54 Uhr, schrieb Peter kovacs <[hidden email]>:

> During Apache POI is a cool idea. I will have a look at the weekend. I  
> am in general interested in improving OOXML but I am still very slow in  
> getting towards dev productivity. ( today is my certificate exam in ab  
> initio, when I pass openoffice can move up my priority list again. YAY!)

Yea, I think we have to discuss this solution. Btw. good luck by the exam.
>
> Raphael if you are evaluating headless servers, have you tried to set  
> one up ?

We will do tests, for sure. But in this case only on Linux. If I run in  
trouble, I will send a mail to the dev.

> Can you do that, and check if the office process crashes at 100% cpu  
> after transformation?
> Or write a guide how to setup a server?. I did not manage and I would  
> like to get a bug report in that area tested.

We have anyway to write a doku so why not document this on a public  
point... yes

Regards Raphael

--
My introduction https://youtu.be/Ln4vly5sxYU

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Damjan Jovanovic
In reply to this post by Peter Kovacs-3
I've looked at Apache POI before and even wrote a minimal filter that saves
text cells in Calc to XLSX, see my email on 22 December 2015.

Reading OOXML in Apache POI is problematic, we would have to build the
document tree twice, first in POI, then in AOO, as a SAX-like API is not
supported, so memory usage would double, and large documents would be
problematic. Writing is better, in that there is a streaming API that
doesn't use a lot of memory, but formulas might be a problem. I had more
practical issues in cell iteration APIs I needed that were not exported
over UNO.

Could we get any funding for better OOXML suport?

Damjan


On Fri, Oct 20, 2017 at 6:54 AM, Peter kovacs <[hidden email]> wrote:

> During Apache POI is a cool idea. I will have a look at the weekend. I am
> in general interested in improving OOXML but I am still very slow in
> getting towards dev productivity. ( today is my certificate exam in ab
> initio, when I pass openoffice can move up my priority list again. YAY!)
>
> Raphael if you are evaluating headless servers, have you tried to set one
> up ?
> Can you do that, and check if the office process crashes at 100% cpu after
> transformation?
> Or write a guide how to setup a server?. I did not manage and I would like
> to get a bug report in that area tested.
>
>
> Am 20. Oktober 2017 04:59:33 MESZ schrieb Dave Fisher <
> [hidden email]>:
> >Hi Raphael,
> >
> >Sent from my iPhone
> >
> >> On Oct 19, 2017, at 7:50 PM, Raphael Bircher
> ><[hidden email]> wrote:
> >>
> >> Hi Dave
> >>
> >> Am .10.2017, 04:20 Uhr, schrieb Dave Fisher <[hidden email]>:
> >>
> >>> Hi -
> >>>
> >>> If we can use Java then Apache POI has OOXML support.
> >>
> >> You mean, use POI as a library in Apache OpenOffice?
> >
> >Yes.
> >
> >Regards,
> >Dave
> >
> >>
> >> Regards, Raphael
> >>
> >>
> >> --
> >> My introduction https://youtu.be/Ln4vly5sxYU
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: [hidden email]
> >> For additional commands, e-mail: [hidden email]
> >>
> >
> >
> >---------------------------------------------------------------------
> >To unsubscribe, e-mail: [hidden email]
> >For additional commands, e-mail: [hidden email]
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>
>
Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Dave Fisher
Hi Damhan,

Sent from my iPhone

> On Oct 20, 2017, at 12:21 AM, Damjan Jovanovic <[hidden email]> wrote:
>
> I've looked at Apache POI before and even wrote a minimal filter that saves
> text cells in Calc to XLSX, see my email on 22 December 2015.
>
> Reading OOXML in Apache POI is problematic, we would have to build the
> document tree twice, first in POI, then in AOO, as a SAX-like API is not
> supported, so memory usage would double, and large documents would be
> problematic. Writing is better, in that there is a streaming API that
> doesn't use a lot of memory, but formulas might be a problem. I had more
> practical issues in cell iteration APIs I needed that were not exported
> over UNO.

There are efforts to reduce the memory requirements in POI on the read side. There is a lot of cross development between Tika and POI. Tika has a huge regression set of documents it knows about on the web.

>
> Could we get any funding for better OOXML suport?

Raphael?

Regards,
Dave


>
> Damjan
>
>
>> On Fri, Oct 20, 2017 at 6:54 AM, Peter kovacs <[hidden email]> wrote:
>>
>> During Apache POI is a cool idea. I will have a look at the weekend. I am
>> in general interested in improving OOXML but I am still very slow in
>> getting towards dev productivity. ( today is my certificate exam in ab
>> initio, when I pass openoffice can move up my priority list again. YAY!)
>>
>> Raphael if you are evaluating headless servers, have you tried to set one
>> up ?
>> Can you do that, and check if the office process crashes at 100% cpu after
>> transformation?
>> Or write a guide how to setup a server?. I did not manage and I would like
>> to get a bug report in that area tested.
>>
>>
>> Am 20. Oktober 2017 04:59:33 MESZ schrieb Dave Fisher <
>> [hidden email]>:
>>> Hi Raphael,
>>>
>>> Sent from my iPhone
>>>
>>>> On Oct 19, 2017, at 7:50 PM, Raphael Bircher
>>> <[hidden email]> wrote:
>>>>
>>>> Hi Dave
>>>>
>>>> Am .10.2017, 04:20 Uhr, schrieb Dave Fisher <[hidden email]>:
>>>>
>>>>> Hi -
>>>>>
>>>>> If we can use Java then Apache POI has OOXML support.
>>>>
>>>> You mean, use POI as a library in Apache OpenOffice?
>>>
>>> Yes.
>>>
>>> Regards,
>>> Dave
>>>
>>>>
>>>> Regards, Raphael
>>>>
>>>>
>>>> --
>>>> My introduction https://youtu.be/Ln4vly5sxYU
>>>>
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>>
>>>
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [hidden email]
>> For additional commands, e-mail: [hidden email]
>>
>>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Funds for Re: Open XML Filter

Peter Kovacs-3
@Andrea you said if we want to do something we should reach out. So I think this might be a thing:

How about creating a project plan, estimates for a budget and then start a campaign?

We could also check if we can staff the team introducing the people to interested financiers.
Which dev would be interested in general?

All the best
Peter

Am 20. Oktober 2017 17:16:59 MESZ schrieb Dave Fisher <[hidden email]>:

>Hi Damhan,
>
>Sent from my iPhone
>
>> On Oct 20, 2017, at 12:21 AM, Damjan Jovanovic <[hidden email]>
>wrote:
>>
>> I've looked at Apache POI before and even wrote a minimal filter that
>saves
>> text cells in Calc to XLSX, see my email on 22 December 2015.
>>
>> Reading OOXML in Apache POI is problematic, we would have to build
>the
>> document tree twice, first in POI, then in AOO, as a SAX-like API is
>not
>> supported, so memory usage would double, and large documents would be
>> problematic. Writing is better, in that there is a streaming API that
>> doesn't use a lot of memory, but formulas might be a problem. I had
>more
>> practical issues in cell iteration APIs I needed that were not
>exported
>> over UNO.
>
>There are efforts to reduce the memory requirements in POI on the read
>side. There is a lot of cross development between Tika and POI. Tika
>has a huge regression set of documents it knows about on the web.
>
>>
>> Could we get any funding for better OOXML suport?
>
>Raphael?
>
>Regards,
>Dave
>
>
>>
>> Damjan
>>
>>
>>> On Fri, Oct 20, 2017 at 6:54 AM, Peter kovacs <[hidden email]>
>wrote:
>>>
>>> During Apache POI is a cool idea. I will have a look at the weekend.
>I am
>>> in general interested in improving OOXML but I am still very slow in
>>> getting towards dev productivity. ( today is my certificate exam in
>ab
>>> initio, when I pass openoffice can move up my priority list again.
>YAY!)
>>>
>>> Raphael if you are evaluating headless servers, have you tried to
>set one
>>> up ?
>>> Can you do that, and check if the office process crashes at 100% cpu
>after
>>> transformation?
>>> Or write a guide how to setup a server?. I did not manage and I
>would like
>>> to get a bug report in that area tested.
>>>
>>>
>>> Am 20. Oktober 2017 04:59:33 MESZ schrieb Dave Fisher <
>>> [hidden email]>:
>>>> Hi Raphael,
>>>>
>>>> Sent from my iPhone
>>>>
>>>>> On Oct 19, 2017, at 7:50 PM, Raphael Bircher
>>>> <[hidden email]> wrote:
>>>>>
>>>>> Hi Dave
>>>>>
>>>>> Am .10.2017, 04:20 Uhr, schrieb Dave Fisher
><[hidden email]>:
>>>>>
>>>>>> Hi -
>>>>>>
>>>>>> If we can use Java then Apache POI has OOXML support.
>>>>>
>>>>> You mean, use POI as a library in Apache OpenOffice?
>>>>
>>>> Yes.
>>>>
>>>> Regards,
>>>> Dave
>>>>
>>>>>
>>>>> Regards, Raphael
>>>>>
>>>>>
>>>>> --
>>>>> My introduction https://youtu.be/Ln4vly5sxYU
>>>>>
>>>>>
>---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: [hidden email]
>>>>> For additional commands, e-mail: [hidden email]
>>>>>
>>>>
>>>>
>>>>
>---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: [hidden email]
>>>> For additional commands, e-mail: [hidden email]
>>>
>>>
>---------------------------------------------------------------------
>>> To unsubscribe, e-mail: [hidden email]
>>> For additional commands, e-mail: [hidden email]
>>>
>>>
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: [hidden email]
>For additional commands, e-mail: [hidden email]

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

Raphael Bircher-3
In reply to this post by Dave Fisher
Hi all.

Am .10.2017, 17:16 Uhr, schrieb Dave Fisher <[hidden email]>:

> Hi Damhan,
>
> Sent from my iPhone
>
>> On Oct 20, 2017, at 12:21 AM, Damjan Jovanovic <[hidden email]>  
>> wrote:
>>
>> I've looked at Apache POI before and even wrote a minimal filter that  
>> saves
>> text cells in Calc to XLSX, see my email on 22 December 2015.
>>
>> Reading OOXML in Apache POI is problematic, we would have to build the
>> document tree twice, first in POI, then in AOO, as a SAX-like API is not
>> supported, so memory usage would double, and large documents would be
>> problematic. Writing is better, in that there is a streaming API that
>> doesn't use a lot of memory, but formulas might be a problem. I had more
>> practical issues in cell iteration APIs I needed that were not exported
>> over UNO.
>
> There are efforts to reduce the memory requirements in POI on the read  
> side. There is a lot of cross development between Tika and POI. Tika has  
> a huge regression set of documents it knows about on the web.
>
>>
>> Could we get any funding for better OOXML suport?
>
> Raphael?

We are not in the position to put money into the project. However, we can  
Collect money for this project, and pay developers. That should be  
possible. We also willing to invest some menpower.

Regards, Raphael


--
My introduction https://youtu.be/Ln4vly5sxYU

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Open XML Filter

FR web forum
>We are not in the position to put money into the project. However, we can  
>Collect money for this project, and pay developers. That should be  
>possible. We also willing to invest some menpower.

Thanks to keep us up to date if you funds this filter.
Through a crowdfunding website?

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Funds for Re: Open XML Filter

Andrea Pescetti-2
In reply to this post by Peter Kovacs-3
Peter kovacs wrote:
> How about creating a project plan, estimates for a budget and then start a campaign?

I think that getting money would not be the hardest part. The hardest
part would be to find people who are trusted and competent enough. The
code is complex and different people have different competences, and
this should be taken into account too.

Regards,
   Andrea.

---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]

Reply | Threaded
Open this post in threaded view
|

Re: Funds for Re: Open XML Filter

Dave Fisher
Hi -

A third party entity could be formed to pay for development. An architecture and roadmap created either “there” or in the AOO project. Then developers could be recruited and paid.

Regards,
Dave

Sent from my iPhone

> On Oct 23, 2017, at 4:03 PM, Andrea Pescetti <[hidden email]> wrote:
>
> Peter kovacs wrote:
>> How about creating a project plan, estimates for a budget and then start a campaign?
>
> I think that getting money would not be the hardest part. The hardest part would be to find people who are trusted and competent enough. The code is complex and different people have different competences, and this should be taken into account too.
>
> Regards,
>  Andrea.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [hidden email]
> For additional commands, e-mail: [hidden email]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [hidden email]
For additional commands, e-mail: [hidden email]