Module Core_kernel.Bigstring
String type based on Bigarray, for use in I/O and C-bindings.
Types and exceptions
type t= (Core_kernel__.Import.char, Stdlib.Bigarray.int8_unsigned_elt, Stdlib.Bigarray.c_layout) Stdlib.Bigarray.Array1.tType of bigstrings
val compare : t -> t -> Core_kernel__.Import.intval sexp_of_t : t -> Ppx_sexp_conv_lib.Sexp.t
type t_frozen= tType of bigstrings which support hashing. Note that mutation invalidates previous hashes.
val compare_t_frozen : t_frozen -> t_frozen -> Core_kernel__.Import.intval hash_fold_t_frozen : Base.Hash.state -> t_frozen -> Base.Hash.stateval hash_t_frozen : t_frozen -> Base.Hash.hash_valueval sexp_of_t_frozen : t_frozen -> Ppx_sexp_conv_lib.Sexp.t
include module type of Base_bigstring with type Base_bigstring.t := t and type Base_bigstring.t_frozen := t_frozen
Types and exceptions
type t= (Base.char, Stdlib.Bigarray.int8_unsigned_elt, Stdlib.Bigarray.c_layout) Stdlib.Bigarray.Array1.tType of bigstrings
include Ppx_sexp_conv_lib.Sexpable.S with type t := t
val t_of_sexp : Sexplib0.Sexp.t -> tval sexp_of_t : t -> Sexplib0.Sexp.t
type t_frozen= tType of bigstrings which support hashing. Note that mutation invalidates previous hashes.
val compare_t_frozen : t_frozen -> t_frozen -> Base.intval hash_fold_t_frozen : Base.Hash.state -> t_frozen -> Base.Hash.stateval hash_t_frozen : t_frozen -> Base.Hash.hash_valueval sexp_of_t_frozen : t_frozen -> Ppx_sexp_conv_lib.Sexp.tval t_frozen_of_sexp : Ppx_sexp_conv_lib.Sexp.t -> t_frozen
Creation and string conversion
val create : ?max_mem_waiting_gc_in_bytes:Base.int -> Base.int -> tcreate length- parameter max_mem_waiting_gc
default = 256 M in OCaml <= 3.12, 1 G otherwise. As the total allocation of calls to
createapproachmax_mem_waiting_gc_in_bytes, the pressure in the garbage collector to be more agressive will increase.
- returns
a new bigstring having
length. Content is undefined.
val init : Base.int -> f:(Base.int -> Base.char) -> tinit n ~fcreates a bigstringtof lengthn, witht.{i} = f i.
val of_string : ?pos:Base.int -> ?len:Base.int -> Base.string -> tof_string ?pos ?len str- returns
a new bigstring that is equivalent to the substring of length
leninstrstarting at positionpos.
- parameter pos
default = 0
- parameter len
default =
String.length str - pos
val of_bytes : ?pos:Base.int -> ?len:Base.int -> Base.bytes -> tof_bytes ?pos ?len str- returns
a new bigstring that is equivalent to the subbytes of length
leninstrstarting at positionpos.
- parameter pos
default = 0
- parameter len
default =
Bytes.length str - pos
val to_string : ?pos:Base.int -> ?len:Base.int -> t -> Base.stringto_string ?pos ?len bstr- returns
a new string that is equivalent to the substring of length
leninbstrstarting at positionpos.
- parameter pos
default = 0
- parameter len
default =
length bstr - pos
- raises Invalid_argument
if the string would exceed runtime limits.
val to_bytes : ?pos:Base.int -> ?len:Base.int -> t -> Base.bytesto_bytes ?pos ?len bstr- returns
a new byte sequence that is equivalent to the substring of length
leninbstrstarting at positionpos.
- parameter pos
default = 0
- parameter len
default =
length bstr - pos
- raises Invalid_argument
if the bytes would exceed runtime limits.
Checking
val check_args : loc:Base.string -> pos:Base.int -> len:Base.int -> t -> Base.unitcheck_args ~loc ~pos ~len bstrchecks the position and length argumentsposandlenfor bigstringsbstr.- raises
Invalid_argument if these arguments are illegal for the given bigstring using
locto indicate the calling context.
val get_opt_len : t -> pos:Base.int -> Base.int Base.option -> Base.intget_opt_len bstr ~pos opt_len- returns
the length of a subbigstring in
bstrstarting at positionposand given optional lengthopt_len. This function does not check the validity of its arguments. Usecheck_argsfor that purpose.
Accessors
Blitting
include Base.Blit.S with type t := t
val blit : (t, t) Base__.Blit_intf.blitval blito : (t, t) Base__.Blit_intf.blitoval unsafe_blit : (t, t) Base__.Blit_intf.blitval sub : (t, t) Base__.Blit_intf.subval subo : (t, t) Base__.Blit_intf.subo
module To_string = Base_bigstring.To_stringmodule From_string = Base_bigstring.From_stringmodule To_bytes = Base_bigstring.To_bytesmodule From_bytes = Base_bigstring.From_bytesSearch
Accessors for parsing binary values, analogous to Binary_packing
These are in Bigstring rather than a separate module because:
1. Existing Binary_packing requires copies and does not work with bigstrings. 2. The accessors rely on the implementation of bigstring, and hence should change should the implementation of bigstring move away from Bigarray. 3. Bigstring already has some external C functions, so it didn't require many changes to the jbuild ^_^.
In a departure from Binary_packing, the naming conventions are chosen to be close to C99 stdint types, as it's a more standard description and it is somewhat useful in making compact macros for the implementations. The accessor names contain endian-ness to allow for branch-free implementations
<accessor> ::= <unsafe><operation><type><endian> <unsafe> ::= unsafe_ | '' <operation> ::= get_ | set_ <type> ::= int8 | uint8 | int16 | uint16 | int32 | uint32 | int64 | uint64 <endian> ::= _le | _be | ''
The unsafe_ prefix indicates that these functions do no bounds checking and silently truncate out-of-range numeric arguments.
val get_int8 : t -> pos:Base.int -> Base.intval set_int8_exn : t -> pos:Base.int -> Base.int -> Base.unitval get_uint8 : t -> pos:Base.int -> Base.intval set_uint8_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_int8 : t -> pos:Base.int -> Base.intval unsafe_set_int8 : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_uint8 : t -> pos:Base.int -> Base.intval unsafe_set_uint8 : t -> pos:Base.int -> Base.int -> Base.unit
16-bit methods
val get_int16_le : t -> pos:Base.int -> Base.intval get_int16_be : t -> pos:Base.int -> Base.intval set_int16_le_exn : t -> pos:Base.int -> Base.int -> Base.unitval set_int16_be_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_int16_le : t -> pos:Base.int -> Base.intval unsafe_get_int16_be : t -> pos:Base.int -> Base.intval unsafe_set_int16_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_int16_be : t -> pos:Base.int -> Base.int -> Base.unitval get_uint16_le : t -> pos:Base.int -> Base.intval get_uint16_be : t -> pos:Base.int -> Base.intval set_uint16_le_exn : t -> pos:Base.int -> Base.int -> Base.unitval set_uint16_be_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_uint16_le : t -> pos:Base.int -> Base.intval unsafe_get_uint16_be : t -> pos:Base.int -> Base.intval unsafe_set_uint16_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_uint16_be : t -> pos:Base.int -> Base.int -> Base.unit
32-bit methods
val get_int32_le : t -> pos:Base.int -> Base.intval get_int32_be : t -> pos:Base.int -> Base.intval set_int32_le_exn : t -> pos:Base.int -> Base.int -> Base.unitval set_int32_be_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_int32_le : t -> pos:Base.int -> Base.intval unsafe_get_int32_be : t -> pos:Base.int -> Base.intval unsafe_set_int32_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_int32_be : t -> pos:Base.int -> Base.int -> Base.unitval get_uint32_le : t -> pos:Base.int -> Base.intval get_uint32_be : t -> pos:Base.int -> Base.intval set_uint32_le_exn : t -> pos:Base.int -> Base.int -> Base.unitval set_uint32_be_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_uint32_le : t -> pos:Base.int -> Base.intval unsafe_get_uint32_be : t -> pos:Base.int -> Base.intval unsafe_set_uint32_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_uint32_be : t -> pos:Base.int -> Base.int -> Base.unit
64-bit signed values
val get_int64_le_exn : t -> pos:Base.int -> Base.intval get_int64_be_exn : t -> pos:Base.int -> Base.intval get_int64_le_trunc : t -> pos:Base.int -> Base.intval get_int64_be_trunc : t -> pos:Base.int -> Base.intval set_int64_le : t -> pos:Base.int -> Base.int -> Base.unitval set_int64_be : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_int64_le_exn : t -> pos:Base.int -> Base.intval unsafe_get_int64_be_exn : t -> pos:Base.int -> Base.intval unsafe_get_int64_le_trunc : t -> pos:Base.int -> Base.intval unsafe_get_int64_be_trunc : t -> pos:Base.int -> Base.intval unsafe_set_int64_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_int64_be : t -> pos:Base.int -> Base.int -> Base.unit
64-bit unsigned values
val get_uint64_be_exn : t -> pos:Base.int -> Base.intval get_uint64_le_exn : t -> pos:Base.int -> Base.intval set_uint64_le_exn : t -> pos:Base.int -> Base.int -> Base.unitval set_uint64_be_exn : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_get_uint64_be_exn : t -> pos:Base.int -> Base.intval unsafe_get_uint64_le_exn : t -> pos:Base.int -> Base.intval unsafe_set_uint64_le : t -> pos:Base.int -> Base.int -> Base.unitval unsafe_set_uint64_be : t -> pos:Base.int -> Base.int -> Base.unit
32-bit methods with full precision
val get_int32_t_le : t -> pos:Base.int -> Base.Int32.tval get_int32_t_be : t -> pos:Base.int -> Base.Int32.tval set_int32_t_le : t -> pos:Base.int -> Base.Int32.t -> Base.unitval set_int32_t_be : t -> pos:Base.int -> Base.Int32.t -> Base.unitval unsafe_get_int32_t_le : t -> pos:Base.int -> Base.Int32.tval unsafe_get_int32_t_be : t -> pos:Base.int -> Base.Int32.tval unsafe_set_int32_t_le : t -> pos:Base.int -> Base.Int32.t -> Base.unitval unsafe_set_int32_t_be : t -> pos:Base.int -> Base.Int32.t -> Base.unit
64-bit methods with full precision
val get_int64_t_le : t -> pos:Base.int -> Base.Int64.tval get_int64_t_be : t -> pos:Base.int -> Base.Int64.tval set_int64_t_le : t -> pos:Base.int -> Base.Int64.t -> Base.unitval set_int64_t_be : t -> pos:Base.int -> Base.Int64.t -> Base.unitval unsafe_get_int64_t_le : t -> pos:Base.int -> Base.Int64.tval unsafe_get_int64_t_be : t -> pos:Base.int -> Base.Int64.tval unsafe_set_int64_t_le : t -> pos:Base.int -> Base.Int64.t -> Base.unitval unsafe_set_int64_t_be : t -> pos:Base.int -> Base.Int64.t -> Base.unit
module Private = Base_bigstring.PrivateCreation and string conversion
val create : ?max_mem_waiting_gc:Core_kernel__.Byte_units0.t -> Core_kernel__.Import.int -> tcreate length- parameter max_mem_waiting_gc
default = 256 M in OCaml <= 3.12, 1 G otherwise. As the total allocation of calls to
createapproachmax_mem_waiting_gc, the pressure in the garbage collector to be more agressive will increase.
- returns
a new bigstring having
length. Content is undefined.
sub_shared ?pos ?len bstr- returns
the sub-bigstring in
bstrthat starts at positionposand has lengthlen. The sub-bigstring shares the same memory region, i.e. modifying it will modify the original bigstring. Holding on to the sub-bigstring will also keep the (usually bigger) original one around.
- parameter pos
default = 0
- parameter len
default =
Bigstring.length bstr - pos
Reading/writing bin-prot
val write_bin_prot : t -> ?pos:Core_kernel__.Import.int -> 'a Bin_prot.Type_class.writer -> 'a -> Core_kernel__.Import.intwrite_bin_prot t writer awritesatotstarting atpos, and returns the index intimmediately after the last byte written. It raises ifpos < 0or ifadoesn't fit int.
val read_bin_prot : t -> ?pos:Core_kernel__.Import.int -> ?len:Core_kernel__.Import.int -> 'a Bin_prot.Type_class.reader -> ('a * Core_kernel__.Import.int) Or_error.tThe
read_bin_prot*functions read from the region oftstarting atposof lengthlen. They return the index intimmediately after the last byte read. They raise ifposandlendon't describe a region oft.
val read_bin_prot_verbose_errors : t -> ?pos:Core_kernel__.Import.int -> ?len:Core_kernel__.Import.int -> 'a Bin_prot.Type_class.reader -> [ `Invalid_data of Error.t | `Not_enough_data | `Ok of 'a * Core_kernel__.Import.int ]
Destruction
val unsafe_destroy : t -> Core_kernel__.Import.unitunsafe_destroy bstrdestroys the bigstring by deallocating its associated data or, if memory-mapped, unmapping the corresponding file, and setting all dimensions to zero. This effectively frees the associated memory or address-space resources instantaneously. This feature helps working around a bug in the current OCaml runtime, which does not correctly estimate how aggressively to reclaim such resources.This operation is safe unless you have passed the bigstring to another thread that is performing operations on it at the same time. Access to the bigstring after this operation will yield array bounds exceptions.
- raises Failure
if the bigstring has already been deallocated (or deemed "external", which is treated equivalently), or if it has proxies, i.e. other bigstrings referring to the same data.
val unsafe_destroy_and_resize : t -> len:Core_kernel__.Import.int -> tunsafe_destroy_and_resize bstr ~lenreallocates the memory backingbstrand returns a new bigstring that starts at position 0 and has lengthlen. Iflenis greater thanlength bstrthen the newly allocated memory will not be initialized.Similar to
unsafe_destroy, this operation is safe unless you have passed the bigstring to another thread that is performing operations on it at the same time. Access tobstrafter this operation will yield array bounds exceptions.- raises Failure
if the bigstring has already been deallocated (or deemed "external", which is treated equivalently), if it is backed by a memory map, or if it has proxies, i.e. other bigstrings referring to the same data.
val get_tail_padded_fixed_string : padding:Core_kernel__.Import.char -> t -> pos:Core_kernel__.Import.int -> len:Core_kernel__.Import.int -> Core_kernel__.Import.unit -> Core_kernel__.Import.stringSimilar to
Binary_packing.unpack_tail_padded_fixed_stringand.pack_tail_padded_fixed_string.
val set_tail_padded_fixed_string : padding:Core_kernel__.Import.char -> t -> pos:Core_kernel__.Import.int -> len:Core_kernel__.Import.int -> Core_kernel__.Import.string -> Core_kernel__.Import.unitval get_head_padded_fixed_string : padding:Core_kernel__.Import.char -> t -> pos:Core_kernel__.Import.int -> len:Core_kernel__.Import.int -> Core_kernel__.Import.unit -> Core_kernel__.Import.stringval set_head_padded_fixed_string : padding:Core_kernel__.Import.char -> t -> pos:Core_kernel__.Import.int -> len:Core_kernel__.Import.int -> Core_kernel__.Import.string -> Core_kernel__.Import.unit
module Unstable : sig ... endmodule Stable : sig ... end