GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:16:38 Sequence gi568815584r:37491368_37694972 : 203605 bp : 38.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 4915 5648 734 2 2 67 48 303 0.779 16.84 1.02 PlyA + 5692 5697 6 1.05 2.00 Prom + 5809 5848 40 -5.65 2.01 Init + 11392 11684 293 0 2 47 50 248 0.051 13.57 2.02 Intr + 13676 13803 128 2 2 24 82 88 0.024 1.20 2.03 Intr + 47253 47431 179 1 2 22 33 167 0.136 3.32 2.04 Intr + 49752 49807 56 2 2 118 43 47 0.500 0.06 2.05 Intr + 49960 50034 75 1 0 127 95 -22 0.293 0.11 2.06 Intr + 51266 51314 49 2 1 63 100 35 0.564 -0.04 2.07 Term + 55538 55675 138 1 0 72 41 191 0.990 9.78 2.08 PlyA + 56261 56266 6 1.05 3.17 PlyA - 56473 56468 6 1.05 3.16 Term - 82176 82162 15 0 0 126 51 14 0.541 -1.24 3.15 Intr - 83938 83840 99 1 0 58 60 144 0.192 7.99 3.14 Intr - 92424 92187 238 2 1 79 28 115 0.080 1.29 3.13 Intr - 94063 93985 79 2 1 59 49 57 0.093 -3.41 3.12 Intr - 94217 94137 81 0 0 55 62 77 0.033 0.49 3.11 Intr - 101344 100033 1312 1 1 110 93 1431 0.097 132.16 3.10 Intr - 103617 103534 84 0 0 26 78 104 0.001 2.30 3.09 Intr - 106852 106686 167 2 2 95 39 151 0.005 9.66 3.08 Intr - 107143 106888 256 1 1 -9 47 174 0.559 -0.31 3.07 Intr - 108417 108224 194 1 2 -8 88 129 0.508 1.59 3.06 Intr - 108707 108557 151 2 1 59 62 110 0.501 4.31 3.05 Intr - 110524 110396 129 1 0 34 49 100 0.538 0.67 3.04 Intr - 111362 111152 211 1 1 42 80 139 0.957 6.49 3.03 Intr - 111960 111785 176 0 2 37 62 140 0.764 4.12 3.02 Intr - 120272 119957 316 1 1 122 89 117 0.191 10.34 3.01 Init - 121621 121614 8 1 2 62 98 5 0.139 -0.92 3.00 Prom - 123925 123886 40 -7.55 4.00 Prom + 127516 127555 40 -5.85 4.01 Init + 130947 131636 690 2 0 58 50 752 0.036 61.50 4.02 Intr + 132925 133047 123 0 0 -21 75 138 0.022 1.46 4.03 Intr + 140302 140375 74 0 2 60 86 39 0.129 -1.81 4.04 Term + 142102 142534 433 1 1 40 48 261 0.330 11.08 4.05 PlyA + 142971 142976 6 1.05 5.00 Prom + 147287 147326 40 -4.55 5.01 Init + 156471 156591 121 2 1 14 57 112 0.171 1.10 5.02 Intr + 158210 158308 99 0 0 79 23 110 0.060 2.76 5.03 Intr + 187035 187216 182 1 2 106 46 27 0.005 -1.03 5.04 Intr + 188784 188894 111 2 0 86 58 38 0.006 0.26 5.05 Intr + 191391 191597 207 2 0 49 107 185 0.149 14.85 5.06 Term + 196681 196749 69 0 0 52 49 85 0.264 -2.04 5.07 PlyA + 197506 197511 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 11392 11688 297 0 0 47 54 248 0.934 12.79 S.002 Term - 101344 99998 1347 1 0 110 36 1427 0.900 130.18 S.003 Term - 106473 106217 257 1 2 27 48 278 0.952 12.46 S.004 Intr - 106852 106682 171 2 0 95 39 154 0.981 10.19 S.005 Init - 115469 115302 168 2 0 74 -33 164 0.827 2.48 S.006 Sngl + 130947 131660 714 2 0 58 38 789 0.892 64.87 S.007 Term - 181905 181803 103 2 1 111 41 111 0.882 5.47 S.008 Init + 191418 191597 180 2 0 69 107 161 0.817 15.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:37491368_37694972|GENSCAN_predicted_peptide_1|244_aa XLEVLARAIRQKKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSG YKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLN EIKEDTNKWKNIPCSWVGIINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFVWN QKRVCIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPH RSDL >gi568815584r:37491368_37694972|GENSCAN_predicted_CDS_1|735_bp ntgttggaagttctggccagggcaattaggcagaagaaggaaataaagggtattcaatta ggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaac cccattgtctcagcccaaaacctccttaagctgataagcaacttcagcaaagtctcagga tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaataatc aatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcgtatggaac caaaaaagagtctgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacaccgcat agatctgatctttga >gi568815584r:37491368_37694972|GENSCAN_predicted_peptide_2|305_aa MQNDFDELAEVGFRRSVITKFSELKEHVQTHYKEAKNLEKRLHEWLTRINSVEKTLNDLM EMKTMAGELHDACTSFNRRCNQVQERISVIEDQINEIKLTQEEVDPLNRPITGSEIEAII NSLPPKKVQNQTDSQLNSTKAARAPPFESGLAAIHFNQQKKARDVVQLLNKVSGGLTDSG IALLLTCADHVNKSGPAYLRLLTLDLLVTQNFSTSEILSLESGFNHFLPKTVSLCPRLFH PTALHHQEAFTDLSSPSIPKVGKAGGCTEEEGWNRDHEDSDLIEKKRQSGEAITSGDQAA SFNTV >gi568815584r:37491368_37694972|GENSCAN_predicted_CDS_2|918_bp atgcagaatgactttgatgagttggcagaagtaggcttcagaaggtcggtaataacaaaa ttctctgagctaaaggagcatgttcaaacccattacaaggaagctaaaaaccttgaaaaa aggttacatgaatggctaactagaataaacagtgtagagaagaccttaaatgacctgatg gagatgaaaaccatggcaggagaacttcatgatgcatgcacaagcttcaataggcgatgc aatcaagtgcaagaaaggatatcagtgattgaagatcaaattaatgaaataaaactaacc caggaggaagttgatcctctaaatagaccaataacaggctctgaaattgaggcaataatt aatagcctaccaccaaaaaaagtccagaaccagacagattcacagctgaattctaccaaa gctgctcgtgctccaccctttgaatctgggctggctgcgattcactttaaccaacaaaag aaagcaagggatgttgtacaacttctgaacaaggtctcaggaggcctcactgattctggc attgccctcttgctgacttgtgctgaccacgtgaataaatccgggccagcttacttgaga ctactcactctggatcttcttgtcactcagaacttttctacatctgaaatcctaagtctg gagtctgggtttaatcactttctccctaagacagtttccttatgtcctcgtctttttcac cctactgccctgcatcatcaggaagcctttacagacctttcctctccttccatacccaag gttggaaaggctggtggatgtactgaggaagaaggttggaaccgggaccatgaggacagt gatctgattgaaaaaaaacgacagtctggggaagcgatcacatctggtgaccaggctgct tcattcaacactgtgtaa >gi568815584r:37491368_37694972|GENSCAN_predicted_peptide_3|1171_aa MASPVGRRAPTSPFHQVLNPSSSSSTRAHSPTKGSPNSSPSAPAPFSTLISAGVNLPQRI SEAEQRGQPLRPNNAPLLRKAPTRSHRAPGRVLLSPGIEGASAPIQVATLFLQLTIARIR ASGPGSSLGKRGNGGTEGLAGFPSILEVGSTAVSVRTMLPHPNPNARGKVQFGEEDLKAG RLGCRSSAASEPRLNRGFSGFIHPVGQWGWEEHRGHQMRRRIAQPCAFSSDSSPEEESIS AIERCGCCLSTARKSEMFDFGSHVDHPIWAKRAADSAHVQAPSPRGFKVERESPRILSAA QSRDRAPQEGDRLDEEGPAGAEHVLQQALTGGTSSFQRCRLPSRGNRPSQSFSFEYMKMR VDFGKAMLSRGQRDGFVARGRGRVGSSSGSDIRLRVLRDCAGSRPHGCSKGSFREGRSRF LRWRLPGVPAHPTVLTAADPPLEEPALPATWTAPSCRRPVLNTGKFNPGHRRSRHVPALA AEDRPGFLQRAAQSLQSTVSNGVGAYLGKEVGKEQAAAAGPGYRLASNSQTAALLYGGSR MLGTVKMEGHETSDWNSYYADTQEAYSSVPVSNMNSGLGSMNSMNTYMTMNTMTTSGNMT PASFNMSYANPGLGAGLSPGAVAGMPGGSAGAMNSMTAAGVTAMGTALSPSGMGAMGAQQ AASMNGLGPYAAAMNPCMSPMAYAPSNLGRSRAGGGGDAKTFKRSYPHAKPPYSYISLIT MAIQQAPSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGK GSYWTLHPDSGNMFENGCYLRRQKRFKCEKQPGAGGGGGSGSGGSGAKGGPESRKDPSGA SNPSADSPLHRGVHGKTGQLEGAPAPGPAASPQTLDHSGATATGGASELKTPASSTAPPI SSGPGALASVPASHPAHGLAPHESQLHLKGDPHYSFNHPFSINNLMSSSEQQHKLDFKAY EQALQYSPYGSTLPASLPLGSASVTTRSPIEPSALEPAYYQGPIMHQGGSRRPPSPVLLE SGISLHDSVQLYTRNPTAAQPWAPTTAASSAAFRSWEAIPGARGALGRRAALRPWLSTQG SQHPGRAAQPRRGAGCTRNLARKNWAPSRFHASHCGKAPQEPGRGPRDTRSRPRALQKSA IEKKTLSGPCSINLSHKYPVEAHEKVLDLGP >gi568815584r:37491368_37694972|GENSCAN_predicted_CDS_3|3516_bp atggcaagccccgtggggcgtcgtgcacctacctctcccttccaccaggttctcaatcct tccagctcctcttcaacccgtgctcattctcccacgaaaggcagcccaaactccagcccc agtgccccagcccctttttctaccctgatttctgccggggtgaacctgccacagcggatt agcgaggcggaacagaggggacaaccgctccgcccaaacaacgcacctctccttcgcaaa gcgcccacacgaagtcaccgcgccccaggccgcgttctcctctcacccggcatcgagggc gcctccgctccaattcaggtggcgactttgttcctacaactgacgattgccagaatccgg gcatcaggtccaggctcctcacttgggaaaagaggaaacggaggcacagagggcctggct gggttccctagcatattagaggtcggcagcactgcggtcagcgtcagaaccatgctgccc caccccaatcctaacgccaggggtaaggtgcagtttggagaagaggacctgaaagcagga aggttggggtgcagatcctccgccgcatcggagccacggctgaaccgtgggttcagtggg ttcatccacccggtgggacagtgggggtgggaggaacatcgaggacaccagatgcggcgg aggatcgcacagccttgtgccttctcgagtgatagcagcccagaagaggagtcaatatct gctattgaaaggtgtggatgctgcctttctacagcgaggaaaagtgaaatgtttgatttt gggtcacacgttgaccaccctatatgggcaaagcgagctgctgattctgcccacgtccag gcccctagccctaggggttttaaagtagagcgggaaagcccgaggatcctttcagcagca cagagcagagaccgcgctccccaggagggggatcggctggatgaggagggaccggctggg gctgagcacgtcctccaacaggcgctgaccgggggcacttcttccttccaacgctgtcga ctgcccagcaggggaaatcgcccttctcagtctttctcatttgaatacatgaaaatgcga gttgattttggcaaggcgatgctctcccgtggccagagggacggttttgtcgcccgcggg cgaggccgggtggggagcagctccggctccgacatccggctgcgggtactgcgggactgc gccgggagcaggccccacgggtgtagcaagggcagcttccgggaagggcggagccgcttc ctccgctggcggctacccggagtacccgcgcaccctacagtcctcactgccgcggatccc ccacttgaggagcccgccctccccgcgacctggaccgccccaagctgtcgcagacccgtc ttaaacacaggcaagtttaacccgggacaccgcaggagccgccacgtgcctgccctcgcg gctgaagacaggcctggtttcctccaacgagcagcacaatccttgcaaagcacagtttcc aatggtgtaggtgcctatttgggaaaagaggttggaaaagagcaggctgcagccgctgga cctggctaccgactggccagcaactcccagacagcagcattactttatggtggctccagg atgttaggaactgtgaagatggaagggcatgaaaccagcgactggaacagctactacgca gacacgcaggaggcctactcctccgtcccggtcagcaacatgaactcaggcctgggctcc atgaactccatgaacacctacatgaccatgaacaccatgactacgagcggcaacatgacc ccggcgtccttcaacatgtcctatgccaacccgggcctaggggccggcctgagtcccggc gcagtagccggcatgccggggggctcggcgggcgccatgaacagcatgactgcggccggc gtgacggccatgggtacggcgctgagcccgagcggcatgggcgccatgggtgcgcagcag gcggcctccatgaatggcctgggcccctacgcggccgccatgaacccgtgcatgagcccc atggcgtacgcgccgtccaacctgggccgcagccgcgcgggcggcggcggcgacgccaag acgttcaagcgcagctacccgcacgccaagccgccctactcgtacatctcgctcatcacc atggccatccagcaggcgcccagcaagatgctcacgctgagcgagatctaccagtggatc atggacctcttcccctattaccggcagaaccagcagcgctggcagaactccatccgccac tcgctgtccttcaatgactgcttcgtcaaggtggcacgctccccggacaagccgggcaag ggctcctactggacgctgcacccggactccggcaacatgttcgagaacggctgctacttg cgccgccagaagcgcttcaagtgcgagaagcagccgggggccggcggcgggggcgggagc ggaagcgggggcagcggcgccaagggcggccctgagagccgcaaggacccctctggcgcc tctaaccccagcgccgactcgcccctccatcggggtgtgcacgggaagaccggccagcta gagggcgcgccggcccccgggcccgccgccagcccccagactctggaccacagtggggcg acggcgacagggggcgcctcggagttgaagactccagcctcctcaactgcgccccccata agctccgggcccggggcgctggcctctgtgcccgcctctcacccggcacacggcttggca ccccacgagtcccagctgcacctgaaaggggacccccactactccttcaaccacccgttc tccatcaacaacctcatgtcctcctcggagcagcagcataagctggacttcaaggcatac gaacaggcactgcaatactcgccttacggctctacgttgcccgccagcctgcctctaggc agcgcctcggtgaccaccaggagccccatcgagccctcagccctggagccggcgtactac caagggcccattatgcaccagggaggaagtaggcgccctcctagtcccgtgctcctggaa agcggaatttctttgcatgattcagtccagctatacacccgtaacccaacagcggctcaa ccctgggcgccaactaccgctgcgtcctccgccgcttttcgcagctgggaagccatccct ggtgcccggggcgcgctgggcagaagggccgccctgcgcccctggctgagcacgcagggc tctcagcatcccggccgcgccgcccaaccccggcgcggagctggctgtacccgcaacctg gcaaggaaaaactgggccccctctcgcttccacgccagccattgcggaaaagcgccccag gaacctgggcggggaccgcgggacaccaggtcccgcccccgggccctgcagaaatctgcc atcgaaaagaagactctctcaggtccctgctctatcaatctttctcataagtacccggtg gaggcccatgaaaaagtgttggatcttggtccctga >gi568815584r:37491368_37694972|GENSCAN_predicted_peptide_4|439_aa MSAAALLQEVLGGAPRPSGLGEAAAPGKTRSFRPRDFYLRSSAFLRHQALKKPPVIASGF GTARPVVLLPPPEPPVKRRARGVLESSRHAAPRRVFHLGREREQSQEVAPLAGPCMAKER KASSVSAEDGYMEASSGRRKVRIRSNFVSESGAREAREAAGLGAQGEQESWPPSDAREAA WQALLPSRVIPTSIEEIIASLQSEAQLASDQTIKELIRSVLGQNYDITMEIFYESNKKTN QGCPMKSLSKEKKGLGAPTSDIGSMLYDIVKVGEFLLDNIVKSVFQLGSILSVTFSPIVM SCVPQLEMQKSPAYCIDLAGSCRPELFLFSHLASRYFFIATQEQPNTISSINGLLQVPPT PSQWDGDSGVSKLSTSSRSKEATRSLFPLVSMEAMWRTVMRHSYSFQLGWDQQRPSGEPK LLVPPCSNKEAPIPGASKG >gi568815584r:37491368_37694972|GENSCAN_predicted_CDS_4|1320_bp atgtccgcggccgccctcctgcaggaggtgctcggaggcgcgccgcgtccctcgggcctg ggtgaggcggcggctccaggcaagacccggtcgtttcgcccccgggacttttacttgcgg agctccgcgttcctacggcaccaggccctgaaaaagcccccagtcatcgcctcggggttc ggcacggccagacccgtggtcctgctgcctccgcccgagccacccgtgaagcgcagagcg cgcggggtcttggagagctcgcggcacgcggctcccaggcgggtcttccacttgggaagg gagcgcgaacagagccaggaggtagctcctctcgcagggccctgcatggccaaagagagg aaggccagctccgtctccgcagaggacggctacatggaggccagcagcgggcggaggaaa gtgaggatccgcagcaacttcgtgagcgagagcggggcccgcgaggcgcgggaagcggcg gggttaggagcccagggagaacaggagagctggccgcccagcgacgcgcgggaggccgcc tggcaggcgctgctgccctcccgcgtgatccccacgtccatcgaagagatcatcgcctct ctgcagtccgaggcccagctggcctccgaccagaccatcaaagagctcatacggagcgtc cttggccagaactacgacattacaatggaaattttctatgaatctaataagaaaacaaat cagggatgcccaatgaagagccttagcaaagagaagaaaggcctcggtgcccctacttct gacataggctccatgctctacgacatagttaaggttggggaatttctcctggataatatc gtgaagagtgttttccaacttggttccattctctccgtcactttcagtcccattgtgatg agctgtgtacctcagttggaaatgcagaaatcacctgcctactgcattgatctcgctggg agctgcagaccggagctgttcctattcagccatcttgccagcaggtatttctttatagca acacaagaacagcctaatacaatttccagtatcaacgggctgttacaagttccccctacc ccttcccagtgggatggagacagtggagtgtctaaactttcaacatcgtccagaagtaaa gaggccactcgctccctcttccctctggtgtcaatggaggccatgtggagaacagtaatg agacactcctattctttccaactaggatgggatcagcagaggcctagtggggagccaaaa ctcctagtcccaccatgcagtaacaaggaggcccccatccctggtgccagtaaaggctga >gi568815584r:37491368_37694972|GENSCAN_predicted_peptide_5|262_aa MKEAEGSLVEEKNQENVFGGNQVKKHFKKDAITVSNTVKSRFLKPESRKEALVCNFMLCG SAFYLRPWALSGLWFLCLASWVSPLHGQTFSQSLKDPMPISGALSLCSALISTLFLTNSS PFVLLSSDFCLFNLDISLMGKMYLKTSPMQAETPEIQAEYKFQMGAEESQMSVHKELSET MSSILQIEQEDIEWGPSEAESIVFKPQEISQVQPAEELSKPLEDGQPTSDSKEAKWVSLT VRGKPTPLKDESQDKQHLPQAD >gi568815584r:37491368_37694972|GENSCAN_predicted_CDS_5|789_bp atgaaagaagctgaggggagcctggtagaagagaaaaaccaggagaatgtgtttggtgga aaccaagtgaaaaaacatttcaagaaggatgcaataactgtgtcaaatactgtcaagtct agatttttgaagcccgaaagccggaaagaggctcttgtctgtaactttatgctctgcggt agtgccttttacctgaggccatgggccctcagtggtctatggtttctttgtctagcctca tgggtttctccattgcatgggcagactttcagccaaagcctcaaggaccctatgccgatt tctggagctctttctctatgcagtgccctcatctccacactgttccttacaaattctagc ccctttgtcctcctctcatctgatttctgtctcttcaatttagatatttctttaatggga aaaatgtacctcaaaacatcacctatgcaagcagaaacacctgagatacaagcagagtat aaattccagatgggtgctgaggaatcgcaaatgagcgtgcataaggaactttctgaaacc atgtcaagtattctccagattgaacaagaggatatagaatggggaccctcagaagcagaa agcatagtatttaaacctcaggaaatttcgcaagttcagccagcagaggaattaagtaaa cctttggaagatggacagcccacaagtgattcaaaagaagccaagtgggtgtctttaacg gtcagagggaagcccactcccctgaaggatgagtcccaggacaagcagcatttaccacaa gctgactga