GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:00:19

Sequence gi568815586r:110335158_110545552 : 210395 bp : 44.81% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4346   4564  219  1  0   37   55   163 0.518   6.40
 1.02 Intr +   5502   5837  336  2  0   43   89   480 0.903  39.32
 1.03 Intr +   7071   7291  221  2  2  120   82   363 0.999  36.00
 1.04 Intr +   8075   8277  203  2  2   25   68   166 0.995   7.23
 1.05 Intr +   9729   9814   86  1  2   69   38    68 0.907  -0.56
 1.06 Intr +  10092  10225  134  2  2   42   92    99 0.599   5.14
 1.07 Intr +  10844  10961  118  2  1   99  100   115 0.990  14.27
 1.08 Intr +  11044  11259  216  0  0   71   18   288 0.226  18.80
 1.09 Intr +  14212  14466  255  0  0   77   64    62 0.196   0.34
 1.10 Intr +  15135  15175   41  2  2  124   73    38 0.189   2.92
 1.11 Term +  19677  19884  208  0  1   27   39   154 0.147   1.21
 1.12 PlyA +  19928  19933    6                               1.05

 2.35 PlyA -  24696  24691    6                               1.05
 2.34 Term -  35266  35067  200  2  2  -22   37   168 0.508  -1.74
 2.33 Intr -  35999  35775  225  0  0  112   71    80 0.601   6.66
 2.32 Intr -  37040  36922  119  1  2   76   70    25 0.166  -0.39
 2.31 Intr -  39176  38991  186  1  0  110   45   316 0.298  28.30
 2.30 Intr -  41059  40909  151  2  1  107  102   103 0.999  12.82
 2.29 Intr -  42460  42236  225  2  0  105   83   101 0.959   9.26
 2.28 Intr -  46791  46595  197  2  2  108  116    41 0.989   7.96
 2.27 Intr -  47803  47686  118  2  1  124   51    94 0.566   8.82
 2.26 Intr -  51312  51170  143  2  2   -2  115    69 0.917   0.60
 2.25 Intr -  52735  52582  154  2  1   87   85    56 0.659   4.33
 2.24 Intr -  53466  53355  112  0  1   66   48    70 0.729   0.75
 2.23 Intr -  60063  59944  120  0  0   46  106    52 0.913   3.49
 2.22 Intr -  61273  61109  165  1  0   73   91    84 0.547   7.36
 2.21 Intr -  66301  66196  106  0  1   57   81    44 0.512   0.82
 2.20 Intr -  68663  68370  294  1  0   29   86   234 0.587  13.42
 2.19 Intr -  70451  70319  133  0  1   84   74    40 0.625   1.90
 2.18 Intr -  70814  70704  111  0  0   79   91    34 0.677   3.15
 2.17 Intr -  73378  73297   82  1  1   75  115    66 0.944   7.21
 2.16 Intr - 101047 100953   95  2  2   60  116    30 0.906   2.68
 2.15 Intr - 101526 101400  127  0  1   77   95    41 0.980   3.95
 2.14 Intr - 101995 101927   69  1  0  120   86   -20 0.562   0.38
 2.13 Intr - 105231 105155   77  1  2   77   94   140 0.995  12.73
 2.12 Intr - 110394 110295  100  0  1   77   89    21 0.518   0.78
 2.11 Intr - 118714 118586  129  1  0   38   86   139 0.995   9.59
 2.10 Intr - 120525 120429   97  2  1   70  107    61 0.997   6.21
 2.09 Intr - 120773 120658  116  2  2   96   81   104 0.991   9.75
 2.08 Intr - 122477 122353  125  0  2   47   47    74 0.890  -0.50
 2.07 Intr - 124705 124538  168  2  0   76   68   210 0.929  17.72
 2.06 Intr - 130057 129949  109  1  1   83  119   163 0.619  18.76
 2.05 Intr - 133830 133570  261  0  0   58   52   225 0.455  13.58
 2.04 Intr - 156965 156878   88  1  1  104    9    66 0.008   0.27
 2.03 Intr - 158074 157839  236  1  2   60   92   121 0.911   6.19
 2.02 Intr - 161046 160855  192  0  0   23   49   151 0.135   4.39
 2.01 Init - 167191 167132   60  1  0   81   49    86 0.059   5.48
 2.00 Prom - 169506 169467   40                              -2.06

 3.00 Prom + 170135 170174   40                              -3.86
 3.01 Init + 183059 183066    8  1  2   40  103     0 0.317  -2.87
 3.02 Intr + 184637 184759  123  2  0   69  116    36 0.793   4.20
 3.03 Intr + 187020 187254  235  0  1   73   93    67 0.825   3.39
 3.04 Term + 192215 192364  150  1  0  103   53    77 0.759   3.61
 3.05 PlyA + 192486 192491    6                               1.05

 4.04 PlyA - 195830 195825    6                               1.05
 4.03 Term - 201938 201880   59  0  2   97   37    61 0.700  -0.25
 4.02 Intr - 203116 202987  130  1  1   72   99   114 0.997  11.17
 4.01 Intr - 204788 204665  124  1  1  106   86   119 0.778  14.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  93684  93840  157  2  1   80   53   130 0.805   8.85
S.002 Term - 100060  99998   63  1  0   72   49    42 0.834  -3.41
S.003 Init - 125223 125196   28  0  1   34   89     1 0.855  -5.44
S.004 Sngl + 125831 126079  249  1  0   75   41   356 0.997  24.68
S.005 Term - 129382 129372   11  2  2  108   40    12 0.925  -3.24
S.006 Intr + 151168 151269  102  0  0   95   32   101 0.836   5.57
S.007 Term + 151377 151547  171  2  0   58   54    93 0.825   0.73
S.008 Term - 156965 156848  118  1  1  104   42    71 0.966   2.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:110335158_110545552|GENSCAN_predicted_peptide_1|678_aa
GAPEGVIDRCTHIRVGSTKVPMTSGVKQKIMSVIREWGSGSDTLRCLALATHDNPLRREE
MHLEDSANFIKYETNLTFVGCVGMLDPPRIEVASSVKLCRQAGIRVIMITGDNKGTAVAI
CRRIGIFGQDEDVTSKAFTGREFDELNPSAQRDACLNARCFARVEPSHKSKIVEFLQSFD
EITAMTGDGVNDAPALKKAEIGIAMGSGTAVAKTASEMVLADDNFSTIVAAVEEGRAIYN
NMKQFIRYLISSNVGEVVCIFLTAALGFPEALIPVQLLWVNLVTDGLPATALGFNPPDLD
IMNKPPRNPKEPLISGWLFFRYLAIGCYVGAATVGAAAWWFIAADGGPRVSFYQLSHFLQ
CKEDNPDFEGVDCAIFESPYPMTMALSVLVTIEMCNALNSLSENQSLLRMPPWENIWLVG
SICLSMSLHFLILYVEPLPLIFQITPLNVTQWLMVLKISLPVILMDETLKFVARNYLEPG
KECVQPATKSCSFSACTDGISWPFVLLIMPLVAEGPAISVACCHPVPPLASLSFAQKTNN
HTYPNWDTTLQNADDPFWRKLSLELSELPGKQGIWPTSLTTAAPTSPRTGASALTEQYWS
NRFLNHFAEIKKGLLGEMADSRDGTENVQYVPEEYQSTRKEGLKNSNQRCGYVKGTQEPT
EKATTAEECEQQNKAVLD

>gi568815586r:110335158_110545552|GENSCAN_predicted_CDS_1|2037_bp
ggtgctcctgaaggtgtcattgacaggtgcacccacattcgagttggaagtactaaggtt
cctatgacctctggagtcaaacagaagatcatgtctgtcattcgagagtggggtagtggc
agcgacacactgcgatgcctggccctggccactcatgacaacccactgagaagagaagaa
atgcaccttgaggactctgccaactttattaaatatgagaccaatctgaccttcgttggc
tgcgtgggcatgctggatcctccgagaatcgaggtggcctcctccgtgaagctgtgccgg
caagcaggcatccgggtcatcatgatcactggggacaacaagggcactgctgtggccatc
tgtcgccgcatcggcatcttcgggcaggatgaggacgtgacgtcaaaagctttcacaggc
cgggagtttgatgaactcaacccctccgcccagcgagacgcctgcctgaacgcccgctgt
tttgctcgagttgaaccctcccacaagtctaaaatcgtagaatttcttcagtcttttgat
gagattacagctatgactggcgatggcgtgaacgatgctcctgctctgaagaaagccgag
attggcattgctatgggctctggcactgcggtggctaaaaccgcctctgagatggtcctg
gcggatgacaacttctccaccattgtggctgccgttgaggaggggcgggcaatctacaac
aacatgaaacagttcatccgctacctcatctcgtccaacgtcggggaagttgtctgtatt
ttcctgacagcagcccttggatttcccgaggctttgattcctgttcagctgctctgggtc
aatctggtgacagatggcctgcctgccactgcactggggttcaaccctcctgatctggac
atcatgaataaacctccccggaacccaaaggaaccattgatcagcgggtggctctttttc
cgttacttggctattggctgttacgtcggcgctgctaccgtgggtgctgctgcatggtgg
ttcattgctgctgacggtggtccaagagtgtccttctaccagctgagtcatttcctacag
tgtaaagaggacaacccggactttgaaggcgtggattgtgcaatctttgaatccccatac
ccgatgacaatggcgctctctgttctagtaactatagaaatgtgtaacgccctcaacagc
ttgtccgaaaaccagtccttgctgaggatgcccccctgggagaacatctggctcgtgggc
tccatctgcctgtccatgtcactccacttcctgatcctctatgtcgaacccttgccactc
atcttccagatcacaccgctgaacgtgacccagtggctgatggtgctgaaaatctccttg
cccgtgattctcatggatgagacgctcaagtttgtggcccgcaactacctggaacctggt
aaagagtgtgtgcagcctgccaccaaatcctgctcgttctcggcatgcaccgatgggatt
tcctggccgtttgtgctgctcataatgcccctggtggctgaaggcccagccatcagtgtc
gcttgttgccaccccgtgcctcccttggcctctctgagctttgcccagaagaccaacaat
catacataccctaactgggacaccactctgcagaatgcagatgatccattctggaggaag
ctgtcccttgagctcagtgagctcccaggcaagcagggcatctggccgacttccctcaca
acagctgctcccacatcccctcggactggagcttcagccctgactgagcaatactggagt
aaccgcttcctaaaccattttgcagaaattaaaaaagggctccttggagaaatggctgat
tctagagatgggacagaaaacgtacagtacgtgcctgaagaataccaaagcaccagaaag
gaagggctcaaaaacagcaatcaaagatgtggttatgtgaaaggaacacaggagccaact
gaaaaagccacaacagccgaagaatgtgagcaacaaaataaagcagtattggactag

>gi568815586r:110335158_110545552|GENSCAN_predicted_peptide_2|1629_aa
MAAIRPKKQARKARLSSTAQLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQHILCTGNLC
TKESYDYLKTLAGDVHIVRGDFDENLNYPEQKVVTVGQFKIGLIHGHQVIPWGDMASLAL
LQRQFDVDILISGHTHKFEAFEHENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVV
TYVYQLIGDDVKRAALQKSYAPSSRTPDPGRASPRRRDREPCAGAAVPASLPRQHACVRL
LVFRRGGSDAGFSPLRSRSDTARTASGERGGQIRDPGVTSTYCATMVQHCEALNRSVQVV
NLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGH
VEDDYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILA
ALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKA
ICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKAYHSSLMDPDTKL
IGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIKNEADRTLIYITLYI
SECLKKLQKCNSKSQGEKEMYTLGITNFPIPGEPGFPLNAIYAKPANKQEDEVMRAYLQQ
LRQETGLRLCEKVFDPQNDKPSKWLKTDTARSPRKPTGPSQTLWVTLTVEGKVEWTNGLL
KTHLTKLSLQLKKDSVKDRAPKLTNQAGSHTPPPIPLEAALKNIAHYLSIPPPKIFAAAT
LYDYFVLFFLLTTTTGAGTRVSFYSGTSVVTSCLGVVPRVDPMDPGDAAILESSLRILYR
LFESVLPPLPAALQSRMNVIDHVRDMAAAGLHSNVRLLSSLLLTMSNNNPTLGYSHFSTL
RITHQLKAQWLCAISTHMLIECVEEKYQLLVYHADSLFHDKEYRNAVSKYTMALQQKKAL
SKTSKVRPSTGNSASTPQSQCLPSEIEVKYKMAECYTMLKQDKDAIAILDGIPSRQRTPK
INMMLANLYKKAGQERPSVTSYKEVLRQCPLALDAILGLLSLSVKGAEVASMTMNVIQTV
PNLDWLSVWIKAYAFVHTGDNSRAISTICSLEKKSLLRDNVDLLGSLADLYFRAGDNKNS
VLKFEQAQMLDPYLIKGMDVYGYLLAREGRLEDVENLGCRLFNISDQHAEPWVVSGCHSF
YSKRYSRALYLGAKAIQLNSNSVQALLLKGAALRNMGRVQEAIIHFREAIRLAPCRLDCY
EGLIECYLASNSIREAMVMANNVYKTLGANAQTLTLLATVCLEDPVTQEKAKTLLDKALT
QRPDYIKAVVKKAELLSREQKYEDGIALLRNALANQSDCVLHRILGDFLVAVNEYQEAMD
QYSIALSLDPNDQKSLEGMQKMEKEESPTDATQEEDVDDMEGSGEEGDLEGSDSEAAQWA
DQEQWFGMHPAESPCALQAPAAQHLLGRILILCSSSPRVTARLEEAAEGVPWKSIEKPAR
GELEVPNSCRPDPRGPLGVNSAAVRGWEEPASKGLTCGFLRSQPYTGLPVAGQGLRGAAS
QSLGGTTTVENSLAIQKFKQYYYPRIQCSTSGCTPEEVKAGTPKDCTSMSTQHCPQQQKV
ETTHVSLDE

>gi568815586r:110335158_110545552|GENSCAN_predicted_CDS_2|4890_bp
atggctgccatccgccctaaaaagcaggctcggaaggcgcgcctcagctcgacggcccag
ttggtgttggtattaggagatctgcacatcccacaccggtgcaacagtttgccagctaaa
ttcaaaaaactcctggtgccaggaaaaattcagcacattctctgcacaggaaacctttgc
accaaagagagttatgactatctcaagactctggctggtgatgttcatattgtgagagga
gacttcgatgagaatctgaattatccagaacagaaagttgtgactgttggacagttcaaa
attggtctgatccatggacatcaagttattccatggggagatatggccagcttagccctg
ttgcagaggcaatttgatgtggacattcttatctcgggacacacacacaaatttgaagca
tttgagcatgaaaataaattctacattaatccaggttctgccactggggcatataatgcc
ttggaaacaaacattattccatcatttgtgttgatggatatccaggcttctacagtggtc
acctatgtgtatcagctaattggagatgatgtgaaacgggcggctctgcagaagagctac
gctccgtccagtcggaccccggaccctggccgggcatctccgcggcgccgagaccgcgag
ccgtgtgcgggagcagctgtcccagcatccctcccccgacagcacgcgtgcgtcaggctg
ctggtattccggagaggcggctcggatgctggcttctcgcccctccgaagccgttcggac
actgcccgaacagcttcgggagagcggggcgggcagatccgcgaccccggagttacgagc
acctactgtgccaccatggtccagcactgtgaagccctcaaccggtctgtccaagttgta
aacctggatccagcagcagaacacttcaactactccgtgatggctgacatccgggaactg
atcgaggtggatgatgtaatggaggatgattctctgcgattcggtcccaacggaggattg
gtattttgcatggagtactttgccaataattttgactggctggagaactgtcttggccat
gtagaggacgactatatcctttttgattgtccaggtcagattgagttgtacactcacctg
cctgtgatgaaacagctggtccagcagctcgagcagtgggagttccgagtctgtggagtt
tttcttgttgattctcagttcatggtggagtcattcaagtttatttctggcatcttggca
gccctgagtgccatgatctctctagaaattccgcaagtcaacatcatgacaaaaatggat
ctgctgagtaaaaaagcaaaaaaggaaattgagaaatttttagatccagacatgtattct
ttattagaagattctacaagtgacttaagaagcaaaaaattcaagaaactgactaaagct
atatgtggactgattgatgactacagcatggttcgatttttaccttacgatcagtcagat
gaagaaagcatgaacattgtattgcagcatattgattttgccattcaatatggagaagac
ctagaatttaaagaaccaaaggcttaccactcttctctcatggatcctgataccaaactc
atcggaaacatggcactgttgcctatcagaagtcaattcaaaggacctgcccccagagag
acaaaagatacagatattgtggatgaagccatctattacttcaaggccaatgtcttcttc
aaaaactatgaaattaagaatgaagctgataggaccttgatatatataactctctacatt
tctgaatgtctgaagaaactgcaaaagtgcaattccaaaagccaaggtgagaaagaaatg
tatacgctgggaatcactaattttcccattcctggagagcctggttttccacttaacgca
atttatgccaaacctgcaaacaaacaggaagatgaagtgatgagagcctatttacaacag
ctaaggcaagagactggactgagactttgtgagaaagttttcgaccctcagaatgataaa
cccagcaagtggctgaagactgacactgcccgatcacctcggaagcctacaggaccatca
cagacgctttgggtaactcttacagtggaaggaaaggtagaatggactaatggtctttta
aaaacacacctcaccaagctcagcctccaacttaaaaaggactctgtcaaggatagagcc
ccaaaactcaccaaccaagcaggttcccacacaccacccccaatcccacttgaagcagcc
ctgaaaaacatcgcccattatctctccataccacccccaaaaattttcgctgccgcaaca
ctttacgactatttcgttttattcttcttgttaacgaccacgacaggagcagggactcga
gtttctttttattcgggcacttcagtagttacctcttgcttgggggtggtgccgcgtgtg
gacccgatggaccccggcgacgccgccattttggagtcttccctaaggatcctctaccgg
cttttcgagtcagtgctgccgccgctgcccgcggctttgcagagcaggatgaatgtgata
gaccacgtgcgggacatggcggccgcggggctgcactccaacgtgcggctcctcagcagc
ttgttacttacaatgagtaataacaacccgactctcgggtactcccacttttctaccctt
cgcatcacccaccaattgaaagcgcagtggctgtgcgcaataagcactcacatgcttatt
gagtgtgttgaggaaaagtaccagcttttggtgtatcatgcagattctctctttcatgat
aaggaatatcggaatgctgtgagtaagtataccatggctttacagcagaagaaagcgcta
agtaaaacttcaaaagtgagaccttcaactggaaattctgcatctactccacaaagtcag
tgtcttccatctgaaattgaagtgaaatacaaaatggctgaatgttatacaatgctaaaa
caagataaagatgccattgctatacttgatgggatcccttcaagacaaagaactcccaaa
ataaacatgatgctggcaaacctgtacaagaaggctggtcaggagcgcccttcagtcacc
agctataaggaggtgctgaggcagtgcccattagcccttgatgccattctaggcttgttg
tccctttctgtaaaaggggcagaggtggcatccatgacaatgaatgtgatccaaaccgtg
cctaacttggactggctctctgtgtggatcaaagcgtatgcttttgtgcacactggtgac
aactcaagagcaatcagtaccatctgttcactagagaaaaaatccttattgcgagataac
gtggacctattgggaagcttggcagatctgtacttcagagctggagacaataaaaactct
gtcctcaagtttgaacaggcacagatgttggatccttatctgataaaaggaatggatgta
tatggctacctactggcacgagaagggcggctagaggatgttgagaaccttggatgccgc
cttttcaatatctctgatcagcatgcagaaccgtgggtggtttctggctgtcacagcttc
tatagcaaacgctactcccgggccctctatttaggagccaaggccattcagctgaacagt
aatagtgttcaagctctgctacttaagggagcagcacttaggaacatgggcagagtccaa
gaagcaataatccactttcgggaggccatacggctcgcaccttgtcgcttagattgttat
gaaggtcttatcgaatgttacttagcctccaacagtattcgagaagcaatggtaatggct
aacaacgtttacaaaactctgggagcaaatgcacagacccttacccttttagccaccgtt
tgtcttgaagacccagtgacacaggagaaagccaaaacattattagataaagccctgacc
caaaggccagattacattaaggctgtggtgaaaaaagcagaactacttagcagagaacag
aaatatgaagatggaattgctttgctgaggaacgcactggctaatcagagtgactgtgtc
ctgcatcggatcctaggagatttccttgtagctgtcaatgagtatcaggaggcaatggac
cagtatagtatagcactaagtttggaccccaatgaccagaagtctctagaggggatgcag
aagatggagaaggaggagagtcccacggatgccactcaggaggaggatgtggacgacatg
gaagggagtggggaagaaggggacctggagggcagcgacagtgaggcggcccagtgggct
gaccaggagcagtggttcggcatgcaccctgctgagagcccctgtgctctccaggctcct
gcagcccagcacctgctaggacggatcctcattctctgcagctccagccccagggtgaca
gccaggttggaagaggcggctgaaggtgttccctggaaaagtattgagaaacctgcgaga
ggggagctcgaagttcccaacagctgcaggcctgaccccagagggcccttgggagttaac
tctgcagctgtccgtgggtgggaggaaccggcctccaaaggcctcacctgtggcttcctg
cgttcccagccctacactggcttacctgtggcaggccagggcctgcgcggagctgcttcc
cagagcttaggtggcacaaccactgtggagaacagtctggcgattcagaaattcaaacag
tactactaccctaggatccagtgttccacctccgggtgtacacccgaagaagtgaaagct
ggaacgccaaaagattgcacgtccatgtccacgcagcactgtccacagcagcaaaaggtg
gaaacaacccacgtgtccttagatgagtag

>gi568815586r:110335158_110545552|GENSCAN_predicted_peptide_3|171_aa
MCIPLALSIDDMLVEANFILATLADEQSRASSPQSLCLSQKRKRSDLIEKKAGKNVTGQA
LECISKKAAPRRLYPKETLTNISALENCGSPAMKRVDGDVSEVSESSVSNTEEVPGSLCL
RKCDITSSEVHGLSTWTSGNHFPACHSQYHGYLESRFNSCKASLAEGIFTF

>gi568815586r:110335158_110545552|GENSCAN_predicted_CDS_3|516_bp
atgtgcatacctctggctttgagtattgatgatatgttagtggaagctaactttattttg
gccacattagctgatgaacaaagtagagcatcttcaccacagtcactgtgtctttcacag
aaacgaaaaaggtcagatctgattgaaaaaaaggctggcaaaaatgtaactggccaggcc
ctggaatgtatttcaaaaaaagcagcaccaagaaggctttatcctaaggagactctcaca
aacatatctgcattggaaaactgtggcagccctgcaatgaaaagagtggatggagatgtc
agtgaagtatcagaaagcagtgtcagcaacacagaggaagtgccagggtctctgtgtctc
agaaagtgtgacataacatcttcagaggttcatgggcttagtacatggacctctgggaac
cattttcctgcctgccacagccagtatcacggctatctggagagcaggttcaacagctgc
aaggcaagccttgcagagggcatcttcaccttctga

>gi568815586r:110335158_110545552|GENSCAN_predicted_peptide_4|104_aa
XPDAADSTSFDVQLGDIILTATDGLFDNMPDYMILQELKKLKNSNYESIQQTARSIAEQA
HELAYDPNYMSPFAQFACDNGLNVRGGKPDDITVLLSIVAEYTD

>gi568815586r:110335158_110545552|GENSCAN_predicted_CDS_4|315_bp
nntccggatgctgctgatagcacgtctttcgatgtccagctaggagacattatcctgacg
gcaacagatggactctttgacaacatgcctgattatatgattcttcaggagctaaaaaag
ttaaagaattcaaattatgagagtatacaacagactgccagaagcattgctgagcaagct
catgagctggcctatgacccaaattatatgtcaccttttgcacagtttgcatgtgacaat
ggattgaatgtgagaggtggaaagccagatgacatcaccgtccttctttcaatagtggct
gagtatacagactag