GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:17:19 Sequence gi568815597r:58530846_58731530 : 200685 bp : 40.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2332 2489 158 0 2 75 92 76 0.099 6.13 1.02 Intr + 16166 16322 157 2 1 40 -1 160 0.460 1.39 1.03 Intr + 17958 18189 232 2 1 90 -25 150 0.289 0.32 1.04 Intr + 19756 19964 209 2 2 91 67 135 0.975 9.57 1.05 Term + 20050 20244 195 0 0 50 42 183 0.991 6.33 1.06 PlyA + 21873 21878 6 1.05 2.02 PlyA - 22818 22813 6 1.05 2.01 Sngl - 46311 45340 972 0 0 111 43 1786 0.934 170.88 2.00 Prom - 46869 46830 40 -8.05 3.02 PlyA - 47231 47226 6 1.05 3.01 Sngl - 48843 48760 84 0 0 93 48 250 0.996 10.18 3.00 Prom - 52009 51970 40 -3.65 4.00 Prom + 53677 53716 40 -5.75 4.01 Init + 55262 55340 79 1 1 81 92 82 0.959 9.17 4.02 Intr + 55604 55864 261 0 0 65 58 137 0.602 4.94 4.03 Intr + 58142 58191 50 0 2 54 50 52 0.011 -4.52 4.04 Term + 62506 62637 132 0 0 127 49 69 0.432 4.01 4.05 PlyA + 63792 63797 6 1.05 5.04 PlyA - 65431 65426 6 1.05 5.03 Term - 67503 67345 159 0 0 78 55 102 0.097 2.86 5.02 Intr - 70658 70514 145 1 1 76 17 125 0.044 3.56 5.01 Init - 73363 73281 83 1 2 67 98 30 0.060 2.39 5.00 Prom - 76503 76464 40 -5.35 6.07 PlyA - 77229 77224 6 1.05 6.06 Term - 80763 80642 122 1 2 77 54 75 0.187 0.66 6.05 Intr - 89070 88917 154 0 1 125 107 26 0.581 6.92 6.04 Intr - 89410 89344 67 0 1 57 93 60 0.633 1.39 6.03 Intr - 92957 92801 157 0 1 41 85 95 0.280 2.65 6.02 Intr - 93817 93653 165 2 0 35 44 125 0.264 1.81 6.01 Init - 93946 93844 103 1 1 60 80 142 0.673 9.08 6.00 Prom - 97258 97219 40 -5.75 7.17 PlyA - 97375 97370 6 1.05 7.16 Term - 100619 100205 415 1 1 50 48 395 0.241 25.45 7.15 Intr - 109974 109897 78 2 0 71 76 57 0.000 0.55 7.14 Intr - 125391 125236 156 2 0 97 83 77 0.452 6.30 7.13 Intr - 130666 130561 106 2 1 105 77 48 0.983 3.75 7.12 Intr - 134786 134654 133 2 1 85 97 46 0.984 4.50 7.11 Intr - 136381 136193 189 1 0 43 67 190 0.995 11.26 7.10 Intr - 137076 137002 75 0 0 78 100 40 0.883 2.99 7.09 Intr - 137837 137787 51 2 0 93 87 19 0.575 0.49 7.08 Intr - 142805 142728 78 2 0 78 78 58 0.844 2.63 7.07 Intr - 144735 144632 104 1 2 36 103 94 0.736 4.67 7.06 Intr - 146211 146081 131 2 2 60 43 45 0.452 -3.38 7.05 Intr - 151700 150940 761 2 2 11 95 653 0.752 47.63 7.04 Intr - 154406 154308 99 2 0 46 121 60 0.956 4.49 7.03 Intr - 156989 156855 135 2 0 42 37 118 0.760 1.94 7.02 Intr - 158271 158193 79 2 1 117 59 66 0.915 5.23 7.01 Init - 169207 169140 68 1 2 107 72 121 0.985 13.00 7.00 Prom - 169921 169882 40 -7.25 8.05 PlyA - 169982 169977 6 1.05 8.04 Term - 179195 179176 20 0 2 133 39 -17 0.249 -4.60 8.03 Intr - 179574 179440 135 1 0 94 78 79 0.562 7.12 8.02 Intr - 184009 183903 107 0 2 82 75 106 0.765 7.64 8.01 Intr - 189109 188931 179 1 2 8 63 139 0.306 1.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 16065 16114 50 2 2 94 97 32 0.889 3.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_1|316_aa MNRHVFTGAYINLNSMIYKKSAAKNQVTVVKGRNTGLTYRRPTLKSQFGYFVSLYGSTGQ KGRDSKTAKRRRLPGSESAFIEIVFGVWEQCFHMIFGGTHFKKDEDPVIPMDIIKPSPVI ASPTVSPSLQRRAFSELLSDASAPHASAFPMAIEYDASSGPPMEHSPLSAFGLYTQKKFF QANHPTSSPPVSTGPLKSRCQGGIRCARHIYCVENPGKDNGAGAGIDRKGPQIMTLVKGE RERRKTGLEAPQRWASTSVHTMHDHWLRAVYGEHALVTNGVGEPEGQGQGLAVNYAPQSR SERHIFPVTAAGGFRV >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_1|951_bp atgaacagacatgtattcacaggagcatacatcaacttgaattctatgatctataagaaa agcgctgctaaaaaccaggtaactgtagtgaaaggaagaaacactggactaacatacaga agacccacgttaaagtcccagttcggttactttgtcagtctctacggcagtaccggccaa aagggaagagactcgaagactgccaaaagacgaaggcttcctggttctgaatctgcattc atagaaatagtctttggagtatgggaacagtgctttcatatgatttttggagggacgcat tttaagaaagatgaagatcctgttatcccaatggatattatcaagcccagccctgtgatt gcctctcctaccgtcagcccttccctccagaggagagccttctctgaacttcttagtgat gctagtgcccctcacgccagtgcattccccatggccatagaatatgatgcatcctctggg cctcccatggaacattcccctctgtctgccttcgggctctacacacaaaaaaaattcttt caggcaaatcaccccacttctagtccacctgtatcaacaggccctctaaaaagcagatgc cagggtggaatcagatgtgcaagacatatttactgcgtggaaaatcctgggaaggataat ggagcaggagcaggaatagacaggaagggccctcagatcatgacactggttaaaggagag agggaaagaagaaagactggattagaagcacctcagagatgggcatccactagtgttcac accatgcatgatcattggctaagagcagtctatggggagcatgcgcttgttacaaatgga gttggggagccagaaggtcaggggcagggcctggcagtcaactatgctccccagagcaga tctgagcggcacattttcccggtcacggctgctggaggatttagagtttga >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_2|323_aa MARGPGLAPPPLRLPLLLLVLAAVTGHTAAQDNCTCPTNKMTVCSPDGPGGRCQCRALGS GMAVDCSTLTSKCLLLKARMSAPKNARTLVRPSEHALVDNDGLYDPDCDPEGRFKARQCN QTSVCWCVNSVGVRRTDKGDLSLRCDELVRTHHILIDLRHRPTAGAFNHSDLDAELRRLF RERYRLHPKFVAAVHYEQPTIQIELRQNTSQKAAGDVDIGDAAYYFERDIKGESLFQGRG GLDLRVRGEPLQVERTLIYYLDEIPPKFSMKRLTAGLIAVIVVVVVALVAGMAVLVITNR RKSGKYKKVEIKELGELRKEPSL >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_2|972_bp atggctcggggccccggcctcgcgccgccaccgctgcggctgccgctgctgctgctggtg ctggcggcggtgaccggccacacggccgcgcaggacaactgcacgtgtcccaccaacaag atgaccgtgtgcagccccgacggccccggcggccgctgccagtgccgcgcgctgggctcg ggcatggcggtcgactgctccacgctgacctccaagtgtctgctgctcaaggcgcgcatg agcgcccccaagaacgcccgcacgctggtgcggccgagtgagcacgcgctcgtggacaac gatggcctctacgaccccgactgcgaccccgagggccgcttcaaggcgcgccagtgcaac cagacgtcggtgtgctggtgcgtgaactcggtgggcgtgcgccgcacggacaagggcgac ctgagcctacgctgcgatgagctggtgcgcacccaccacatcctcattgacctgcgccac cgccccaccgccggcgccttcaaccactcagacctggacgccgagctgaggcggctcttc cgcgagcgctatcggctgcaccccaagttcgtggcggccgtgcactacgagcagcccacc atccagatcgagctgcggcagaacacgtctcagaaggccgccggtgacgtggatatcggc gatgccgcctactacttcgagagggacatcaagggcgagtctctattccagggccgcggc ggcctggacttgcgcgtgcgcggagaacccctgcaggtggagcgcacgctcatctattac ctggacgagattcccccgaagttctccatgaagcgcctcaccgccggcctcatcgccgtc atcgtggtggtcgtggtggccctcgtcgccggcatggccgtcctggtgatcaccaaccgg agaaagtcggggaagtacaagaaggtggagatcaaggaactgggggagttgagaaaggaa ccgagcttgtag >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_3|27_aa MDCRRGWQEHDDDDDDDDDDDDNDDDK >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_3|84_bp atggactgccggagaggttggcaggagcacgatgatgatgatgatgatgatgacgacgac gacgacaacgatgatgacaaatga >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_4|173_aa MASQENPSKDKALILSDNLLPLDGMDNISSHIGPGLSVLRGSTALLLSAVMSHKTHTRDG SRGEEEWERGKVQKSRKNEAGRKEETSLFSNSGMCGPKTKGKKVLVVPLTLVGLGKAANA LTVLREASSQKYLLRTYYVLVSVEVLEKKAVNNRDACPLGAYIPVMTGEFDLG >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_4|522_bp atggcctctcaagagaatccaagtaaggataaggctttgattttatccgacaacttacta cctttagatggtatggataacatctccagtcacatcggtccagggctgagtgtgctgagg ggctcaactgcactcctactctcagccgtcatgagtcacaagactcacaccagggatgga agtagaggcgaggaggagtgggagagaggaaaagttcagaagtctaggaagaatgaggca ggcaggaaagaggagacttcattgttctctaattctgggatgtgtgggcccaagacaaag ggcaagaaagtccttgttgttcctctgaccttagtgggattaggtaaagcagccaatgct ctaactgttctcagggaagcctcaagtcagaaatacttattaagaacctactatgtgcta gtatcagtcgaggtgctagagaagaaagcagtgaacaacagagatgcttgcccccttgga gcttatatcccagtgatgactggggaatttgacttgggatag >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_5|128_aa MAELVKIHQPSIHCLQETHLTHKDSNLRARKNNLKFHMEPKERPDIAKARLSKRKKSGGI TLLDFKLYYKAVVTKTGAGSQIGKLPLQQLSARIKNPPKPANSHNLETEIPRKPPKSRSA LWRPDFRP >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_5|387_bp atggcagaattggtaaaaatccaccaaccaagtatccactgtcttcaagagactcaccta acacataaggactcaaacttaagagccaggaaaaacaatctgaaatttcatatggaacca aaagaaagacccgacatagccaaagcaagactgagcaaaaggaaaaaatctggaggcatc acattgctggacttcaagctatactacaaggctgtggttaccaaaacaggagctggcagc caaattgggaagctgcccctgcagcagctctctgccaggatcaaaaacccaccaaaacca gcaaacagccacaacctagaaactgaaatacccaggaagccaccaaaatcaagaagtgcc ttgtggcggccagacttccgtccatga >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_6|255_aa MKLRTLAVSATALKVARLEFVPFDVRMCSEFLSSGVKLQTFAVSVTALNALRLELFVPPG GLMVSLASGVKLQIFTVSVTAHKSSVDPKTLGWSMGLGAVEQGAALIGEAWAAQEPMEGV GGSGMAGCRSRALPRGKAAKAWREVDSESWRFWCKVANSVPTYKVLFHPALECVPVLTTL ALVPPAACTIPTAPLILCQAPEILGFPSSPVSTSADNSGFMGSNESLDWPPSALCPPVFL TLLTAASTQLIKPKA >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_6|768_bp atgaagctgcggaccctcgcggtgagtgctacagctcttaaggtggcgcgtctggagttt gttccttttgatgttcggatgtgttcggagtttctttcttccggagtgaagctgcagacc ttcgcggtgagtgttacagctcttaatgcactgcgtctggagttgttcgttcctcccggt gggctcatggtctcgctggcttcaggagtgaagctgcagatcttcacggtgagtgttaca gctcataaaagcagtgtggacccaaagacccttgggtggtcgatgggactgggcgccgtg gagcagggggcggcgctcatcggggaggcttgggccgcacaggagcccatggagggggtg ggaggctcaggcatggcgggctgcaggtctcgagccctgccccgcggaaaggcagctaag gcctggagagaagtggattctgaaagctggcggttctggtgtaaagttgctaatagtgtg cccacttataaggttctcttccacccagcactggagtgtgtccctgtcctcaccacttta gccctggtccctcctgcagcatgcacaattcccactgctcctttgattctgtgccaggca cctgagattcttggcttcccttcttctcctgtctccacttcagctgataactcaggcttc atgggttcaaatgaaagtcttgactggcccccatctgccctttgtcctccagtctttctc accttattgacagcagcatccacacagctgatcaagccaaaagcctga >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_7|885_aa MAAEEADVDIEGDVVAAAGAQPGVHSPTKPASYSVKWTIEEKELFEQGLPGAIYRYRQKR SGQYGLLPSRTLIRCSHDEAVLALVPAGCRCGVQAKFGRRWTKISKLIGSRTVLQVKSYA RQYFKNKVKCGLDKETPNQKTGHNLQVKNEDKGTKAWTPSCLRGRADPNLNAVKIEKLSD DEEVDITDEVDELSSQTPQKNSSSDLLLDFPNSKMHETNQGEFITSDSQEALFSKSSRGC LQNEKQDETLSSSEITLWTEKQSNGDKKSIELNDQKFNELIKNCNKHDGRGIIVDARQLP SPEPCEIQKNLNDNEMLFHSCQMVEESHEEEELKPPEQEIEIDRNIIQEEEKQAIPEFFE GRQAKTPERYLKIRNYILDQWEICKPKYLNKTSVRPGLKNCGDVNCIGRIHTYLELIGAI NFGCEQAVYNRPQTVDKVRIRDRKDAVEAYQLAQRLQSMRTRRRRVRDPWGNWCDAKDLE GQTFEEPFQVKVASEALLIMDLHAHVSMAEVIGLLGGRYSEVDKVVEVCAAEPCNSLSTG LQCEMDPVSQTQASETLAVRGFSVIGWYHSHPAFDPNPSLRDIDTQAKYQSYFSRGGAKF IGMIVSPYNRNNPLPYSQITCLVISEEISPDGSYRLPYKFEVQQMLEEPQWGLVFEKTRW IIEKYRLSHRSGVEVCDMQTVSTVRTSAALNPTEETVASTCQPEQVSSALPSSHFSSWVL MRLLQVIATCPKAPVAQMEGSMPSEYRRAGKGTQAPRLAENFCVCHLATGDMLRAMVASG SELGKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEK RKEKLDSVIEFSIPDSLLIRRITGRLITPRVAVPTTRSSTLQKSP >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_7|2658_bp atggcggctgaagaggcggatgtggatatcgaaggggacgtggtagcggcggcgggggca cagccaggggtacactctcctacaaaaccagccagttactcagtaaagtggacgatagaa gaaaaagagctgtttgaacaagggctgcctggggcaatttataggtatcggcagaagagg tctgggcagtatggcttgctgcccagcaggacattgataagatgttcccatgatgaggca gttctggcccttgttccggcgggatgtcgttgtggtgttcaggctaaatttggccgaaga tggaccaaaatttcaaagctaattggaagccgcactgttttacaagtgaagagttatgca agacagtattttaaaaataaggtcaaatgcggtctggataaagaaacaccaaatcagaag accggccataatcttcaagttaaaaatgaagataaagggacaaaggcatggacaccatca tgtttaaggggacgtgctgatcccaacttgaatgctgtaaaaattgaaaagttatctgat gatgaagaagtagacatcacagatgaggtggacgagttgtcttctcaaacaccccagaag aattctagcagtgatctcttgttagactttcctaatagtaaaatgcatgaaaccaatcaa ggagaattcattacttctgacagccaggaagctctcttttctaagtcttccaggggctgt cttcaaaatgaaaagcaagatgaaacactttcaagctcagaaattacactgtggactgag aaacagagcaatggtgacaaaaaatcaattgaattaaatgaccagaaatttaatgaattg attaaaaactgcaacaagcatgatggaaggggaataatagttgatgccaggcagttgcct tctccagagccttgtgaaattcagaaaaatttgaatgataatgaaatgctttttcattct tgccaaatggtagaggaaagccatgaggaagaagagcttaagccaccagaacaggaaata gaaatagatagaaatatcattcaagaagaagaaaaacaagcaattcctgagttttttgag gggcgccaagctaaaacaccagaacgctatttgaaaattagaaattatattttggatcaa tgggagatatgcaaaccaaaatacttaaataagacctcagtacgtcctggcctgaagaac tgtggagatgttaattgtattggacggattcatacatacctcgaattgataggagcaatc aattttggatgtgaacaggctgtgtataataggccacaaacagttgacaaagtacgaatc agagacagaaaagatgcagtagaagcataccaacttgcccagcgtctgcagtctatgcgt acaaggagacgtagggtccgagacccatggggaaactggtgtgatgcaaaggacttagaa ggacaaacgtttgaggagccatttcaggtgaaagtggcttcagaagcacttttaataatg gatttgcatgctcatgtttctatggcagaagtgattggtctgttaggaggaagatactca gaagttgataaagtagttgaagtctgtgcagcagaaccatgtaacagtctgagtacagga ctacagtgtgagatggatcctgtatcacaaacacaggcctcagaaaccttggctgttaga ggcttcagtgttattggatggtatcattctcatcctgcttttgatcctaatccttcctta cgagatattgacacacaagctaaataccagagttacttctccagaggaggtgcaaagttc attgggatgattgttagtccctataatcgaaataatcccttaccatattctcagattacc tgcctggttataagtgaggaaattagcccagatggctcttatcgcttaccttacaaattt gaagtacagcagatgttagaagaacctcagtggggattagtatttgaaaagacaagatgg ataatagaaaaatacaggctctcccataggagtggggttgaggtctgtgacatgcagact gtaagcacagtcagaacctctgcagctctcaacccaacagaggagactgtggcttcaaca tgccaacctgagcaggtctcctctgcattaccttcctcccatttctcttcctgggtcctg atgaggctgctacaggtaatagctacatgccctaaagccccagttgctcagatggaaggc tccatgcccagtgagtacaggagggccggtaaaggcacccaggcacccagattggctgaa aacttctgtgtctgccatttagctactggggacatgctgagggccatggtggcttctggc tcagagctaggaaaaaagctgaaggcaactatggatgctgggaaactggtgagtgatgaa atggtagtggagctcattgagaagaatttggagacccccttgtgcaaaaatggttttctt ctggatggcttccctcggactgtgaggcaggcagaaatgctcgatgacctcatggagaag aggaaagagaagcttgattctgtgattgaattcagcatcccagactctctgctgatccga agaatcacaggaaggctgatcaccccaagagtggccgttcctaccacgaggagttcaacc ctccaaaagagcccatga >gi568815597r:58530846_58731530|GENSCAN_predicted_peptide_8|146_aa SHYSTCAEQRFQWRAREEAEKPVRWLLPSSSESGLDRHERSKGGEKGLDSKCNFDRLDVG LCLSLRSPDAERERCTVLSVCTCVKLVAMAIAGVKGGPRGTESIWHQEWHEIQAWSTIPI HLELLGKRYNFSTGVAEKKRYNVRWI >gi568815597r:58530846_58731530|GENSCAN_predicted_CDS_8|441_bp agtcattattcaacgtgcgctgagcaaagatttcagtggagggcaagagaagaagcagag aagccagttaggtggctactgccgtcatccagtgaaagtggcttggaccgccatgaaaga agtaaaggtggtgagaaggggctggattctaaatgcaactttgacagattggatgtaggg ttgtgcttgagcctgaggagccctgatgcagaaagggagagatgcacggtgctgtcagtt tgcacttgtgtgaagcttgttgccatggcaatagctggagtaaaaggtggccctcgtgga actgagtccatctggcatcaggaatggcatgagatccaggcctggtcaaccatacccatc cacctggagctattgggaaagaggtataatttttccacgggggtcgctgagaagaaaagg tataatgtcagatggatatag