GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:59:40 Sequence gi568815581f:59793185_60046785 : 253601 bp : 43.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14868 14915 48 2 0 93 77 28 0.958 3.17 1.02 Intr + 15612 15692 81 2 0 109 89 35 0.950 5.53 1.03 Intr + 24528 24589 62 2 2 103 106 -14 0.092 -0.57 1.04 Intr + 45111 45213 103 0 1 50 110 70 0.141 5.58 1.05 Term + 46584 46727 144 2 0 76 42 73 0.697 -0.59 1.06 PlyA + 47884 47889 6 1.05 2.08 PlyA - 48582 48577 6 1.05 2.07 Term - 67240 67138 103 0 1 98 39 52 0.801 -0.95 2.06 Intr - 70663 70480 184 2 1 58 106 121 0.970 9.75 2.05 Intr - 73565 73425 141 0 0 112 86 44 0.875 6.92 2.04 Intr - 81519 81355 165 1 0 100 98 9 0.820 3.03 2.03 Intr - 85150 84979 172 1 1 136 28 147 0.903 13.02 2.02 Intr - 90716 90672 45 2 0 46 103 44 0.001 0.31 2.01 Init - 92238 92026 213 0 0 69 90 16 0.000 -1.36 2.00 Prom - 96152 96113 40 -7.56 3.00 Prom + 98482 98521 40 -8.36 3.01 Init + 100001 100141 141 1 0 69 60 200 0.353 15.46 3.02 Intr + 117378 117427 50 2 2 55 93 46 0.045 -0.72 3.03 Intr + 119500 119620 121 1 1 79 102 57 0.932 6.70 3.04 Intr + 121451 121519 69 1 0 86 55 69 0.733 2.78 3.05 Intr + 133251 133398 148 2 1 84 49 137 0.895 9.21 3.06 Intr + 136933 136990 58 2 1 118 119 -23 0.988 1.74 3.07 Intr + 138438 138551 114 0 0 81 1 116 0.591 1.76 3.08 Intr + 141250 141340 91 1 1 90 42 48 0.920 0.40 3.09 Intr + 142009 142116 108 0 0 89 116 16 0.967 4.98 3.10 Intr + 143280 143357 78 2 0 127 111 27 0.996 8.65 3.11 Intr + 147652 147759 108 0 0 21 119 112 0.988 8.08 3.12 Intr + 152222 152334 113 1 2 81 68 65 0.956 2.98 3.13 Term + 153367 153604 238 1 1 59 48 104 0.890 -0.86 3.14 PlyA + 153742 153747 6 1.05 4.00 Prom + 166911 166950 40 -3.56 4.01 Init + 170352 170427 76 2 1 20 13 138 0.665 0.75 4.02 Term + 171417 171730 314 1 2 113 36 189 0.907 11.36 4.03 PlyA + 172450 172455 6 1.05 5.25 PlyA - 173843 173838 6 1.05 5.24 Term - 180550 180358 193 0 1 -20 42 385 0.869 19.99 5.23 Intr - 183586 183472 115 2 1 74 95 148 0.956 13.61 5.22 Intr - 194420 194263 158 1 2 28 99 62 0.407 0.95 5.21 Intr - 196160 196080 81 1 0 70 98 71 0.577 5.05 5.20 Intr - 202630 202448 183 0 0 81 95 19 0.616 0.80 5.19 Intr - 202855 202769 87 0 0 72 106 92 0.967 8.49 5.18 Intr - 205053 204954 100 1 1 47 98 6 0.958 -3.33 5.17 Intr - 206978 206847 132 0 0 112 51 81 0.981 7.42 5.16 Intr - 213216 213066 151 0 1 -56 57 293 0.547 11.54 5.15 Intr - 215625 215443 183 0 0 43 80 92 0.332 3.88 5.14 Intr - 215864 215741 124 1 1 101 45 70 0.408 4.59 5.13 Intr - 216445 216310 136 2 1 34 73 82 0.463 0.93 5.12 Intr - 216998 216846 153 0 0 85 80 85 0.323 7.44 5.11 Intr - 219315 219195 121 0 1 97 110 135 0.994 16.67 5.10 Intr - 219907 219859 49 0 1 97 117 75 0.991 9.98 5.09 Intr - 221225 221116 110 2 2 102 82 110 0.998 10.88 5.08 Intr - 221663 221454 210 2 0 100 77 226 0.743 21.81 5.07 Intr - 222187 222107 81 1 0 52 90 59 0.809 2.33 5.06 Intr - 222709 222670 40 0 1 80 116 -3 0.953 -0.07 5.05 Intr - 224080 223995 86 1 2 81 79 90 0.865 6.02 5.04 Intr - 224315 224243 73 1 1 87 82 55 0.664 4.21 5.03 Intr - 224746 224621 126 0 0 52 44 91 0.409 0.89 5.02 Intr - 225414 225324 91 1 1 86 40 34 0.696 -2.55 5.01 Init - 226191 226137 55 0 1 104 13 207 0.685 14.25 5.00 Prom - 227581 227542 40 -6.16 6.00 Prom + 229559 229598 40 -7.36 6.01 Init + 232598 232672 75 1 0 100 5 167 0.665 8.69 6.02 Intr + 232719 232927 209 2 2 3 31 294 0.423 13.08 6.03 Intr + 237430 237562 133 1 1 11 100 85 0.189 2.65 6.04 Term + 246810 246968 159 2 0 94 48 61 0.073 0.64 6.05 PlyA + 247047 247052 6 1.05 7.03 PlyA - 247862 247857 6 1.05 7.02 Term - 250950 250379 572 1 2 61 49 329 0.846 20.90 7.01 Intr - 252446 252320 127 2 1 52 73 68 0.481 1.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 117396 117427 32 2 2 83 93 24 0.863 1.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_1|145_aa MPKELASRMHVSAWEEDFASRAKLAVQKLVQKVGFFGILACASKIFVIITFSKHIVEQMV AFIGAVPGIGPSLQKPFQEYLEAQRQKLHHKSEMGTPQGENWLSWMFEKLVVVMVCYFIL SIINSMAQSYAKRIQQRLNSEEKTK >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_1|438_bp atgccgaaagaactagccagccgaatgcatgtttcagcatgggaggaggactttgcctcc cgggccaaactggcagttcaaaaactagtacagaaagttggattttttggaattttggcc tgtgcttcaaaaatttttgttataataacattcagcaagcacatagtggagcaaatggtg gctttcattggtgctgtccccggcataggtccatctctgcagaagccatttcaggagtac ctggaggctcaacggcagaagcttcaccacaaaagcgaaatgggcacaccacagggagaa aactggttgtcctggatgtttgaaaagttggtcgttgtcatggtgtgttacttcatccta tctatcattaactccatggcacaaagttatgccaaacgaatccagcagcggttgaactca gaggagaaaactaaataa >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_2|340_aa MNIMRAMKPNSCIHMTFQETGVGDIFLVYMSSCQCIGSPQFNLRSSWSQLYHGSLSCREL WVVAIPHVIIRNYLVSKLVELPGTYQVIVQNYNSILTLSHLYRSSDALLLHENDAIHKIC AKLMNIKQISFSDINQVLAHQLGRDLMEHLVPHPEFKMLSVRNIPHMSENSLAYTTFTWA GLLKHLRQMLISNAKMEEGIDRHVWPPLSGLPPLSKMSLNKDLHFNTSIANLVILRGKDV QSADVEGFKDPALYTSWLKPVNAFNVWKTQRAFSKYEKSAVLVSNSQFLVKPLDMIVGKA WNMFASKAYIHQYTKFGIEEEDFLDSFTSLEQVVASYCNL >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_2|1023_bp atgaatatcatgagagccatgaagccgaacagctgcatacacatgacattccaagaaaca ggtgtgggggacatattcctggtatacatgtctagctgccagtgcatcggttcaccccag ttcaatctcaggtccagctggtcccagctataccatggatccctctcatgcagggagctt tgggtagtcgccataccccatgtcatcatccggaactacctggtatcaaaacttgtagaa ctacctggtacctaccaggttattgttcaaaactacaactccattttgacactttctcac ttgtaccgatcttcagacgccctccttcttcatgagaatgatgccatccataagatctgt gcaaaactgatgaatatcaagcagatctcctttagtgatatcaatcaagtcctcgcacat cagctgggaagagacttaatggagcatttagttccccatcctgaattcaagatgctgagt gttcgtaacattcctcacatgtctgagaattcattggcatacaccacatttacttgggct ggcctcctcaagcatttgagacagatgctcatttctaatgcaaagatggaagaaggtatt gataggcatgtatggcctcctttatcaggacttcctcctcttagtaaaatgtctctcaac aaggacctgcattttaacacttccattgctaacttggtcattcttcgtgggaaagatgtg caaagtgcagatgtggagggatttaaagatccagctctgtatacttcctggttgaagcct gttaatgctttcaacgtgtggaaaacccagcgggcctttagcaaatatgagaagtctgca gtgttggtcagcaacagccagttcttagtaaaaccacttgatatgattgttgggaaggca tggaatatgtttgcttcaaaagcctacattcatcagtacacaaaatttggaatcgaagaa gaggactttttagacagtttcacgtcattagagcaggttgttgccagttactgtaatctc tga >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_3|478_aa MRRRRRRDGFYPAPDFRDREAEDMAGVFDIDLDQPEDAGSEDELEEGGQLNESMDHGGVG PYELGMEHCEKFEISETSVNRGPEKIRPECFELLRVLGKGGYGKVFQVRKVTGANTGKIF AMKVLKKAMIVRNAKDTAHTKAERNILEEVKHPFIVDLIYAFQTGGKLYLILEYLSGGEL FMQLEREGIFMEDTACFYLAEISMALGHLHQKGIIYRDLKPENIMLNHQGGDIPAPEILM RSGHNRAVDWWSLGALMYDMLTGAPPFTGENRKKTIDKILKCKLNLPPYLTQEARDLLKK AHPFFRHINWEELLARKVEPPFKPLLQSEEDVSQFDSKFTRQTPVDSPDDSTLSESANQV FLGFTYVAPSVLESVKEKFSFEPKIRSPRRFIGSPRTPVSPVKFSPGDFWGRGASASTAN PQTPVEYPMETSGIEQMDVTMSGEASAPLPIRQPNSGPYKKQAFPMISKRPEHLRMNL >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_3|1437_bp atgaggcgacgaaggaggcgggacggcttttacccagccccggacttccgagacagggaa gctgaggacatggcaggagtgtttgacatagacctggaccagccagaggacgcgggctct gaggatgagctggaggaggggggtcagttaaatgaaagcatggaccatgggggagttgga ccatatgaacttggcatggaacattgtgagaaatttgaaatctcagaaactagtgtgaac agagggccagaaaaaatcagaccagaatgttttgagctacttcgggtacttggtaaaggg ggctatggaaaggtttttcaagtacgaaaagtaacaggagcaaatactgggaaaatattt gccatgaaggtgcttaaaaaggcaatgatagtaagaaatgctaaagatacagctcataca aaagcagaacggaatattctggaggaagtaaagcatcccttcatcgtggatttaatttat gcctttcagactggtggaaaactctacctcatccttgagtatctcagtggaggagaacta tttatgcagttagaaagagagggaatatttatggaagacactgcctgcttttacttggca gaaatctccatggctttggggcatttacatcaaaaggggatcatctacagagacctgaag ccggagaatatcatgcttaatcaccaaggtggagacataccggcccctgaaatcttgatg agaagtggccacaatcgtgctgtggattggtggagtttgggagcattaatgtatgacatg ctgactggagcacccccattcactggggagaatagaaagaaaacaattgacaaaatcctc aaatgtaaactcaatttgcctccctacctcacacaagaagccagagatctgcttaaaaag gctcatccattctttagacacattaactgggaagaacttctggctcgaaaggtggagccc ccctttaaacctctgttgcaatctgaagaggatgtaagtcagtttgattccaagtttaca cgtcagacacctgtcgacagcccagatgactcaactctcagtgaaagtgccaatcaggtc tttctgggttttacatatgtggctccatctgtacttgaaagtgtgaaagaaaagttttcc tttgaaccaaaaatccgatcacctcgaagatttattggcagcccacgaacacctgtcagc ccagtcaaattttctcctggggatttctggggaagaggtgcttcggccagcacagcaaat cctcagacacctgtggaatacccaatggaaacaagtggcatagagcagatggatgtgaca atgagtggggaagcatcggcaccacttccaatacgacagccgaactctgggccatacaaa aaacaagcttttcccatgatctccaaacggccagagcacctgcgtatgaatctatga >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_4|129_aa MAEGIRKTVMKYGYPADVNEHQSSLGLTSHDQKRTEVSGATARTAAYTASSPSVGAINRK PRKLFSQPGGNGGGARAPRTFSRARARPAPAPPPSPAPAPRRVESSRLPGCFGSENKRAA PENGLQAWR >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_4|390_bp atggcagaaggcatccgcaaaacagtcatgaagtatgggtacccagcagatgttaatgag caccagtcctcactgggcctaacctctcatgaccagaagcggacggaggtgtcgggagcg acagcaagaacagcggcatacaccgcctccagcccttcagtcggggccatcaaccgcaaa ccccgcaagctcttctctcagcccggcggcaacggcggcggcgcgcgcgctccccggacg ttctcacgcgcgcgcgcgcgccctgccccggctccacctcctagccccgcccccgcaccg cgccgagtggagagctcacggctgccggggtgttttggcagcgagaacaagcgagctgcg cctgagaacggccttcaggcctggcgctaa >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_5|945_aa MDALMLALGALMLALGLGDPKSAQVSDPDGAEPGPGRATGTDPLVQIHWKLGISEALSIP GGRLYRDLWSLTPVCLHIPGIAHHGPFTLGRMDVVEVAGSWWAQEREDIIMKYEKGHRAG LPEDKGPKSFGSYNNNVDHLGIVHETELPPLTAREVKQIRREISRKSKWVKMLGEWDTYK NSRKLIDRAYQGIPMNIRGPMWSVLLNIEEIKLKNPGRYQVRSARAQPTGQAVSGAQVSS WRERQDHPGELGVKIMKEKGKRSSEHIQQMDLDVSGTLRRHIFFRDRYGTKQRELLYILL AYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQERLTKTSRCGPWA RFWNRFVDAWARDDDTVLKHLRASMKKLTRKQGDLPPPACQDNEPGARETRESVSLTHGA FRERAQAGPRAQGQSQEFSQKWERQTRARVVGIQACASFTWREDPLQGGQAGPSRPTSPV PMAHLGGPQGSWRFLQWNSMPRLPTDLDVGDPWFRRYDFRQSCWVRAISQEDQPATCWQA EHPAERETQSVYWELKAYVYLNAVRVRCMNKFRLSCLEEEQLLLLVTSTFGNGDCSGQFE IEKSCELLFQMAESVDYDYDVQDTILDGLLILPCYGSMTTDQQRRIFLPPPPGIRKCVIS TNISATSLTIDGTRYVVDGGFVKQLNHNPRLGLDILEVVPISKSEALQRSGRAGRTSSGK CFRIYSKDFWNQCMPDHVIPEIKRTSLTSVVLTLKCLAIHDVIRFPYLDPPNERLILEAL KQLYQCDAIDRSGHVTRLGLSMVEFPLPPHLTCAVIKAASLDCEDLLLPIAAMLSVENVF IRPVDPEYQKEAEQRHRELAAKAGGFNDFATLAVIFEQCKSSGFRKFLVHNVKELEVLLM CNKSYCAEITHHVSSKNRKAIVERAAQLAIKVTNPNARLRSKENE >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_5|2838_bp atggatgctctgatgctggccctgggcgctctgatgctggccctgggcctcggggatccc aaatctgcccaggtttccgatcccgatggggcagagcctggtcctggcagagccactggt acagatccactggtacagatccactggaaactgggcatctctgaggccctgagcatccca ggaggccgattgtacagagacctctggtcgctgaccccagtctgcctccacatccctgga atagcccatcatgggcccttcacccttggcaggatggacgtggtagaggtcgcgggtagt tggtgggcacaagagcgagaggacatcattatgaaatacgaaaagggacaccgagctggg ctgccagaggacaaggggcctaagtcttttggaagctacaacaacaacgtcgatcatttg gggattgtacatgagacggagctgcctcctctgactgcgcgggaggtgaagcaaattcgg cgggagatcagccgaaagagcaagtgggtgaaaatgctgggagaatgggacacctacaaa aacagcagaaagctcatagatcgagcgtaccagggaattcccatgaacatccggggcccg atgtggtcagtcctcctgaacattgaggaaatcaagttgaaaaaccccggaagataccag gtacgctcagccagagcacaaccaacaggacaggccgtgtcaggggcccaggtctccagc tggagggaacgtcaagaccaccctggggagctgggggtgaagatcatgaaggagaagggc aagaggtcatctgaacacatccagcagatggacctggacgtaagcgggacattaaggagg catatattcttcagggatcgatacggaaccaagcagcgggaactactttacatcctcctg gcgtatgaggagtataacccggaggtgggctactgcagggacctgagccacatcgccgcc ttgttcctcctttatcttcctgaggaggatgcattctgggcactggtgcagctgctggcc agtgagaggcactccctgcaggagcgcctcacgaagacgtccaggtgtggcccgtgggca cgtttttggaaccggttcgttgatgcctgggccagggatgatgacactgtgctcaagcat cttagggcctctatgaagaaactaacaagaaagcagggggacctgccacccccagcttgc caggacaacgagcctggagccagggagacaagggaatcggtgtccctgacccacggagct ttcagggagagggcgcaggcgggaccccgggcccagggccagagccaagagttcagccag aagtgggaacgccaaacccgagcaagggtcgtcggcatccaggcctgtgccagcttcacg tggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccggtt cccatggcccatttgggaggacctcagggttcctggagattcctgcagtggaactccatg ccccgcctcccaacggacctggacgtaggggacccttggttccgccgttatgatttcaga cagagctgctgggtccgtgccatatcccaggaggaccagccggccacctgctggcaggct gaacaccctgcggagcgggaaacccagagcgtctactgggagctgaaggcttatgtctac ctcaatgctgtccgcgttcgctgtatgaacaagttcaggctgagctgcctggaggaggag cagctgctgttgctggtgaccagcacgttcgggaacggagactgctctggccagtttgaa atagaaaaaagttgtgagttactttttcagatggcagagtctgttgattatgattatgat gttcaagataccatcctcgatggcttgttaatattgccgtgttatggatcaatgacaact gatcaacagaggaggatatttttgccaccgccacctggaattagaaaatgtgtcatatcc accaatatttctgcaacgtctttgacaatagatggaaccagatatgtggtagatggtggc tttgtgaagcagttaaatcacaaccccagattagggttggacatcctggaggtggttcca atttcaaagagcgaggcattacagcgaagtggccgagctggcaggacttcttcaggaaaa tgctttcggatctatagtaaagatttttggaaccagtgtatgcctgaccatgtgatccct gaaattaagagaactagtttgacatctgtagttctgaccttaaagtgccttgccatacac gatgtcataaggtttccctatttggatccacctaatgagagacttattttagaagctctt aaacagctttaccagtgtgatgctattgacaggagtggccatgtcaccagattgggtttg tctatggtggagtttcctttgcctccacatctgacatgtgcagtaataaaagctgcttcc ctggattgtgaagatttactacttccaatagcagcaatgttgtctgtggaaaacgtcttc attagacctgttgatccagagtaccagaaggaagcagaacagagacatcgagaattggca gctaaagctggaggatttaatgactttgcaactttagctgtcatctttgaacaatgcaaa tcaagtggcttccggaagttcctggtccacaacgtcaaggagctggaagtgctgctgatg tgcaacaaatcttactgtgccgagatcactcaccatgtttcctccaagaaccgcaaagcc atcgtggaaagagctgcccaactggccatcaaagtcaccaaccccaatgccaggctgcgc agcaaagaaaatgagtag >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_6|191_aa MPGLLSRPRRGSLRARPCGGGEALPEHVAIDVCPGPIRPIQQISGYFPHFPRDLPHDAPA RPATASAGRRRPSDGARDHDKDGDHLFGSLTIQHRPWQLHPPISHSHPLLFSTRDLKPMD HNGLADPCVKLHLRPGARKVPEACQLAPDLQDSSCLTAITKAHDVLTVFLPAALKIQGNK YLTFIECQKLF >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_6|576_bp atgccgggcctgctgagccgcccccggcgggggtcgctccgggccaggccgtgcgggggc ggcgaggcgctgcctgagcatgtggccatcgacgtgtgccccggccccatccgccccatc cagcagatctctggctacttcccccacttcccgcgggacctgccccatgacgcccccgcg cgcccagccactgccagcgccggccgccgccgcccctctgacggcgcccgcgaccacgac aaggatggcgaccatctcttcggaagcctaacgatccagcacaggccctggcagctccat cccccgatttcccattcacaccctttgctgttctctacccgggacctgaagcccatggac cacaatgggctggcagacccctgtgtcaagctgcacctgcggccaggagccaggaaggtt cccgaggcatgccagctagcacctgacctacaagattccagctgtttaactgccatcaca aaggctcatgatgtgttgacagtatttctgcctgctgctctgaaaattcaaggaaataaa tacctgacatttattgaatgccagaagctgttctaa >gi568815581f:59793185_60046785|GENSCAN_predicted_peptide_7|232_aa LSTISIYYFNGQENRKEKNWNEREYKLEIPYELCTEVDAINKWTAPWTSQAYNALTSVVT SCKNFKVRIRSAAALSVPGKREQYGSVDQYARIWNALVTALQKSEDTIDFLEFKYCVSLR TQICQALIHLLSLASASDLPCMKETLELSGNMVQSYILQFLKSGAEGDDTGAPHSPQERD QMVRMALKHMGSIQAPTGDTARRAIMGFLEEILAVCFDSSGSQGALPGLTNQ >gi568815581f:59793185_60046785|GENSCAN_predicted_CDS_7|699_bp ttgtctacaataagcatatattactttaatggtcaagaaaacaggaaggaaaaaaactgg aatgagagggaatataaactggaaatcccctatgagctgtgtacagaagtggatgccatt aataaatggacagccccatggacctcccaggcctacaatgccctgacatcggtcgtgaca tcatgcaagaacttcaaagtgcgcatcagatctgcagctgccctttccgtcccggggaag agagagcagtacgggtctgttgaccagtatgctcggatctggaatgcattggtcaccgct ttacagaagagtgaagacaccatagactttttggaattcaagtactgtgtcagcctacgg acccaaatctgccaggcactgattcacctcttgagcttggccagtgcctcggacctccct tgtatgaaagaaacccttgaactgagtgggaatatggtccagtcctatattctacagttt ttaaaatcaggagcagagggagatgacactggagcaccccacagcccacaggaaagagac cagatggtcagaatggcccttaaacacatgggcagcatccaggcaccaactggagacaca gccagaagggccatcatgggctttttagaagagatcctggccgtttgttttgactcatct ggatcacaaggggcactcccagggttaacaaatcagtga