GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:23:46 Sequence gi568815591f:55853196_56055175 : 201980 bp : 44.80% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 17435 17474 40 -1.86 1.01 Init + 18664 18771 108 0 0 77 39 96 0.236 3.82 1.02 Intr + 21963 22054 92 2 2 77 93 -12 0.152 -3.01 1.03 Intr + 24547 24591 45 1 0 70 83 84 0.887 3.62 1.04 Intr + 24756 24956 201 0 0 -12 89 204 0.931 8.90 1.05 Intr + 25894 25925 32 1 2 91 37 33 0.334 -3.73 1.06 Intr + 34089 34303 215 1 2 108 41 172 0.290 12.93 1.07 Intr + 34366 34485 120 0 0 34 105 113 0.728 8.29 1.08 Term + 34787 34804 18 1 0 128 49 -3 0.727 -1.88 1.09 PlyA + 35854 35859 6 1.05 2.00 Prom + 38529 38568 40 -0.36 2.01 Init + 68403 68639 237 2 0 92 43 163 0.824 10.21 2.02 Intr + 69967 70093 127 0 1 43 68 162 0.531 9.95 2.03 Intr + 70412 70504 93 0 0 38 111 52 0.667 2.44 2.04 Term + 85787 86811 1025 0 2 44 32 391 0.219 21.38 2.05 PlyA + 88285 88290 6 1.05 3.00 Prom + 89468 89507 40 -5.06 3.01 Init + 92627 92676 50 1 2 69 34 71 0.243 0.22 3.02 Intr + 98621 98735 115 2 1 -7 105 105 0.431 3.15 3.03 Term + 101714 101983 270 1 0 94 48 161 0.765 8.18 3.04 PlyA + 103286 103291 6 1.05 4.00 Prom + 106699 106738 40 -1.76 4.01 Init + 111415 111506 92 0 2 99 105 173 0.999 18.06 4.02 Intr + 124931 125070 140 2 2 107 65 72 0.436 6.91 4.03 Term + 147879 148009 131 1 2 0 54 117 0.037 -2.16 4.04 PlyA + 149500 149505 6 1.05 5.04 PlyA - 149514 149509 6 1.05 5.03 Term - 151868 151725 144 2 0 81 47 180 0.996 11.11 5.02 Intr - 153621 153530 92 1 2 89 63 21 0.734 -0.59 5.01 Init - 154783 154672 112 1 1 71 29 73 0.385 0.07 5.00 Prom - 155012 154973 40 -4.86 6.07 PlyA - 157240 157235 6 1.05 6.06 Term - 158674 158567 108 1 0 121 38 137 0.999 10.51 6.05 Intr - 161976 161828 149 1 2 109 34 103 0.992 6.95 6.04 Intr - 164184 164039 146 2 2 80 107 12 0.933 2.23 6.03 Intr - 166539 166405 135 2 0 100 116 66 0.994 10.28 6.02 Intr - 168036 167878 159 2 0 93 23 145 0.791 7.60 6.01 Init - 181274 180766 509 2 2 39 110 160 0.579 7.73 6.00 Prom - 181862 181823 40 -6.66 7.00 Prom + 191313 191352 40 -5.06 7.01 Init + 198654 198790 137 2 2 91 67 303 0.998 26.11 7.02 Intr + 198942 199077 136 0 1 61 35 122 0.840 4.77 7.03 Intr + 201174 201308 135 2 0 101 21 99 0.748 5.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 16492 16390 103 1 1 60 68 115 0.975 5.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_1|276_aa MDNYHTERIAGEIDIPEMSGDGKEQRRAEAISISHRAVCCMLYAVCSSSVAGGRGAGLLS VAAVLGSPAAEQFFPEYENPERDDPSTIEKLSKNKQKPITPETAEKLAHDLKIVKYVECS ALTQKGLKNVFDEATLAALEPPESKKSHRVDPVFDETHERPENSGKARGRGPRGPHSPDE KQTRESWWESGAPAMRIAMKEAGSERRQRMRIGDLRPASCAAIAGRLSSEQAAGPSAPSP SRVHRGGGGGGGGGGGGVRGRSLPKRPLSAEIFGLL >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_1|831_bp atggataattaccacacagaaagaattgctggagagattgacatacctgagatgtctggg gatggcaaggaacaaagaagggcagaagctatcagtatcagccacagagctgtatgctgt atgctctatgcagtatgcagcagttctgttgcagggggtcggggagcagggttgctgtca gtggcggcagtcctaggcagtcctgcagcagaacagttcttccctgaatacgagaaccca gaaagagatgatccctctactattgagaaactttccaagaacaaacagaagcctatcact ccagagactgctgaaaagctggcccatgacctgaagattgtcaagtatgtggagtgttct gcactcacacagaaaggcctaaagaatgtatttgacgaagcaacattggctgcgctggag cctccagaatcgaagaagagccacagagtagatccagtatttgatgaaactcatgaaaga ccggaaaattctgggaaggctcgaggccgcggtccccggggtccgcatagtcccgatgaa aaacagacccgggaaagctggtgggagtcaggagccccggcgatgaggattgcgatgaag gaagcaggctcggagcgccgccagcgcatgcgtattggggatctgaggccagcgtcttgc gccgccattgcggggaggctgtcctcagagcaggctgcggggccctcggcaccttctccc tcccgggtccaccgcggcggcggcggcggcggcggcggcggcggcggcggcgtcaggggg cggagcctgccgaagcgccctttgtctgcggagatttttggtcttttatag >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_2|493_aa MVEIARELELEVQPEDVVELLLKDEEKKWFLEMESVPGEDALNNVEMTIVDLEYSINLLD KAAGGFEKTDSNFERSSTVESLTFQDVAVDFTREEWDQLYPAQKNLYRDVMLENYRNLVA LGYQLCKPEVIAQLELEEEWVIERDSLLDTHPDGENRPEIKKSTTSQNISDENQTHEMIM ERLAGDSFWYSILGGLWDFDYHPEFNQENHKRYLGQVTLTHKKITQERSLECNKFAENCN LNSNLMQQRIPSIKIPLNSDTQGNSIKHNSDLIYYQGNYVRETPYEYSECGKIFNQHILL TDHIHTAEKPSECGKAFSHTSSLSQPQMLLTGEKPYKCDECGKRFSQRIHLIQHQRIHTG EKPFICNGCGKAFRQHSSFTQHLRIHTGEKPYKCNQCGKAFSRITSLTEHHRLHTGEKPY ECGFCGKAFSQRTHLNQHERTHTGEKPYKCNECGKAFSQSAHLNQHRKIHTREKLCEYKC EQTVRHSPSFSST >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_2|1482_bp atggtggaaatagcaagagaactagaattagaagtgcagcctgaagatgtggtggaattg ctgcttaaggatgaggaaaaaaagtggtttcttgagatggaatcagttcctggtgaagat gctttgaataatgtggaaatgacaatagtggatttagaatattccataaacttattagat aaagcagcaggaggatttgaaaagactgactccaattttgaaagaagttctactgtggaa tcactgacgtttcaggatgtggccgtggacttcaccagagaggagtgggaccagctgtac cctgcccaaaagaacctctatcgagacgtgatgctggagaactacaggaatctagttgca ctggggtatcagctttgtaagccagaggtaatcgcgcagttggagctagaggaagaatgg gtgatagaaagagacagcctgctggatactcatccagatggagaaaacagacccgaaatc aaaaagtcaaccacaagccagaatatttctgatgaaaatcaaacccatgagatgataatg gagagactcgcaggagacagcttctggtactccatcctaggaggactctgggattttgat taccatccagagtttaaccaagaaaaccacaagagatatttaggacaagtaactttgacc cacaaaaagatcacacaggagagaagccttgagtgtaataaatttgcagaaaactgtaat ctgaactcaaaccttatgcagcagagaattccttccattaaaatacccctgaattctgac acacagggaaacagcatcaaacataattcagacttgatttactatcagggaaattatgta agagagactccctatgaatatagtgagtgtggaaaaatcttcaatcaacatattcttctt actgatcatattcatactgcagagaaacccagtgagtgtgggaaggccttcagccacacc tcatctcttagccagcctcagatgttgcttacaggagagaagccctataagtgtgatgaa tgtggaaaaagattcagccagaggatacatctcattcaacatcagagaattcacacagga gaaaagccttttatatgcaatggatgtgggaaagccttccgtcagcattcatcctttact caacatctgaggattcatactggagaaaagccctataaatgtaatcaatgtggtaaagct tttagccgcatcacatcccttactgaacatcatagacttcataccggagagaaaccttac gaatgtggtttctgtggcaaagccttcagtcagaggacacatctgaatcaacatgaaaga actcatacaggagagaaaccctataaatgtaatgaatgcgggaaagcctttagccagagt gcacaccttaatcaacacaggaaaatccatactcgggagaaattatgtgaatataaatgt gagcaaactgttcgccacagtccttcatttagcagcacataa >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_3|144_aa MEFSEEDEQNLDIPEKRRCTAYERGSVGLSQCVRRDAIPLRPALPGAGGCIVLAEYFNKR KTYFAHDALQQCTVGDIVLLRALPVPRAKHVKHELAEIVFKVGKVIDPVTGKPCAGTTYL ESPLSSETTQLSKNLEELNISSAQ >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_3|435_bp atggaattctcagaagaggatgagcagaacttggacatcccagaaaaaaggagatgcacg gcctacgagaggggtagtgtgggactgagccaatgcgtacggcgtgacgccatcccgctg cgaccggcgctccccggggcaggtggctgcatagtcttggcggagtattttaataagcgg aaaacctactttgctcacgatgcccttcagcagtgcacagttggggatattgtgcttctc agagctttacctgttccacgagcaaagcatgtgaaacatgaactggctgagatcgttttc aaagttggaaaagtcatagatccagtgacaggaaagccctgtgctggaactacctacctg gagagtccgttgagttcggaaaccacccagctaagcaaaaatctggaagaactcaatatc tcttcagcacagtga >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_4|120_aa MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA HSNLLAKKETSNLYKLQLRLEELTKGVTVVRGEDQDEEEPAKETKKPPGNSADTQASEAS >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_4|363_bp atggcggcgcgagtgctgcgcgcccgcggagcggcctgggccggcggcctcctgcagcgg gcggccccctgcagcctcctgcccaggctccggacatggacatcttccagcaacagatct cgagaagacagctggctaaaatccttatttgtccggaaagttgatccaagaaaagatgcc cactccaatctcctagccaaaaaggaaacaagcaatctatacaaattacagttgagacta gaggagctcaccaagggagtgactgtagtcagaggagaggatcaagatgaggaggaacct gcaaaagagactaagaaaccaccagggaactcagcagacactcaagcaagtgaggcgtca tga >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_5|115_aa MADCPCDAIVHSDYEHRLRKQKELRIGIMKPRAEIFQAGKYPTRVHDSCEITFRAGTRGN DFHTILYLSLNVVDPEVCLTQLPPGGHLPVGQLHAACSAGPTDPHTDTDCADMLQ >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_5|348_bp atggctgattgtccgtgtgacgccattgttcattctgattatgaacaccgcttacgtaaa cagaaagagctaagaattgggatcatgaagccaagagcagaaatttttcaggctgggaaa taccctaccagagttcatgacagctgtgagattacattcagagctggaacccgaggcaat gatttccacaccattctttatctttccctcaatgtggtggatccagaagtctgccttaca caactgccccctggtggccaccttcccgtgggacagttacatgcagcctgttcagcaggc cccactgaccctcacaccgacacggactgtgcagatatgctgcagtga >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_6|401_aa MPNVDFKRGPPTEISLIAKSVQQSLASLTYPYPKWWWRQRGAASEGPGALFLCKAEVPQA PCTSQTAGAEPPEEEGAGSGSPSPGAGPNRPPSPALQSRQIWCLQFTEALAELIRSHPAY QQISRERSVEALGLHSSPPWELLARASEPALAPCSRKVAAAGAPEAQDTGRKILPAMVSH SELRKLFYSADAVCFDVDSTVIREEGIDELAKICGVEDAVSEMTRRAMGGAVPFKAALTE RLALIQPSREQVQRLIAEQPPHLTPGIRELVSRLQERNVQVFLISGGFRSIVEHVASKLN IPATNVFANRLKFYFNGEYAGFDETQPTAESGGKGKVIKLLKEKFHFKKIIMIGDGATDM EACPPADAFIGFGGNVIRQQVKDNAKWYITDFVELLGELEE >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_6|1206_bp atgccaaacgtggatttcaaaagaggccccccgacggagatttcactaattgctaaatca gtgcagcagtcccttgcttccctcacctatccttatcctaagtggtggtggcggcagcgc ggagcagcttctgaaggtccgggagctcttttcctctgcaaagcggaggtgccacaggca ccgtgcacctctcagactgcaggggctgagcccccggaggaggagggggctggttcaggg agcccctcccctggggcggggccaaatcgtcctcccagcccggctttgcagagcagacag atctggtgtctccagtttacagaagcacttgcagaactcatcagaagccaccccgcttat cagcagatctcaagagagcgttcggtggaggccctgggtctgcacagctcacctccctgg gaactgctcgcccgagcgtcggagccggcgctggccccctgcagccggaaggttgcagcc gcaggagccccggaggcccaggacacagggaggaaaattcttccagcgatggtctcccac tcagagctgaggaagcttttctactcagcagatgctgtgtgttttgatgttgacagcacg gtcatcagagaagaaggaatcgatgagctagccaaaatctgtggcgttgaggacgcggtg tcagaaatgacacggcgagccatgggcggggcagtgcctttcaaagctgctctcacagag cgcttagccctcatccagccctccagggagcaggtgcagagactcatagcagagcaaccc ccacacctgacccccggcataagggagctggtaagtcgcctacaggagcgaaatgttcag gttttcctaatatctggtggctttaggagtattgtagagcatgttgcttcaaagctcaat atcccagcaaccaatgtatttgccaataggctgaaattctactttaacggtgaatatgca ggttttgatgagacgcagccaacagctgaatctggtggaaaaggaaaagtgattaaactt ttaaaggaaaaatttcattttaagaaaataatcatgattggagatggtgccacagatatg gaagcctgtcctcctgctgatgctttcattggatttggaggaaatgtgatcaggcaacaa gtcaaggataacgccaaatggtatatcactgattttgtagagctgctgggagaactggaa gaataa >gi568815591f:55853196_56055175|GENSCAN_predicted_peptide_7|136_aa MAAVKTLNPKAEVARAQAALAVNISAARGLQDVLRTNLGPKGTMKIPTVPAISSRIDAVS AENKAALERGVPRPLGLPGTLRTRHNLCGDGQIQHPTASLIAKVATAQDDITGDGTTSNV LIIGELLKQADLYISE >gi568815591f:55853196_56055175|GENSCAN_predicted_CDS_7|408_bp atggcggcggtgaagaccctgaaccccaaggccgaggtggcccgagcgcaggcggcgctg gcggtcaacatcagcgcagcgcggggtctgcaggacgtgctaaggaccaacctggggccc aagggcaccatgaagatccccaccgtcccggccatttcttcccgcatcgatgcagtctcc gcggagaacaaagcggctctagagcgcggcgttccccgccctctggggcttccaggcacc ctgaggactcggcacaatctctgcggggacgggcaaattcaacacccaacagcttcctta atagcaaaggtagcaacagcccaggatgatataactggtgatggtacgacttctaatgtc ctaatcattggagagctgctgaaacaggcggatctctacatttctgaa