GENSCAN 1.0    Date run: 4-Nov-116    Time: 05:40:20

Sequence gi568815581r:46668323_46918597 : 250275 bp : 47.11% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6092   6291  200  0  2  126   57   188 0.521  18.57
 1.02 Intr +  24581  24746  166  1  1   86   76   255 0.992  23.63
 1.03 Intr +  25525  25599   75  2  0   63   82    41 0.583   0.49
 1.04 Intr +  26153  26340  188  0  2   37   77   139 0.702   7.11
 1.05 Intr +  36437  36532   96  1  0   80   81   101 0.921   8.81
 1.06 Intr +  42641  42797  157  1  1   82  100   223 0.997  22.48
 1.07 Intr +  45531  45664  134  1  2   96   64   142 0.852  12.96
 1.08 Intr +  46957  47002   46  0  1   86   63   -16 0.685  -6.32
 1.09 Term +  49451  49758  308  0  2   -6   45   372 0.580  18.78
 1.10 PlyA +  52132  52137    6                               1.05

 2.02 PlyA -  53246  53241    6                               1.05
 2.01 Sngl -  53811  53260  552  0  0   68   41   521 0.997  39.52
 2.00 Prom -  59539  59500   40                              -6.26

 3.00 Prom +  62468  62507   40                              -6.06
 3.01 Init +  62563  62639   77  0  2   70   99    55 0.292   5.36
 3.02 Intr +  70335  70369   35  0  2  105   89    15 0.169   1.17
 3.03 Intr +  74101  74192   92  2  2   32   92    53 0.178  -0.19
 3.04 Intr +  81451  81585  135  0  0   58   83   206 0.984  17.86
 3.05 Intr +  83181  83294  114  2  0   71  110   118 0.998  12.94
 3.06 Term +  85370  85513  144  1  0  110   38    61 0.944   1.21
 3.07 PlyA +  85670  85675    6                               1.05

 4.14 PlyA -  87154  87149    6                               1.05
 4.13 Term - 100477  99998  480  1  0   92   49   758 0.993  67.10
 4.12 Intr - 101726 101461  266  0  2   94  101   408 0.999  39.93
 4.11 Intr - 105587 105346  242  1  2  115   79   394 0.875  38.39
 4.10 Intr - 114173 114095   79  0  1  127   49    27 0.001   1.31
 4.09 Intr - 132939 132892   48  1  0   68   94    30 0.029   0.25
 4.08 Intr - 136781 136733   49  2  1   47  101    44 0.071  -0.15
 4.07 Intr - 147617 147563   55  1  1  103   51    74 0.298   4.18
 4.06 Intr - 150483 150196  288  2  0   74   83   111 0.235   5.56
 4.05 Intr - 151681 151599   83  1  2  106   37    49 0.134   0.04
 4.04 Intr - 151989 151842  148  2  1  110   47    74 0.199   5.74
 4.03 Intr - 157402 157240  163  2  1   70   67   106 0.609   5.73
 4.02 Intr - 163240 163126  115  1  1  103   92    78 0.988   9.72
 4.01 Init - 174544 174509   36  1  0   73   40    79 0.476   1.61
 4.00 Prom - 176428 176389   40                              -4.16

 5.00 Prom + 181373 181412   40                              -4.96
 5.01 Init + 183317 183393   77  1  2   90   60   285 0.999  24.46
 5.02 Intr + 183722 183877  156  2  0   78   35   123 0.139   5.13
 5.03 Intr + 195612 195848  237  0  0   50   63   149 0.344   5.33
 5.04 Intr + 204195 204451  257  0  2  113   59   209 0.977  17.59
 5.05 Intr + 206779 207044  266  2  2   66  105   353 0.994  32.03
 5.06 Intr + 207923 208226  304  1  1  140   91   280 0.226  29.66
 5.07 Intr + 208946 209153  208  0  1    6  110    90 0.090   1.14
 5.08 Intr + 216202 216296   95  1  2   84   81    58 0.232   4.31
 5.09 Intr + 217571 217641   71  0  2   85   59    57 0.243   1.40
 5.10 Intr + 235214 235352  139  1  1  111   80    77 0.609   9.24
 5.11 Intr + 237086 237182   97  0  1   50   91    34 0.432  -1.03
 5.12 Intr + 238971 239120  150  0  0   50   66    88 0.356   2.08
 5.13 Intr + 239997 240101  105  0  0   53   86    57 0.202   1.33
 5.14 Term + 240295 240337   43  1  1   65   48    68 0.304  -2.87
 5.15 PlyA + 243563 243568    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:46668323_46918597|GENSCAN_predicted_peptide_1|456_aa XCKHVKGILLYGPPGCGKTLLARQIGKMLNAREPKVVNGPEILNKYVGESEANIRKLFAD AEEEQRRLGANSGLHIIIFDEIDAICKQRGSMAGSTGVHDTVVNQLLSKIDGVEQLNNIL VIGMTNRPDLIDEALLRPGRLEVKMEIGLPDEKGRLQILHIHTARMRGHQLLSADVDIKE LAVETKNFSGAELEGLVRAAQSTAMNRHIKASTKVEVDMEKAESLQVTRGDFLASLENDI KPAFGTNQEDYASYIMNGIIKWGDPVTRVLDDGELLVQQTKNSDRTPLVSVLLEGPPHSG KTALAAKIAEESNFPFIKICSPDKMIGFSETAKCQAMKKENLNRITGTSDFPRVGKELEQ CQRQANKVTEITLNNFDKVLEHDGKLTELEQRSDQLLDMSSAFSKTTKTLAQKKCWENIH CQIYLGLVVGGSLLIILIEQLAIFLPQSDTSNAPQT >gi568815581r:46668323_46918597|GENSCAN_predicted_CDS_1|1371_bp ngttgtaaacatgttaaaggcatcctgttatatggacccccaggttgtggtaagactctc ttggctcgacagattggcaagatgttgaatgcaagagagcccaaagtggtcaatgggcca gaaatccttaacaaatatgtgggagaatcagaggctaacattcgcaaactttttgctgat gctgaagaggagcaaaggaggcttggtgctaacagtggtttgcacatcatcatctttgat gaaattgatgccatctgcaagcagagagggagcatggctggtagcacgggagttcatgac actgttgtcaaccagttgctgtccaaaattgatggcgtggagcagctaaacaacatccta gtcattggaatgaccaatagaccagatctgatagatgaggctcttcttagacctggaaga ctggaagttaaaatggagataggcttgccagatgagaaaggccgactacagattcttcac atccacacagcaagaatgagagggcatcagttactctctgctgatgtagacattaaagaa ctggccgtggagaccaagaatttcagtggtgctgaattggagggtctggtgcgagcagcc cagtccactgctatgaatagacacataaaggccagtactaaagtggaagtggacatggag aaagcagaaagcctgcaagtgacgagaggagacttccttgcttctttggagaatgatatc aaaccagcctttggcacaaaccaagaagattatgcaagttacattatgaacggtatcatc aaatggggtgacccagttactcgagttctagatgatggggagctgctggtgcagcagact aagaacagtgaccgcacaccattggtcagcgtgcttctggaaggccctcctcacagtggg aagactgctttagctgcaaaaattgcagaggaatccaacttcccgttcatcaagatctgt tctcctgataaaatgattggcttttctgaaacagccaaatgtcaggccatgaagaaggaa aatctgaatagaataactgggaccagtgatttccctagagtagggaaagagttggagcag tgccagcggcaagcgaacaaggtgacggaaatcacgcttaacaactttgacaaggtcctg gagcatgatggaaagctgaccgaactggagcagcgttcagaccaactcctggatatgagc 
tcagccttcagcaagacaacaaagaccctggcccagaagaagtgctgggagaacatccat tgccagatctacttggggctagtggtgggtggtagcctgctcatcatcctgattgagcag ctggccatctttctccctcagagtgacaccagtaatgccccacagacctag >gi568815581r:46668323_46918597|GENSCAN_predicted_peptide_2|183_aa MARSRTSSSPAISQALLELEMNSDLKAQLRELNITAAKETEVGGGRKAIIIFVPVPQLKS FQKIQVRLVRELEKKFSGKHVVFIAQRRILPKPTRKSRTKNKQKCPRSRTLTAVHDAFLE DLVFPSEIVGKRIPVKLDSSRLIKVHLDKAQQNNVEHKVETFSGVYKKLTGKDVNFEFPE FQL >gi568815581r:46668323_46918597|GENSCAN_predicted_CDS_2|552_bp atggcgagaagccggacgagttcgagtccggccatctcccaggctcttctggagctggag atgaactcggacctcaaggctcagctcagggagctgaatattacggcagccaaggaaact gaagttggtggtggtcggaaagctatcataatctttgttcccgttcctcaactgaaatct ttccagaaaatccaagtccggctagtacgcgaattggagaaaaagttcagtgggaagcat gtcgtctttatcgctcagaggagaattctgcctaagccaactcgaaaaagccgtacaaaa aataagcaaaagtgtcccaggagccgtactctgacagctgtgcacgatgccttccttgag gacttggtcttcccaagcgaaattgtgggcaagagaatccccgtcaaactagatagcagc cggctcataaaggttcatttggacaaagcacagcagaacaatgtggaacacaaggttgaa actttttctggtgtctataagaagctcacgggcaaggatgttaattttgaattcccagag tttcaattgtaa >gi568815581r:46668323_46918597|GENSCAN_predicted_peptide_3|198_aa MAIIKKSGNDKRWCRCGEIGTCMHCCGFAISCYLQNPSSMLALETSVKPNRLKSCPDEGE LARNTVNIGRKLLIIGTTSRKDVLQEMEMLNAFSTTIHVPNIATGEQLLEALELLGNFKD KERTTIAQQVKGKKVWIGIKKLLMLIEMSLQVSDQVNASYACGIESESGALRPDLQRLTF HRPLSLNPFMSNGFRAAE >gi568815581r:46668323_46918597|GENSCAN_predicted_CDS_3|597_bp atggctatcatcaaaaagtcaggtaatgacaagcgctggtgcagatgtggagaaattgga acctgtatgcactgctgtggttttgctatcagctgttacttgcagaatccaagcagtatg ctagccctggaaacatcagtaaagccaaaccgacttaagtcctgtcctgatgaaggagaa ctagcaagaaatacagtaaatataggccgcaagcttcttatcattgggaccactagccgc aaagatgtccttcaggagatggaaatgcttaacgctttcagcaccaccatccacgtgccc aacattgccacaggagagcagctgttggaagctttggagcttttgggcaacttcaaggat aaggaacgcaccacaattgcacagcaagtcaaagggaagaaggtctggataggaatcaag aagttactaatgctgatcgagatgtccctacaggtcagtgatcaagttaatgcttcttat gcatgtgggatagagagtgagagtggggcactcaggcctgatcttcagcgactgacattt cataggcctctgagtttgaaccccttcatgtcaaatggatttcgtgcagctgagtga 
>gi568815581r:46668323_46918597|GENSCAN_predicted_peptide_4|683_aa MNVDIFQCDSEETLKKASDGPRTEKVTQDLAQPFWTTGRQLRFVLHLSLQRNIYVFGPLM NDCPQQMASSLQAKTWAAFAHQCSAAKGQLHTELLHEDALKKWTSPAMGWERSRSHDKPR RLSRPLVPPRPFPRAPCAGSSRVRRGLADQKGQQFPTQRSLLPTGSASFTPDRGCAESWC LRPRALIGCSLTSSNPAAPRWAREGGGCGWRCASDKPESHFQSQVDFVPTIGGVAPPLHG RGQTSSSAPLLMEPHLLGLLLGLLLGGTRVLAGYPIWWCWAQGCVAVNIFADIYLWWVRG LTDFKNEATYLCAWPLPPCTRSSPNPNMEVSVEGGALLGPGTSWASVQYCPGASRSLALG QQYTSLGSQPLLCGSIPGLVPKQLRFCRNYIEIMPSVAEGVKLGIQECQHQFRGRRWNCT TIDDSLAIFGPVLDKATRESAFVHAIASAGVAFAVTRSCAEGTSTICGCDSHHKGPPGEG WKWGGCSEDADFGVLVSREFADARENRPDARSAMNKHNNEAGRTTILDHMHLKCKCHGLS GSCEVKTCWWAQPDFRAIGDFLKDKYDSASEMVVEKHRESRGWVETLRAKYSLFKPPTER DLVYYENSPNFCEPNPETGSFGTRDRTCNVTSHGIDGCDLLCCGRGHNTRTEKRKEKCHC IFHWCCYVSCQECIRIYDVHTCK >gi568815581r:46668323_46918597|GENSCAN_predicted_CDS_4|2052_bp atgaatgtggacatcttccagtgtgattcagaagagaccctgaagaaagcctctgatggg cctagaacagagaaggtgacccaggaccttgcccagcctttctggacaactggaagacag ttacgcttcgtcctccacctctccctccaacgaaacatttacgtatttggtcctctgatg aatgactgtccccagcagatggcaagttccttgcaggccaaaacctgggctgcttttgct catcagtgctcagctgccaaggggcagctgcacacagagttgctccatgaagatgcgttg aagaaatggaccagtccagcaatgggctgggaacgcagcaggagccatgacaagcccagg cggctctcccgacccttggtgcccccgaggccatttccccgcgctccctgtgccggcagc agccgcgtgcggagagggctcgccgaccagaaggggcagcagttccctacacagcggtcc ctgctccccaccggcagtgcttccttcaccccagaccggggctgcgcagagtcctggtgc ctcaggccgcgggcgctgattggctgctcgctgacatcctcaaacccggctgctccgcgc tgggctcgggaggggggcggctgcgggtggaggtgcgcttctgacaagcccgaaagtcat ttccaatctcaagtggactttgttccaactattgggggcgtcgctccccctcttcatggt cgcgggcaaacttcctcctcggcgcctcttctaatggagccccacctgctcgggctgctc ctcggcctcctgctcggtggcaccagggtcctcgctggctacccaatttggtggtgctgg gcccagggctgtgtggctgtgaatatctttgcagacatctacctgtggtgggttcgtggt ctcactgacttcaagaatgaagccacgtacctttgcgcctggcccctgccgccctgcacc cgctcctctcccaaccccaacatggaagtttccgtggagggtggagcccttctgggccca ggaacaagttgggcctctgtccagtactgcccaggagccagcaggtccctggccctgggc cagcagtacacatctctgggctcacagcccctgctctgcggctccatcccaggcctggtc 
cccaagcaactgcgcttctgccgcaattacatcgagatcatgcccagcgtggccgagggc gtgaagctgggcatccaggagtgccagcaccagttccggggccgccgctggaactgcacc accatagatgacagcctggccatctttgggcccgtcctcgacaaagccacccgcgagtcg gccttcgttcacgccatcgcctcggccggcgtggccttcgccgtcacccgctcctgcgcc gagggcacctccaccatttgcggctgtgactcgcatcataaggggccgcctggcgaaggc tggaagtggggcggctgcagcgaggacgctgacttcggcgtgttagtgtccagggagttc gcggatgcgcgcgagaacaggccggacgcgcgctcggccatgaacaagcacaacaacgag gcgggccgcacgactatcctggaccacatgcacctcaaatgcaagtgccacgggctgtcg ggcagctgtgaggtgaagacctgctggtgggcgcagcctgacttccgtgccatcggtgac ttcctcaaggacaagtatgacagcgcctcggagatggtagtagagaagcaccgtgagtcc cgaggctgggtggagaccctccgggccaagtactcgctcttcaagccacccacggagagg gacctggtctactacgagaactcccccaacttttgtgagcccaacccagagacgggttcc tttggcacaagggaccggacttgcaatgtcacctcccacggcatcgatggctgcgatctg ctctgctgtggccggggccacaacacgaggacggagaagcggaaggaaaaatgccactgc atcttccactggtgctgctacgtcagctgccaggagtgtattcgcatctacgacgtgcac acctgcaagtag >gi568815581r:46668323_46918597|GENSCAN_predicted_peptide_5|734_aa MRPPPALALAGLCLLALPAAAASYFGRQSRSLMPSRDPHAGALSAAILASGEPSLPRTPS PPSAFALRSGDREGWVDGWAKGVGHPWGLQENKWNLQHGTSLPRACANNGTVLIIEGVTH KAFPRGLALPGPKPSLSLPVRGKRVIGAQGAFKNLSSLTGREVLTPFPGLGTAAAPAQGG AHLKQCDLLKLSRRQKQLCRREPGLAETLRDAAHLGLLECQFQFRHERWNCSLEGRMGLL KRGFKETAFLYAVSSAALTHTLARACSAGRMERCTCDDSPGLESRQAWQWGVCGDNLKYS TKFLSNFLGSKRGNKDLRARADAHNTHVGIKAVKSGLRTTCKCHGVSGSCAVRTCWKQLS PFRETGQVLKLRYDSAVKVSSATNEALGRLELWAPARQGSLTKGLAPRSGDLVYMEDSPS FCRPSKYSPGTADGEKAQVVEAIRSEKGELSVLEWTRVRSVPWSPVSLIWLFKYASAEAS FLSRELGYAHSPSRAQNYHQCSLMTEAQGLVNEVAALPVLTTATVVVLRRLHTESLAAST TRTKLCRSLNLELLQDEVGLPGTCILIDAVTLTGKQLEGKNHGFHLSGFQYPTPNLAYYR CSTEFVVPFKDDAWNNPLPARVDSSKKTLFHPSAESRGFLWRKKKRKCHALDPPGKERLT VAVSTRGTWGQLTLVLQTQVLGSVGRGVLSPYASGSWALPFQSCKMIAINAACLLEALKI GDVAAIWLPMKEPY >gi568815581r:46668323_46918597|GENSCAN_predicted_CDS_5|2205_bp atgcgccccccgcccgcgctggccctggccgggctctgcctgctggcgctgcccgccgcc gccgcctcctacttcgggcgtcagtcccgctctctgatgccctctcgggatcctcacgcc ggtgccctgtctgccgccatcctggcctccggcgagccgtccttgcctcggactccttcc 
ccaccctccgccttcgccctgcggagcggagaccgagagggctgggtggatggctgggcg aagggtgtgggccatccgtggggcctgcaggagaacaagtggaatctgcagcatgggaca tctctgcctagagcctgtgcaaacaatggcactgtcctcatcattgagggggtcacgcac aaggcattccccagaggcctggcccttccagggcccaagcccagcctgagcctgcctgtg cgtgggaagagggtgatcggagcccagggtgcattcaagaacctgtcaagcctgaccggg cgggaagtcctgacgcccttcccaggattgggcactgcggcagccccggcacagggcggg gcccacctgaagcagtgtgacctgctgaagctgtcccggcggcagaagcagctctgccgg agggagcccggcctggctgagaccctgagggatgctgcgcacctcggcctgcttgagtgc cagtttcagttccggcatgagcgctggaactgtagcctggagggcaggatgggcctgctc aagagaggcttcaaagagacagctttcctgtacgcggtgtcctctgccgccctcacccac accctggcccgggcctgcagcgctgggcgcatggagcgctgcacctgtgatgactctccg gggctggagagccggcaggcctggcagtggggcgtgtgcggtgacaacctcaagtacagc accaagtttctgagcaacttcctggggtccaagagaggaaacaaggacctgcgggcacgg gcagacgcccacaatacccacgtgggcatcaaggctgtgaagagtggcctcaggaccacg tgtaagtgccatggcgtatcaggctcctgtgccgtgcgcacctgctggaagcagctctcc ccgttccgtgagacgggccaggtgctgaaactgcgctatgactcggctgtcaaggtgtcc agtgccaccaatgaggccttgggccgcctagagctgtgggcccctgccaggcagggcagc ctcaccaaaggcctggccccaaggtctggggacctggtgtacatggaggactcacccagc ttctgccggcccagcaagtactcacctggcacagcagatggggagaaagcacaggtggta gaagccatccgttcagagaaaggagagctttctgtgcttgagtggacccgagtgagatct gtgccctggagccctgtgtccttgatttggcttttcaaatatgcctccgctgaggcctca ttcttgtctcgagagctgggttatgcacactcacccagccgtgctcaaaactaccaccag tgcagcctgatgacagaggcccagggtttggtaaatgaagttgctgctctgccagtgctg acaacagccacagtagttgttttgcgcaggcttcatacagaatccctggctgcctcaacc acaagaacaaagctgtgcagaagcctgaacctggagcttctgcaagatgaggtagggctt ccaggtacttgtatcctgattgatgctgtcactctcactggaaaacaattggaaggtaag aaccatggcttccacctctctggattccagtaccctacaccaaatctggcttattatagg tgctcaacagaatttgtagttccctttaaagatgatgcttggaataacccacttccagca agagtggacagcagcaagaagactttgtttcatccttcagctgagtcgcggggcttcctt tggaggaagaagaagcggaaatgtcatgctctggacccacctggtaaagagaggttgacc gtggcagtgagtactcggggcacctggggccagctgacccttgtgcttcagacacaggtg ttgggctcggtgggaagaggggtcctctcaccctatgcctcagggtcctgggcgcttccc 
tttcaaagctgcaaaatgatcgccattaatgcagcttgtctcctggaggcgctgaaaata ggagatgtggccgccatctggctgccaatgaaagaaccatattga