GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:07:32 Sequence gi568815588f:68237222_68442051 : 204830 bp : 42.31% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1879 2001 123 1 0 93 61 36 0.332 0.18 1.02 Intr + 2074 2192 119 1 2 55 110 47 0.627 2.79 1.03 Intr + 4717 4839 123 2 0 87 92 34 0.296 3.34 1.04 Intr + 7811 7926 116 0 2 84 36 86 0.260 2.25 1.05 Intr + 12613 12878 266 0 2 57 80 121 0.146 3.59 1.06 Intr + 17977 18097 121 1 1 53 86 45 0.551 0.38 1.07 Intr + 22893 22928 36 2 0 38 109 70 0.416 1.64 1.08 Intr + 23201 23464 264 1 0 8 41 201 0.482 4.19 1.09 Term + 24033 24257 225 2 0 57 48 379 0.966 26.60 1.10 PlyA + 24418 24423 6 1.05 2.00 Prom + 30611 30650 40 -5.65 2.01 Init + 37394 37556 163 1 1 74 59 106 0.789 6.24 2.02 Intr + 38696 38807 112 0 1 35 38 102 0.873 -1.58 2.03 Term + 40574 41102 529 2 1 78 53 350 0.913 23.24 2.04 PlyA + 42035 42040 6 1.05 3.09 PlyA - 43881 43876 6 1.05 3.08 Term - 47068 46956 113 2 2 102 44 113 0.994 6.04 3.07 Intr - 51440 51262 179 1 2 29 80 191 0.992 11.04 3.06 Intr - 51798 51710 89 0 2 82 110 46 0.999 4.05 3.05 Intr - 54818 54789 30 2 0 113 81 22 0.721 1.41 3.04 Intr - 55017 54908 110 1 2 43 76 121 0.189 5.48 3.03 Intr - 59764 59665 100 1 1 64 33 105 0.191 1.36 3.02 Intr - 69685 69540 146 2 2 33 101 138 0.595 8.68 3.01 Init - 73515 73401 115 0 1 47 59 48 0.330 -1.78 3.00 Prom - 81453 81414 40 -4.55 4.00 Prom + 83281 83320 40 -4.95 4.01 Init + 94195 94424 230 0 2 80 16 318 0.977 20.74 4.02 Intr + 94476 94588 113 0 2 20 76 110 0.669 2.10 4.03 Intr + 95165 95287 123 0 0 98 27 56 0.232 0.14 4.04 Intr + 95548 95652 105 2 0 122 42 80 0.474 6.07 4.05 Intr + 99978 100112 135 1 0 87 77 113 0.924 9.72 4.06 Intr + 100637 100775 139 0 1 42 75 72 0.857 -0.10 4.07 Intr + 101282 101466 185 2 2 30 68 138 0.937 4.51 4.08 Intr + 101919 102005 87 1 0 48 83 141 0.995 8.62 4.09 Intr + 102219 102334 116 1 2 89 82 56 0.984 4.35 4.10 Intr + 103953 104088 136 2 1 27 85 171 0.999 9.92 4.11 Intr + 104364 104459 96 1 0 98 71 104 0.997 8.76 4.12 Intr + 104538 104630 93 1 0 82 115 75 0.996 8.62 4.13 Term + 105937 106106 170 2 2 69 42 104 0.873 0.96 4.14 PlyA + 106755 106760 6 1.05 5.00 Prom + 107701 107740 40 -6.15 5.01 Sngl + 113080 113289 210 0 0 98 42 140 0.927 5.34 5.02 PlyA + 115810 115815 6 1.05 6.13 PlyA - 118769 118764 6 1.05 6.12 Term - 125264 125204 61 1 1 105 43 73 0.255 0.80 6.11 Intr - 139751 139632 120 1 0 10 68 130 0.190 1.89 6.10 Intr - 142300 142203 98 1 2 51 87 113 0.996 5.39 6.09 Intr - 144178 144011 168 1 0 17 110 213 0.998 15.62 6.08 Intr - 146693 146577 117 2 0 71 96 129 0.998 11.74 6.07 Intr - 146931 146830 102 0 0 76 107 52 0.976 5.35 6.06 Intr - 157230 157107 124 2 1 67 108 154 0.990 14.97 6.05 Intr - 159660 159559 102 2 0 96 88 32 0.809 2.37 6.04 Intr - 164516 164399 118 0 1 75 72 128 0.966 8.50 6.03 Intr - 167623 167450 174 2 0 94 64 92 0.891 6.39 6.02 Intr - 170069 169965 105 0 0 28 82 120 0.575 4.67 6.01 Init - 181395 181341 55 0 1 95 61 20 0.049 1.58 6.00 Prom - 183393 183354 40 -2.65 7.00 Prom + 186804 186843 40 -1.95 7.01 Init + 187339 187590 252 0 0 93 61 388 0.951 33.99 7.02 Term + 187648 187776 129 0 0 2 36 217 0.727 5.10 7.03 PlyA + 188561 188566 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 55012 54908 105 1 0 80 76 114 0.808 9.67 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_1|464_aa XSSPNCVRPCPSKKFLLPLCSEWLCFPDRTLTGARDISSRPQIRDFVSWTGITTSICSAL CQATSGIRRPLAYWNSRSNSAAHLSHSLVATPNCHIICHSTSLDSPQQACFATIRGTLNW VPDILGIVPRTQSAGSLEFGCVYRNPGKEGSANREKGNGGRGQRQRSSRTPILLASIAMK LPLMEERRPCGTGQAAHTPSMSGPEKSNPIPSYMKAFLPHGSSLDLAYPSLLGMIHLRNM TEAIAIAVARIVTRTTLEGYSLKCERALGLQSYSLVLNPDLTTLLSDLKKREGAVIIEEL TQHPPGSMGHSMSVSSVGFLSSSFSVGYDGDDGGYSGSCANALARSNGLLVGNEKITMQT LNNHLSGLLPGQGERPGRGHLRAGGEGPWLGFDIELQQLSMKAALEGTLEETRALFGAKL AQIQALVRGVEAQQGDVGADSQQQNQHQWFMDIKSQLEEEITIY >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_1|1395_bp nnctcctctcccaattgtgtaagaccttgtccttctaagaaattccttcttccattatgc tcagagtggctctgctttcctgaccggaccctgactggagcaagggatatttcatccaga cctcagatacgagactttgtatcttggacagggattactacttctatctgttcagccctg tgccaggcaacatcagggataagacggcctttagcctattggaattcacgatccaattct gcagcacatttgtcccattccctagtggctactcccaattgccatataatttgtcactct acgtcacttgattcacctcagcaagcttgctttgctactatccgaggaacactgaattgg gtcccagatattttggggatcgtgccccgaacccagtctgcaggcagcctagaatttggc tgtgtatacaggaacccaggaaaggaaggaagtgccaatagagagaagggaaatggtggc agggggcagaggcagaggtcttcaagaactcccattctcttggcgtccattgcaatgaaa ctacctctgatggaagagaggagaccttgtggtacaggccaggcagcccacactccatca atgtctggccctgaaaaaagcaaccccatccccagctacatgaaagcttttctcccacat ggatcatcacttgatcttgcttatccatctctcttgggaatgatccacctgagaaatatg acagaagccattgccatagctgttgccagaatagtaactagaacaacacttgaggggtat tccttgaagtgtgaaagggcattaggtttgcagtcgtatagcctggttttgaatccagac ctaaccactttactgagtgaccttaagaagagagaaggggctgtgatcatcgaggaactc acacagcatccaccaggatccatgggccacagcatgtcagtatcctctgttggtttcttg tcctcatccttttctgtgggctatgatggggacgatgggggttacagtggcagctgtgct aatgccctggccaggtccaatgggctgctggtaggcaacgagaagatcaccatgcagacc ctcaataaccacctatctggcctcctacctggccaaggtgaacgccctggaagaggtcac ctgagagctgggggtgaaggtccatggctgggttttgacattgagctgcagcagctcagc atgaaagccgccctggaaggcacactggaagaaacaagggccctctttggagccaagctg gcacagatccaggcgctggtcaggggtgttgaagcccagcagggtgacgtgggtgctgac agccagcagcagaaccagcaccagtggttcatggacatcaagtctcagctggaggaggag attactatctactga >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_2|267_aa MASSEDDRSGKVHFQAHLCGCWSKASVHCCVGFFLRLLMMRQLASHRARDSREGVHFADS PCFGSLVTKELSDYNLVSQTDMQMLFGKSENRDLVPHIPAALAMAKRGQGIAQAIASECA SPKPWQLPHGVEPAGAQKSRTEVWEPLPRFQRMYGNAWLSRQKCAAGMEPSWRTSAKAVR KGNVGCKPPQRVPTGALPSGAMRGGPLSSRPHNGRSTNSLHHAPGKSTNTQHQPVKAAIR GAIPCKATGVELPKAVEAHLLHQRGLD >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_2|804_bp atggcatcatctgaggatgaccggagcgggaaggtccacttccaagctcatttatgtggc tgttggtcaaaggcctcagttcattgctgcgtgggcttcttcttaaggctgctcatgatg aggcagctggcttcccacagggcaagggattcaagagagggagtccattttgctgactct ccttgttttggctccctggtcaccaaggagctttctgattacaatttggtttcccaaaca gatatgcaaatgctgtttgggaaatcagagaatagggacttggtgccccacatcccagct gctctagccatggctaaaaggggccaaggtatagctcaggccattgcttcagagtgtgca agtcccaagccttggcagcttccacatggtgttgagcctgcgggagcacagaagtcaaga actgaggtttgggaacctctgcctagatttcagaggatgtatggaaatgcctggttgtcc aggcagaagtgtgctgcagggatggagccctcatggagaacctctgctaaggcagtgcgg aagggaaatgtggggtgcaagcccccacaaagagtccccactggggcactgcctagtgga gctatgagaggagggccactgtcctccagaccccataatggtagatcaaccaacagcttg caccatgcacctggaaaatccacaaacactcaacatcagcctgtgaaagcagccataagg ggggctataccctgcaaagccacaggggtagagctgcccaaggctgtggaagcccatctg ttgcatcagcgtggcctggattga >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_3|293_aa MLSEISQTQKDKYCIISLSEESKKVDTVEVESRMVVVREEDVKQATSNFENLQKQLARKM KLPIFIADAFTARAFRGNPAAVCLLENELDEDMHQKIAREMNLSETAFIRKLHPTDNFAQ KNMNSTLTFVTLSGELRARRAEDGIVLDLPLYPAHPQDFHEVEDLIKTAIGNTLVQDICY SPDTQKLLVRLSDVYNRSFLENLKVNTENLLQVENTGKVKGLILTLKGEPGGQTQAFDFY SRYFAPWVGVAEDPVTAFQCSHRGGELGISLRPDGRVDIRGGAAVVLEGTLTA >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_3|882_bp atgctaagtgaaataagccaaacacagaaagacaaatactgcataatctcactcagtgag gaatctaagaaagttgacactgtagaagtagagagtagaatggtggttgtcagagaagaa gacgtaaagcaggctaccagcaattttgagaacttgcaaaaacagcttgcaaggaaaatg aagcttcctattttcatagcagatgcattcacagcaagagcatttcgtgggaatcctgct gctgtttgcctcctagaaaatgaattggatgaagacatgcatcagaaaattgcaagggag atgaacctctctgaaactgcttttatccgaaaactgcacccgacagacaactttgcacaa aaaaacatgaatagcacgctcacgtttgtcactctgagtggagaactaagggccagacga gcagaggatggcatcgtcctggacttgcctctttatccagcccacccccaggacttccat gaagtagaggacttgataaagactgccataggcaacacactggtccaggacatctgttat tctccagatacccaaaagctcctcgtccgcctcagtgacgtttacaacaggtcgtttctg gagaacctgaaagtgaacacggagaatctgctgcaagttgaaaacacagggaaggtgaaa gggcttattcttacccttaaaggagagcctggtgggcagacccaagcatttgacttttac tcaagatattttgcaccgtgggttggtgtggctgaagacccagtgacagcttttcagtgt tcccaccgaggaggagagctgggaatttcccttcgtccagacggaagggttgacattaga ggaggtgcagctgttgttttagagggcacactgacagcctag >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_4|575_aa MNTPKKPGSRAAHWDEARWRAASNSGPGEGGGHPAREPKSAPVDAFIFAAQKHSYFSPHN ANMVDQKIRPAFAGRTTICSRWGQITGELLQSATARFLLHTGRTAIAIAPRRQNGRAACR WEDLLRWNSVDDAEARSRGHPAFSPFPAGLASAILVSAGLPGLTGLRVTSDADSQDERGW SWGRKANGDEAFKSNGIEMDWVMKHNGPNDASDGTVRLRGLPFGCSKEEIVQFFQGLEIV PNGITLTMDYQGRSTGEAFVQFASKEIAENALGKHKERIGHRYIEIFRSSRSEIKGFYDP PRRLLGQRPGPYDRPIGGRGGYYGAGRGSMYDRMRRGGDGYDGGYGGFDDYGGYNNYGYG NDGFDDRMRDGRGMGGHGYGGAGDASSGFHGGHFVHMRGLPFRATENDIANFFSPLNPIR VHIDIGADGRATGEADVEFVTHEDAVAAMSKDKNNMQHRYIELFLNSTPGGGSGMGGSGM GGYGRDGMDNQGGYGSVGRMGMGNNYSGGYGTPDGLGGYEMLLINVIVGDNSMASVLECF VSVTNSKIKLVEGGTTYDVYDLGEGAGAAYICGAS >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_4|1728_bp atgaatacacccaagaaacccggatcccgcgcggcacattgggacgaagcgcgctggcgg gcggccagcaactctgggccaggggaaggaggcggacacccagcccgagagccgaaatcg gccccagtggacgctttcatcttcgcggcccagaaacactcctatttttcaccgcacaat gcaaacatggtggaccagaaaatccgccccgcgtttgccggtcgaaccacaatttgttca cggtgggggcagatcaccggagaacttctccagagcgctacggcacggttccttctacac actggcagaactgccattgccatcgccccgaggcggcagaatgggagggcggcttgccga tgggaggacttgctcaggtggaattcagtggacgacgccgaggcccgaagtaggggacat ccagcattttcccctttccctgctggcttggcctccgccattttggtctcggcagggcta cctgggctgacgggactgcgagtgacttctgacgcagattcccaagatgagagaggctgg agctgggggaggaaagccaatggcgacgaagcatttaaatcaaacggtattgagatggat tgggttatgaaacataatggtccaaatgacgctagtgatgggacagtacgacttcgtgga ctaccatttggttgcagcaaagaggaaatagttcagttctttcaagggttggaaatcgtg ccaaatgggataacattgacgatggactaccaggggagaagcacaggggaggccttcgtg cagtttgcttcaaaggagatagcagaaaatgctctggggaaacacaaggaaagaataggg cacaggtatattgagatcttcagaagtagcaggagtgaaatcaaaggattttatgatcca ccaagaagattgctgggacagcgaccgggaccatatgatagaccaataggaggaagaggg ggttattatggagctgggcgtggaagtatgtatgacagaatgcgacgaggaggtgatgga tatgatggtggttatggaggttttgatgactatggtggctataataattacggctatggg aatgatggctttgatgacagaatgagagatggaagaggtatgggaggacatggctatggt ggagctggtgatgcaagttcaggttttcatggtggtcatttcgtacatatgagagggttg ccttttcgtgcaactgaaaatgacattgctaatttcttctcaccactaaatccaatacga gttcatattgatattggagctgatggcagagccacaggagaagcagatgtagagtttgtg acacatgaagatgcagtagctgccatgtctaaagataaaaataacatgcaacatcgatat attgaactcttcttgaattctactcctggaggcggctctggcatgggaggttctggaatg ggaggctacggaagagatggaatggataatcagggaggctatggatcagttggaagaatg ggaatggggaacaattacagtggaggatatggtactcctgatggtttgggtggttatgaa atgttattaataaatgtcattgtgggagataatagtatggcgtctgtcctagaatgcttt gtgtcagttactaattctaaaatcaaattggtagaaggtggtactacatacgatgtctat gatttaggagagggagccggggcagcctatatatgtggagcctcttga >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_5|69_aa MLGLIKALDLTSSSQATSRQEQVNKHCKQTFRENPECEKLFRTAVLDPSAGQQHGKEKKV NSDMNPVST >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_5|210_bp atgcttggcctaatcaaggctttagatttaacttccagttcacaggcaacaagtagacag gaacaagttaacaaacactgcaaacaaacattcagagaaaatccagaatgtgaaaaactg tttaggacagctgttctagacccttcagctggtcaacagcatggaaaagaaaaaaaagtc aactcagatatgaatccagtttccacataa >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_6|447_aa MAFSGLGVRGGNLVDGSTAGLAADAVVAAERLEGAAGQAEGASARDRAAAAAMATKDPTA VERANLLNMAKLSIKGLIESALSFGRTLDSDYPPLQQFFVVMEHCLKHGLKVRKSFLSYN KTIWGPLELVEKLYPEAEEIGASVRDLPGLKTPLGRARAWLRLALMQKKMADYLRCLIIQ RDLLSEFYEYHALMMEEEGAVIVGLLVGLNVIDANLCVKGEDLDSQLAIAKNNIIKLQEE NHQLRSENKLILMKTQQHLEVTKVDVETELQTYKHSRQGLDEMYNEARRQLRDESQLRQD VENELAVQVSMKHEIELAMKLLEKDIHEKQDTLIGLRQQLEEVKAINIEMYQKLQGSEDG LKEKNEIIARLEEKTNKITAAMRQLEQRLQQAEKAQMEAEDEDEKYLQECLSKSDSLQKQ ISQKEKQLLCFGVAALRLIEHIARYLE >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_6|1344_bp atggcctttagcgggttgggggtgaggggcgggaacctggtggatgggtcaacagcgggc ctggctgcagatgcggtggtggccgccgagcgcctggaaggagctgctggacaggccgag ggagcctccgcccgagaccgcgcagccgccgccgccatggctacaaaagaccccacagct gtagagagagcaaacttgttaaacatggctaaactgagtatcaaaggactcattgaatct gctctgagctttggccgcactttggattctgactatccccccttgcagcaattctttgtt gttatggaacattgcctgaaacacggtcttaaagtaagaaaatcatttttgagttacaac aaaaccatctggggccctttggaactggtggagaagctgtaccccgaagcagaggaaata ggagctagtgtccgggatctacctggtctgaagacccctctgggtcgagcaagagcgtgg cttcgattagccctcatgcaaaaaaaaatggccgattacttacgttgcttaattattcag agggatctcttgagtgagttttatgagtatcacgcactaatgatggaagaagaaggagca gtaattgttgggctgctggttggcctgaatgtgatcgatgctaatctgtgtgtgaaggga gaggatttagactcacaattagcaatagcaaagaataacatcattaaactccaggaagaa aatcatcaattacgaagtgaaaataaattgattttaatgaaaacacagcagcacctagag gttaccaaagtagatgtggaaactgagcttcaaacatataagcattctcgtcaggggcta gatgaaatgtacaatgaagccagaaggcagcttcgagatgaatctcagttacgacaggat gtagagaatgagctagcagtacaagttagtatgaagcatgagattgaacttgccatgaag ttgctggagaaagatatccatgagaaacaagatactctgataggccttcgacaacaacta gaggaagttaaagcaattaacatagagatgtatcaaaagttgcagggttctgaagatggc ttgaaagaaaaaaatgaaataattgcccgactagaagaaaaaaccaataaaattactgca gccatgaggcagctggaacaaagattgcagcaagcagagaaggcgcaaatggaagctgaa gatgaggatgagaaatatctacaagaatgtctcagtaaatctgatagtctgcagaaacaa atctcccaaaaggagaaacagcttctgtgttttggggtagcagctttgcgcctgattgaa catattgccagatatttggaataa >gi568815588f:68237222_68442051|GENSCAN_predicted_peptide_7|126_aa MKFNPFVNLDRSKNRKRHFHAPLHVHRKIMSSPLSKELRQKYNVRSTPIRKDDEVQVVQG HYKGQQIGKVVQVYRKKDVIYTEQVVITRLNLNKDRKKIIEHKAKSRQVRKEKGKYKEEL IEKMQE >gi568815588f:68237222_68442051|GENSCAN_predicted_CDS_7|381_bp atgaagttcaatcccttcgtgaacttggaccgcagcaaaaaccgcaaacgtcacttccat gcccccttgcacgtgcaccggaagatcatgtcatccccgctctccaaggagctgcggcag aagtacaatgtccgctccacacccatccgcaaggacgacgaggtccaggtagttcaagga cactacaaaggtcagcaaattggcaaggtagtccaggtgtacagaaagaaagatgtcatc tacactgagcaggtggttatcaccaggctaaatctcaacaaggatcggaaaaaaattatt gaacacaaagccaagtctcgacaagtcagaaaagagaaaggcaaatataaggaggaactt attgagaaaatgcaggaataa