GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:28:39 Sequence gi568815592r:130044523_130315273 : 270751 bp : 38.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4760 4871 112 1 1 67 116 123 0.995 12.03 1.02 Intr + 6727 6886 160 2 1 90 47 123 0.994 6.52 1.03 Intr + 8337 8469 133 0 1 8 101 269 0.992 19.93 1.04 Intr + 10163 10249 87 1 0 62 55 76 0.658 0.95 1.05 Intr + 10649 10733 85 1 1 102 84 61 0.995 5.57 1.06 Intr + 12695 12811 117 0 0 98 67 29 0.682 1.32 1.07 Intr + 12884 12975 92 0 2 106 80 83 0.835 8.09 1.08 Intr + 15514 15618 105 0 0 73 99 81 0.811 7.19 1.09 Intr + 17129 17235 107 1 2 16 69 96 0.085 -1.41 1.10 Intr + 21372 21489 118 0 1 52 58 85 0.246 1.45 1.11 Intr + 21831 21966 136 2 1 102 6 144 0.767 6.82 1.12 Intr + 23808 23899 92 1 2 63 92 113 0.999 7.99 1.13 Intr + 26454 26605 152 2 2 78 77 162 0.895 12.14 1.14 Intr + 34036 34112 77 1 2 88 89 74 0.970 5.74 1.15 Intr + 39098 39183 86 0 2 81 83 41 0.981 1.52 1.16 Intr + 41618 41728 111 1 0 80 106 100 0.979 10.66 1.17 Intr + 54178 54256 79 0 1 73 61 54 0.123 -0.59 1.18 Intr + 56879 57040 162 0 0 19 71 133 0.085 3.73 1.19 Intr + 59927 60053 127 0 1 58 89 62 0.064 2.12 1.20 Intr + 68149 68264 116 1 2 63 59 104 0.011 4.17 1.21 Intr + 76359 76436 78 1 0 25 53 118 0.037 0.60 1.22 Intr + 88930 89171 242 2 2 56 44 209 0.564 9.65 1.23 Intr + 92764 92835 72 0 0 102 80 34 0.829 2.68 1.24 Intr + 93138 93290 153 2 0 49 52 98 0.538 1.65 1.25 Term + 95088 95231 144 2 0 82 49 72 0.636 -0.37 1.26 PlyA + 95943 95948 6 1.05 2.09 PlyA - 96586 96581 6 1.05 2.08 Term - 100282 99998 285 1 0 89 38 159 0.948 5.42 2.07 Intr - 110503 110303 201 1 0 46 70 168 0.513 9.36 2.06 Intr - 124185 124099 87 0 0 51 87 69 0.302 2.35 2.05 Intr - 128684 128488 197 0 2 -39 7 209 0.070 -1.69 2.04 Intr - 128986 128809 178 1 1 48 71 95 0.892 2.37 2.03 Intr - 131486 131319 168 2 0 78 69 185 0.990 14.82 2.02 Intr - 139665 139581 85 2 1 62 88 124 0.968 8.70 2.01 Init - 139977 139916 62 0 2 74 74 43 0.265 2.27 2.00 Prom - 140222 140183 40 -9.15 3.00 Prom + 140751 140790 40 -4.35 3.01 Init + 143715 143792 78 2 0 94 50 89 0.133 6.81 3.02 Intr + 149486 149663 178 1 1 125 75 12 0.422 2.17 3.03 Term + 150922 151301 380 2 2 62 39 224 0.398 8.87 3.04 PlyA + 153638 153643 6 1.05 4.07 PlyA - 153864 153859 6 1.05 4.06 Term - 159231 159121 111 0 0 109 47 67 0.074 2.28 4.05 Intr - 163836 163605 232 2 1 41 86 89 0.039 0.85 4.04 Intr - 165086 164973 114 1 0 78 59 57 0.026 0.44 4.03 Intr - 170004 169815 190 1 1 59 70 213 0.092 14.32 4.02 Intr - 170772 170673 100 0 1 79 113 43 0.558 4.66 4.01 Init - 173915 173730 186 2 0 -1 37 192 0.709 4.30 4.00 Prom - 176852 176813 40 -8.45 5.03 PlyA - 177112 177107 6 1.05 5.02 Term - 179708 179486 223 1 1 -3 41 251 0.490 6.71 5.01 Init - 187686 187613 74 0 2 120 48 44 0.767 4.19 5.00 Prom - 193109 193070 40 -5.15 6.00 Prom + 193834 193873 40 -5.45 6.01 Init + 198616 198684 69 0 0 69 99 0 0.225 0.30 6.02 Intr + 204091 204226 136 0 1 100 37 130 0.261 8.32 6.03 Intr + 212660 212760 101 0 2 96 25 66 0.031 0.01 6.04 Intr + 221564 221617 54 1 0 85 38 100 0.012 2.96 6.05 Intr + 231540 231667 128 2 2 42 78 99 0.164 2.86 6.06 Intr + 251691 251853 163 0 1 34 73 119 0.250 4.06 6.07 Term + 252434 252574 141 1 0 51 42 94 0.427 -1.95 6.08 PlyA + 252855 252860 6 1.05 7.03 PlyA - 255756 255751 6 1.05 7.02 Term - 266938 266822 117 1 0 80 32 98 0.335 0.96 7.01 Init - 267208 267023 186 1 0 97 66 81 0.703 5.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 162994 163284 291 0 0 18 54 271 0.853 11.06 S.002 Term - 170004 169811 194 1 2 59 48 234 0.907 13.10 S.003 Term + 221564 221635 72 1 0 85 55 135 0.923 6.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_1|980_aa FRVNEFGALEVITDENEMENVKKATATTTWMVPTAQEVFSEKTGMPFRLKDPVKVEGLQF CENCCQYGNVDECLSGGNYCSQNCARHIKDKDQKEERDVEEDNEEEDPKCSRKKKPKLSL KADTKEDGEERDDEMVKIPSRHLSTATWSKEEEAEGVDWEYLAYENKQDVRILRGSQRAR RKRRGDSAVLKQGANHSSLFILVLGKTSQEERQRYQSWKSEKSQAAYQSGTCLPPKGKKA WCWASYLEEEKAVAVPAKLFKEHQSFPYNKNGFKVGMKLEGVDPEHQSVYCVLTVAEFLL DPFELEEEGQSFLVDDLCLSFEVDDIAQQERKAFGGEKKLDFCAYAPAVVCSLQCRADCP LWGHARAECAVLVCGYRIKLHFDGYSDCYDFWVNADALDIHPVGWCEKTGHKLHPPKGYK EEEFNWQTYLKTCKAQAAPKSLFENQNITVIPSGFRVGMKLEAVDKKNPSFICVATVTDM VDNRFLVHFDNWDESYDYWCEASSPHIHPVGWCKEHRRTLITPPGYPNVKHFSWDKYLEE TNSLPAPARAFKVKPPHGFQKKMKLEVVDKRNPMFIRVATVADTDDHRVKMRRQRIEMLK EAPEIVHWRNQNSTQAFKVMLGLTPPTREPQGTEASLSQAFGSFIVGRNPTPTNQMLPSG LSILRNEVKAEINLNKDRIFPDRLSGEMPPASPSFPRNKRTDANESSSSPEIRIQGILIG KLCPILRRSHPESTWYAECLDLFSYRDLSNGYQHADDVKEDFEERTESEMRTSHEARGAR EEPTVQQAQRRSAVFLSFKSPIPCLPLRWEQQSKLLPTVAGIPASKVSKWSTDEVYFIFF AARHQIQDYWLKRCGTLSAISKTPNVWNNGIVYALQKLTEKKFQKETCCLRQAFSRAKVV INQSAFVVGSLGMSVDIFINDLEDFYKAVHCTKQIDGEAFLLMTQTDIVKIMSIKLGPAL KIFNSILMFKAAEKNSHNEL >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_1|2943_bp tttcgggtaaatgagtttggagccctggaagttattacagatgagaatgagatggaaaat gttaaaaaagcaactgctaccaccacttggatggtaccaactgctcaagaagtcttttca gagaagactgggatgcctttcaggttgaaggatccagtgaaagtagaagggcttcagttc tgtgagaactgttgtcagtatggcaacgtagatgagtgtctttctggaggaaactattgc agccagaattgtgctcggcacatcaaagataaagatcagaaggaagaaagggacgtagaa gaagacaatgaggaagaagatcctaagtgtagtcggaagaaaaaaccaaaattatctctg aaagctgacaccaaggaggatggagaagagagagatgatgaaatggtgaagatcccaagt aggcatttgtctacagccacatggagtaaggaagaggaagctgagggtgtggattgggag tatctggcctatgagaacaaacaagatgtaagaatcctgaggggttcgcagagagcacgg aggaaaagacgaggggattcggctgtactaaagcagggtgcaaaccacagttcacttttt atcttggttcttgggaagacgagccaggaagagagacagagataccagtcctggaagtca gagaagagccaggcagcgtatcagtcagggacgtgtttgcctcctaaaggaaagaaagcg tggtgctgggcatcctacctggaagaggagaaagcggtggcagtgccggcgaagctgttc aaggagcatcaatcctttccatataacaaaaatggattcaaagttggcatgaaattagaa ggcgtggatcctgagcatcagtctgtgtactgtgtcctcaccgtcgcggagtttttgtta gacccttttgagttggaagaggagggtcagagtttcttggtggatgatttgtgtttgtcc tttgaagtggatgatattgcccagcaagagcgcaaggcctttggtggagaaaaaaagctt gacttctgtgcatatgcaccagctgtggtctgcagccttcagtgcagggcagactgtcca ctctggggccatgctagagctgagtgtgctgtgctggtttgtggataccggataaagctt cactttgatgggtattctgattgctatgacttctgggtgaatgcagacgctctggatatc cacccagttgggtggtgtgagaaaaccggccacaaactccatcctccaaaagggtataaa gaagaagaattcaattggcagacctatcttaagacatgtaaagctcaagctgctcctaag tcattatttgaaaatcagaatataacagtgatcccatcgggctttcgagttggtatgaag cttgaggcagtagacaaaaagaatccctcattcatctgtgttgctacggtaacagatatg gtggacaatcgtttcctggtacattttgacaactgggatgagagctatgactattggtgt gaagcatcaagtccacatattcatccagttggttggtgtaaggaacatagaagaaccctt attactccaccaggttatccaaatgtgaaacatttttcttgggataaatacttagaagaa accaattctttacctgctcctgcaagagctttcaaagtgaaacctcctcatggattccag aaaaaaatgaagcttgaggttgtagacaaaaggaaccctatgtttattagagtagcaact gtggcagacacagatgatcaccgggtaaaaatgaggagacagagaattgaaatgcttaag gaagcacctgagattgtacattggcggaatcagaattcaacccaggctttcaaggtgatg ttaggtctgacacctcctactagggagccacaagggaccgaggcttccctttctcaggcc tttggcagtttcattgtgggcaggaatcccacacccaccaatcagatgctgccatccggg ctttccatcttgaggaatgaggtgaaggcagaaatcaatttgaataaagaccgtattttt ccagaccgcttaagtggtgagatgcctccggctagtccgtcatttccaagaaataaaagg acagatgcaaatgaaagctcttcttcccctgaaatcaggattcagggcatcctcatcggg aagctctgccccatattaagacggtcccaccctgaatccacctggtatgccgaatgcttg gatctcttttcctacagagacttgtcaaatgggtaccagcatgctgatgatgtcaaagaa gactttgaagagagaacagaaagtgaaatgagaacatcacatgaagccagaggtgcccgg gaagaacccaccgtccagcaggcacagcgtcggtcagctgtctttctgtcctttaagtcc ccaattccatgtctgcccttgcgctgggagcagcaaagcaaacttcttccaactgtcgca ggaatccctgccagtaaagtttccaaatggagcacagacgaggtatattttattttcttt gctgcccgacaccagatacaggattactggcttaagaggtgtggaacattgagcgccatt agcaagacacctaatgtctggaataatggcattgtatatgcattgcagaagctaaccgaa aaaaaatttcagaaggagacttgctgtctgcgccaggccttcagtagggccaaggttgta ataaaccagtcagcctttgtagtgggctccttgggaatgtcagttgacattttcattaat gacttagaagacttctataaggctgtccattgtactaagcaaattgatggagaagcattt ctacttatgactcaaacagacattgttaaaattatgagcattaaactgggccctgctctc aaaattttcaattccatcctgatgttcaaagctgcagagaagaattctcacaatgaactt tga >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_2|420_aa MRIRIIEFLQADMTKYLEGSLYPSTQQYNDVVNALLQAHPFLDEDGCGFFLWKRALKDRF KYVRRPIEDDEQVIRNKCKFGHRRGQTRKSLADIRFDEIKLVQIKPQLVIPRRIASGVDL QQTVADLQKRDLLEEKLTNKKQHHQCQHKGLPHKNPIQRSSKIKENEIDDGFRRWVITNS SELKEHVLTQSKEAKNLDKRLQELLTRITSLERNINDLMELKNTGRELRQKKIISYAAEI ENVDAGKTQESGHSLKGKKEEAVCFDSELDEHIKWFQQEYVKTEKDWREIDKRMSQTLEI RRKMIGSRTPLKDILKLFPFLKCPYQVQVSTPVLEVKNPFNMEVCEFSLYLERERLTKVD DCVTALAALVAAFHVFRIECPRRLSQTFNFLETLIFDMHSPYFPSLKEKENEVGFQHPLT >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_2|1263_bp atgaggataaggatcattgagtttctccaggccgacatgactaagtatctggaaggctca ctgtaccccagcacccagcagtacaatgacgtggttaatgccctgctgcaggcccaccct ttcctggatgaggatggctgtggcttctttttatggaaacgagccctcaaagatcgcttt aaatatgttcgaagacccatagaagatgatgagcaagtgattagaaataagtgtaaattt ggacaccgaagaggccagacaaggaaatctcttgctgatataagatttgatgaaattaaa cttgtccagataaaacctcaactggtaatacccagacgaatagcatctggagtggacctc cagcaaactgtagcagacctgcagaagagggatctgttagaagaaaaactaacaaacaaa aagcaacatcatcaatgtcaacataaaggactcccacacaaaaatcccatccaaaggtca tcaaagatcaaagaaaatgagattgatgacggcttcagaaggtgggtaataacaaactcc tctgagctaaaggagcatgttctaacccaaagcaaggaagctaagaaccttgataaaagg ttacaggaactgttaactagaataaccagtttagagaggaacataaatgacctgatggag ctgaaaaacacaggacgagaacttcgtcaaaaaaaaattatttcctatgctgcagagatc gaaaatgttgatgcaggcaagacacaggagtctggccattcactcaaaggcaagaaagaa gaagctgtttgttttgattctgagctagatgaacatattaagtggttccagcaagaatac gtgaaaacagaaaaggactggagagaaattgacaagagaatgagccaaactttggaaata agaagaaagatgattggcagccgaacacctctgaaggacattcttaaactgtttcctttc ttgaagtgcccttatcaggtgcaagtgtccacacctgtgttggaagttaaaaaccctttc aacatggaggtctgcgaattttctttatatttagaaagggagaggctcacaaaggtggac gactgtgttacagccttggctgcgctagtagctgcctttcatgtatttaggattgagtgt ccaagaagactgtcccaaactttcaacttcctagaaacgctgattttcgatatgcacagt ccttattttccttctttgaaagaaaaggaaaacgaagtaggatttcagcacccactcact taa >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_3|211_aa MALLRTGSGEHQLSKEQSESTEEQQLAASRQVELGPNSSSASAPPLYNLFITTPPHTQSG LQFHSATSPPPPAQQFPLKKVAGAKEQDKPYKLVQDLRLINQIVLPIHPMVPNPYTLLSS IPPSTTHYSVLDLKHAFFTIPLHPSSQPLFAFTWTDPDTHQAQQITWAVLLQGFTDIPHY FSQAQISSSSVTCLSVILIKTHVLSLPIMSD >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_3|636_bp atggctctgttaagaacgggctcaggtgagcaccagctctctaaggaacaatctgaaagc acagaagaacagcagttggctgcttctcgccaggtcgagctaggtcccaattcttcctca gcctccgctcctccactctataatctttttatcaccacccctcctcacacccagtctggc ttacagtttcattctgcgactagccctcccccacctgcccagcaatttcctcttaaaaag gtggctggagctaaagaacaagacaagccttacaagttagttcaggatctgcgccttatc aaccaaattgttttgcctatccaccccatggtgccgaacccatatactctcctatcctca atacctccctctacaacccattattctgttctagatctcaaacatgctttctttactatt cctttgcacccttcatcccagcctctcttcgctttcacttggactgaccctgacacccat caggctcagcaaattacctgggctgtactgctgcaaggcttcacagacatccctcattac ttcagtcaagctcaaatttcttcctcatctgttacctgtctcagcgtaattcttataaaa acacacgtgctctccctgccgatcatgtctgactga >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_4|310_aa MRCYVPLEHHAAIETPEYKQVPGEPPTACALREWGGATEVHTVCSEEELALLILVWELGI QSVYLQETAMETWSVEQVCSWLVEKNLGELVHRFQEEEVSGAALLALNDRMVQQLVKKIG HQAVLMDLIKKYKQNTQGLKSPENPKKAALVMQTEAARDYRDEESSSPARHGEQMPSFYP AENLDNGLIDQRVLKQSISRPSHPGNPCIHNSYYILSPVTNKQMQCLKRSSLKRETVTLH TKNNRVIAKQGCGPSVFTTPLHYQLILPTEQYCKAKSSGLGKYETNQSYSSVQSLTQVIN NVHCMKEQMY >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_4|933_bp atgagatgttatgttccattggaacaccatgctgctattgagacacctgagtataaacag gtccccggagagcctccgacagcctgcgcactgagagaatggggtggagccacggaagtt cacacggtttgcagcgaggaggaactggcccttcttattctggtgtgggaacttggaatt caatctgtgtatttacaggagacagcaatggaaacctggtcagttgagcaggtctgcagt tggttggtggagaaaaatttaggagagctagttcatagatttcaagaggaagaagtaagt ggggccgctcttcttgcacttaatgatcggatggttcagcaactggtaaagaaaattggg caccaggctgttctgatggatttaattaaaaaatacaagcagaacactcaaggactgaag tccccagaaaaccccaaaaaggcagccctggtcatgcaaacagaagcagctcgagattac agggatgaagagtcctccagtccagccaggcatggggagcagatgccatctttctatcca gctgaaaaccttgataatggactaattgaccaaagagtattgaaacagagcatttcacgg ccgtcccacccaggcaatccgtgcatccataactcatattacatattgtctccagtcact aacaagcaaatgcaatgccttaaacgtagttcattgaaaagagagactgtcaccttgcac acaaagaataatcgggtaatagccaagcaaggatgtggtccttcagttttcacaacccct ttacactatcagctcattttgcccacagagcagtactgcaaggccaagagctctgggttg gggaagtatgagacgaaccagtcttattcatctgtgcagtccctgacacaggtgatcaac aatgttcattgcatgaaggaacagatgtattaa >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_5|98_aa MASSCNGIPQTVDKKAFPTLYWLGLHKRLDIKRNTLAEYTDRLAGHQQQYDVDHVEFAWG WLEESQATERPDSRGRPPSHSIPLVDPIHLLRATSTSQ >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_5|297_bp atggcctctagttgcaatggaattccccagacagtggacaaaaaggcctttcctacactg tattggctaggcctacacaagcggctggacatcaagaggaacacactggcagaatacact gacagactggcaggacatcaacagcagtatgatgtggaccacgtagaatttgcctggggg tggttggaggagagtcaggccactgagcggcctgactccaggggaagaccgccttcccat tccattccccttgtggatcccatccatctgctgagagctacttccaccagtcaataa >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_6|263_aa MTQRGQVLVNYVSTGISNYFSVLVLARPQEKEKSKEDAPSMAVSGTGGISADRNHGQNSA VIRQALLEGQRNKGAICKTERRPSSETELASTLILDFPAFRVKKGERVTLEIDGAEMDLQ GPYNRETGGSERRQKQELELRTLKIEGGAINQRKYVAFRREKKKIKVTGEWTSSEWKAER REPGSVGVPMGRSWGTKKESSKSLVKVNPGNSESMERCTPTQSLPTTHPSGQVLPLIISP PDDKPAFTLKHHLLDCSLNCITK >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_6|792_bp atgacccaacggggacaagtgcttgtcaattatgtctctactggaatctctaactacttt tctgtcctggtccttgcaaggcctcaggagaaagaaaaaagcaaggaagacgctccaagc atggcagtatctggcactgggggaattagtgctgaccgaaaccatgggcagaactctgca gtaataaggcaggctctccttgagggacagaggaataagggagccatctgcaagacagaa agaaggccctcatcagaaactgaattggccagcaccttgatcttggacttcccagccttc agagttaaaaaaggggaacgtgtaaccctagaaattgatggagctgaaatggatcttcaa ggtccttataacagggaaacaggaggttcagagagaagacagaaacaggagttggaatta cgtactttgaaaatagaaggaggggccataaaccaaagaaagtatgtggcctttagaaga gagaaaaaaaagatcaaagttactggtgaatggacaagttctgaatggaaagctgagaga agagaaccaggatctgttggagtgcctatgggaaggagctggggtacaaaaaaggaaagc agcaagagtctggtaaaggtcaacccaggaaactcagagtccatggaaaggtgcacccca acacaatctcttcctaccacccatccttcagggcaggtgcttccactcatcatcagtcca cctgacgacaagccagcttttacacttaagcaccatctattggattgcagcctgaactgc atcaccaaataa >gi568815592r:130044523_130315273|GENSCAN_predicted_peptide_7|100_aa MAVIWSQKTLISFALVNVDRGDHLSAATSTLLHWPLKLVQLLGLKESSSLADVFTNFFST LQKTSEYCSMGTQNHESVCNSDQGAVGWRNPSLPVPSGGQ >gi568815592r:130044523_130315273|GENSCAN_predicted_CDS_7|303_bp atggctgtcatctggtcccagaagactcttatatcctttgccctggtaaatgtggacaga ggagaccatctcagcgcagccacctccactctgctgcactggcctcttaaacttgttcaa ctactgggcctaaaagaaagcagtagtctggcagatgtcttcacaaattttttttcaaca cttcagaaaacttcggaatactgctctatgggcacacagaaccatgagagtgtctgcaac agcgaccagggagcagtaggctggaggaacccctccttgcctgtgccttcaggagggcaa taa