GENSCAN 1.0    Date run: 2-Nov-116    Time: 23:58:40

Sequence gi568815593f:10150341_10364780 : 214440 bp : 43.42% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  14277  14364   88  2  1   92   94    45 0.662   6.30
 1.02 Intr +  16959  17009   51  1  0   52   81    60 0.341   0.58
 1.03 Intr +  17475  17507   33  1  0   80   93    22 0.283   0.09
 1.04 Intr +  28260  28503  244  1  1   -8   85   144 0.013   1.06
 1.05 Term +  37510  37648  139  1  1   49   33   169 0.888   4.94
 1.06 PlyA +  38588  38593    6                               1.05

 2.03 PlyA -  39285  39280    6                               1.05
 2.02 Term -  40046  39954   93  2  0  140   49    20 0.725   1.13
 2.01 Init -  44537  44487   51  2  0   98   65    43 0.701   4.16
 2.00 Prom -  47003  46964   40                              -3.06

 3.00 Prom +  56060  56099   40                              -2.96
 3.01 Init +  61108  61219  112  0  1   54   94    96 0.780   7.17
 3.02 Term +  64960  65036   77  2  2   49   49    65 0.224  -3.30
 3.03 PlyA +  65528  65533    6                               1.05

 4.05 PlyA -  66121  66116    6                               1.05
 4.04 Term -  77307  77101  207  0  0  100   37    89 0.378   2.34
 4.03 Intr -  86275  86138  138  1  0  123   95    26 0.834   7.46
 4.02 Intr -  89016  88727  290  1  2   92   68   222 0.927  17.56
 4.01 Init -  92030  91889  142  2  1   50   33    67 0.586  -2.30
 4.00 Prom -  92666  92627   40                              -2.16

 5.00 Prom +  97392  97431   40                              -6.96
 5.01 Init + 100001 100105  105  1  0  108   90   159 0.775  18.32
 5.02 Intr + 100415 100553  139  1  1   59   70    37 0.659  -0.96
 5.03 Intr + 104334 104498  165  1  0   54   91   228 0.887  19.63
 5.04 Intr + 105615 105813  199  1  1   92   46   245 0.993  19.11
 5.05 Intr + 107771 107963  193  2  1   71   61   203 0.992  15.39
 5.06 Intr + 108046 108195  150  0  0  105  108   157 0.999  19.76
 5.07 Intr + 110452 110571  120  0  0   87   75   103 0.933   9.59
 5.08 Intr + 111220 111405  186  0  0   26   81   141 0.991   7.09
 5.09 Intr + 112141 112278  138  0  0   27  116   145 0.999  11.86
 5.10 Intr + 112794 112974  181  2  1   81   77   217 0.961  19.34
 5.11 Intr + 114316 114417  102  2  0   67   62    58 0.244   1.25
 5.12 Term + 116656 116678   23  2  2  126   55    15 0.322   0.47
 5.13 PlyA + 117441 117446    6                               1.05

 6.00 Prom + 117868 117907   40                              -6.96
 6.01 Init + 119214 119570  357  2  0   58   86   189 0.914  12.81
 6.02 Term + 122488 122601  114  0  0   81   54    36 0.304  -1.93
 6.03 PlyA + 125178 125183    6                               1.05

 7.06 PlyA - 125259 125254    6                               1.05
 7.05 Term - 130292 130113  180  2  0   61   43   113 0.752   1.71
 7.04 Intr - 131948 131857   92  0  2   99  105    55 0.994   8.01
 7.03 Intr - 136156 136014  143  0  2  115   54    25 0.977   1.80
 7.02 Intr - 138189 138082  108  2  0   71   95    66 0.962   5.00
 7.01 Init - 140422 140208  215  1  2   65   60   180 0.984  11.12
 7.00 Prom - 145285 145246   40                              -3.46

 8.00 Prom + 152934 152973   40                              -3.36
 8.01 Init + 169044 169222  179  2  2   62   77    94 0.203   4.43
 8.02 Term + 177954 177981   28  0  1  128   44    36 0.155   0.75
 8.03 PlyA + 177985 177990    6                               1.05

 9.03 PlyA - 178728 178723    6                               1.05
 9.02 Term - 181441 181228  214  0  1   56   41   166 0.235   5.40
 9.01 Intr - 188637 188577   61  1  1  114   42    46 0.029   0.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  28358  28503  146  1  2   77   85    78 0.822   5.99

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_1|184_aa
MNLEHYNSFGFVFAFFQRIKAKNDQELQEVVEAGLSKIKALADSVSVGINPTTSKVPERN
SININKKDIHSETPSKGHQHQRPKVDKSMKMGRKQHKKAENCQNQNASSPAKDHNSSPAR
EQNCRENEFDELTEVGFRRAVTLTVKICGFILELSKTTNPPERTISGHTGKTVCFYAKSD
ERSG
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_1|555_bp
atgaacctggaacattataattcttttggatttgtttttgcctttttccaaagaataaag
gctaagaatgaccaggaacttcaagaagttgtggaggctgggctgtccaagatcaaggcc
ctggcagattcggtgtctgtgggaatcaaccccaccacctcaaaggtgccagaaaggaac
agtatcaacatcaacaaaaaggacatccactcagagaccccatccaaaggtcaccaacat
caaagaccaaaggtagataagtccatgaagatggggagaaagcagcacaaaaaggctgaa
aattgccaaaaccagaacgcctcttctcctgcaaaggatcacaactcctcgccagcaagg
gaacaaaactgcagggagaatgagtttgatgaattgacagaagtaggcttcagaagagct
gtgacactcaccgtgaagatctgcggctttattcttgaactcagcaagaccacgaaccca
ccggaaagaaccatctctggacacacgggaaaaaccgtgtgtttctatgctaaatctgat
gaaaggagtggatag
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_2|47_aa
MNHKPGTRAAINSCKIEGLSLEDDNSEPHWVLWGESSGQGALCFSIE
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_2|144_bp
atgaaccacaaaccaggaacaagggcagccatcaacagttgtaaaatcgagggcctctcc
ctggaggacgataactcagaaccacactgggtgctgtggggagagagttctgggcaaggt
gctctttgcttctctattgaatga
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_3|62_aa
MKNETLIRNQYIRTEGRKLRKPSADEEAQQLELSYTAGGPHQRLCLALGATFQKDDLQPS
FC
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_3|189_bp
atgaaaaatgagaccctaataagaaaccagtacatcaggacagaagggagaaaactaaga
aaaccaagtgctgatgaggaagcacagcaactggagctgtcgtacactgcaggtgggcca
caccagcggctgtgtctggctctaggagccacattccagaaagacgatttacaacctagc
ttctgttaa
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_4|258_aa
MESTTGEDAVNIVEMMTKNLEYFINLVDKVVAEFEKIDSKFGRSSTVGIPLETLKEESQS
RHVLPASFEVNSLQKSNWGFLLTGLVGGTLVAVYAVATPFVTPALRKVCLPFVPATTKQI
ENVVKMLRCRRGSLVDIGSGDGRIVIAAAKKGFTAVGYELNPWLVWYSRYRAWREGVHGS
AKFYISDLWKMLQLEKKLERELEDDARVIACRFPFPHWTPDHVTGEGIDTVWAYDASTFR
GREKRPCTSMHFQLPIQA
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_4|777_bp
atggaatctactactggtgaggatgctgtaaacattgttgaaatgatgacaaagaactta
gaatattttataaacttagttgataaagtagtggcagagtttgagaaaattgactccaaa
tttggaagaagttccactgtgggtatacccctagaaacacttaaagaagaaagtcagtca
agacatgttctacctgcaagttttgaagtcaacagtttgcagaaaagcaactgggggttc
ttacttactgggcttgtgggtggcaccctggtggctgtgtacgctgtagccacgccgttt
gtaacgccagcccttcgaaaagtctgtttgccgtttgtacctgcaactacgaagcagatt
gaaaatgttgtgaaaatgttgcgatgccgaagaggatcccttgtggacatcggtagtggg
gacggacgcattgtcatagcggctgcgaagaaagggttcacagcagttggttatgaatta
aacccatggctagtttggtattccagataccgcgcttggcgagaaggtgtgcatggatct
gccaaattttatatttcagatttgtggaagatgctgcagttggagaagaaacttgaacgt
gaacttgaggatgatgcacgagttattgcttgccggttccctttcccacattggactcca
gaccacgtcacgggggaggggatagacacagtgtgggcatatgatgcaagcacttttaga
ggccgtgaaaagaggccctgtacatcgatgcatttccagctgcccattcaagcataa
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_5|566_aa
MASMGTLAFDEYGRPFLIIKDQDRKSRLMGLEALKPDGRVGLRSSGRAPGRCSVTCRLRR
CCLTFRWSTRNGPSVFFVTRKGLDKMMVDKDGDVTVTNDGATILSMMDVDHQIAKLMVEL
SKSQDDEIGDGTTGVVVLAGALLEEAEQLLDRGIHPIRIADGYEQAARVAIEHLDKISDS
VLVDIKDTEPLIQTAKTTLGSKVVNSCHRQMAEIAVNAVLTVADMERRDVDFELIKVEGK
VGGRLEDTKLIKGVIVDKDFSHPQMPKKVEDAKIAILTCPFEPPKPKTKHKLDVTSVEDY
KALQKYEKEKFEEMIQQIKETGANLAICQWGFDDEANHLLLQNNLPAVRWVGGPEIELIA
IATGGRIVPRFSELTAEKLGFAGLVQEISFGTTKDKMLVIEQCKNSRAVTIFIRGGNKMI
IEEAKRSLHDALCVIRNLIRDNRVVYGGGAAEISCALAVSQEADKCPTLEQYAMRAFADA
LEVIPMALSENSGMNPIQTMTEVRARQVKEMNPALGIDCLHKGTNDMKQQHVIETLIGKK
QQISLATQMVRMILKIDDIRETAPTD
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_5|1701_bp
atggcgtccatggggaccctcgccttcgatgaatatgggcgccctttcctcatcatcaag
gatcaggaccgcaagtcccgtcttatgggacttgaggccctcaagcctgacggccgcgtg
ggactgcgctccagtgggagggcgccggggagatgctctgtcacctgtcggttaaggcgc
tgctgcttaacctttcgctggtccacccgcaacggcccttccgttttctttgtcactcgt
aaagggcttgataagatgatggtggataaggatggagatgtgactgtaactaatgatggg
gccaccatcttaagcatgatggatgttgatcatcagattgccaagctgatggtggaactg
tccaagtctcaggatgatgaaattggagatggaaccacaggagtggttgtcctggctggt
gccttgttagaagaagcggagcaattgctagaccgaggcattcacccaatcagaatagcc
gatggctatgagcaggctgctcgtgttgctattgaacacctggacaagatcagcgatagc
gtccttgttgacataaaggacaccgaacccctgattcagacagcaaaaaccacgctgggc
tccaaagtggtcaacagttgtcaccgacagatggctgagattgctgtgaatgccgtcctc
actgtagcagatatggagcggagagacgttgactttgagcttatcaaagtagaaggcaaa
gtgggcggcaggctggaggacactaaactgattaagggcgtgattgtggacaaggatttc
agtcacccacagatgccaaaaaaagtggaagatgcgaagattgcaattctcacatgtcca
tttgaaccacccaaaccaaaaacaaagcataagctggatgtgacctctgtcgaagattat
aaagcccttcagaaatacgaaaaggagaaatttgaagagatgattcaacaaattaaagag
actggtgctaacctagcaatttgtcagtggggctttgatgatgaagcaaatcacttactt
cttcagaacaacttgcctgcggttcgctgggtaggaggacctgaaattgagctgattgcc
atcgcaacaggagggcggatcgtccccaggttctcagagctcacagccgagaagctgggc
tttgctggtcttgtacaggagatctcatttgggacaactaaggataaaatgctggtcatc
gagcagtgtaagaactccagagctgtaaccatttttattagaggaggaaataagatgatc
attgaggaggcgaaacgatcccttcacgatgctttgtgtgtcatccggaacctcatccgc
gataatcgtgtggtgtatggaggaggggctgctgagatatcctgtgccctggcagttagc
caagaggcggataagtgccccaccttagaacagtatgccatgagagcgtttgccgacgca
ctggaggtcatccccatggccctctctgaaaacagtggcatgaatcccatccagactatg
accgaagtccgagccagacaggtgaaggagatgaaccctgctcttggcatcgactgtttg
cacaaggggacaaatgatatgaagcaacagcatgtcatagaaaccttgattggcaaaaag
caacagatatctcttgcaacacaaatggttagaatgattttgaagattgatgacattcgt
gaaacagctccaacagattga
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_6|156_aa
MSYLSHDLDAYCKDATEASSVVFQLLGSHFVCSSSSSLTSCERQNSKPVLPRAHLVIPSG
TDLDTVRRDFTDFIKVTNQLTLKEGNYPGGSNAVTQAHRGSQIKVKDTAGDSRDGGLGKC
ESEGSTSNQILASSNERIQLRVVRQRDRLRQVLKEE
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_6|471_bp
atgtcttatttatcacatgatcttgacgcctactgcaaagacgcaactgaagccagcagt
gtggtctttcagctcctcggctcccacttcgtttgcagcagttcatcgtccctcacctcc
tgtgagaggcagaattctaaacctgtgctcccaagagcccatctggttattccttcaggc
actgatctagatactgtgagaagggacttcacggatttcattaaggttaccaatcagctg
accttaaaagagggcaattatccaggtggttccaatgcagtcacacaagcccacaggggc
tcacagataaaggtgaaagatacagcaggagactcaagagatggcggccttggcaagtgt
gaatctgaagggtctacatcaaatcaaatccttgcctcctcgaatgaaagaattcagctg
cgggtcgtaaggcagagggacagactgaggcaagttttaaaggaggagtga
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_7|245_aa
MANEAYPCPCDIGHRLEYGGLGREVQVEHIKAYVTKSPVDAGKAVIVIQDIFGWQLPNTR
YIADMISGNGYTTIVPDFFVGQEPWDPSGDWSIFPEWLKTRNAQKIDREISAILKYLKQQ
CHAQKIGIVGFCWGGTAVHHLMMKYSEFRAGVSVYGIVKDSEDIYNLKNPTLFIFAENDV
VIPLKDVSLLTQKLKEHCKVEYQIKTFSGQTHGFVHRKREDCSPADKPYIDEARRNLIEW
LNKYM
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_7|738_bp
atggctaacgaagcttatccttgtccgtgtgacattggccacagacttgagtatggaggg
ctaggccgtgaagttcaagtcgagcacatcaaggcttatgtcaccaaatcccccgttgat
gcaggcaaagctgtgattgtcattcaagatatatttggctggcagttgcccaataccaga
tatatagctgacatgatctcaggaaatggatacacaaccattgttccagacttctttgta
gggcaagagccttgggacccctctggcgactggtctatcttccctgagtggctgaaaaca
agaaatgcccagaagatcgatagagagatcagtgctatcttgaagtatctgaaacaacag
tgtcatgcccagaaaattggcatcgtgggattctgctggggtggaactgctgtccatcat
ttgatgatgaaatactcagaattcagggcaggggtgtccgtctatggcattgtcaaggat
tctgaagacatttacaatttaaagaaccccactttgttcatttttgctgaaaatgatgtt
gtgattccactcaaggacgtatctttgctgactcagaagttgaaagaacactgcaaagtt
gaatatcaaattaaaacattttctgggcagactcatgggttcgtgcatcggaagagagaa
gattgctcacctgcagacaagccctacattgacgaggccagaaggaatttaattgagtgg
ctgaacaagtacatgtag
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_8|68_aa
MAGLARSEGGVFRPLTSKDESEDMKILTVHPSEICQNDFVASSLLSNYSAASSPLKQGCR
YLLLRIGK
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_8|207_bp
atggcagggttagcacgatccgagggtggagttttcaggcctctgacatcaaaagatgag
tcagaggacatgaaaatcctcactgtgcatccttcagagatctgccagaacgacttcgtg
gccagcagtctcttatctaactactccgcagctagtagtcctcttaagcaaggatgccgg
tatttgctgctcagaattggcaaatag
>gi568815593f:10150341_10364780|GENSCAN_predicted_peptide_9|91_aa
XNIHSPVDMDMTFITICLVCLCFNTESAEIHRNSPEAHGAGGFDEDVERSLPAELIFAFT
SKRLCVLAASQRRAFWEERAAWVKAQPRGSA
>gi568815593f:10150341_10364780|GENSCAN_predicted_CDS_9|276_bp
nctaatatccactcccctgtggacatggatatgaccttcatcaccatctgtttggtttgc
ctatgtttcaacacagagagtgcagaaatccacaggaacagcccagaagctcatggagct
gggggctttgatgaggacgtggagaggagcttgccagctgaactaatatttgccttcacc
tccaagcgactttgtgttttggcagcaagtcagagaagagccttctgggaagagagagca
gcttgggttaaagcccagccacgtgggagtgcttga