GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:57:13 Sequence gi568815575f:111023104_111320444 : 297341 bp : 39.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3183 3430 248 1 2 52 43 170 0.843 3.87 1.02 PlyA + 3462 3467 6 1.05 2.00 Prom + 4486 4525 40 -5.45 2.01 Init + 6646 6723 78 0 0 82 66 51 0.804 3.41 2.02 Intr + 10237 10290 54 0 0 129 68 34 0.056 3.76 2.03 Intr + 52188 52392 205 2 1 51 86 114 0.218 5.35 2.04 Intr + 57179 57245 67 0 1 49 91 64 0.005 -0.16 2.05 Intr + 73394 73518 125 2 2 24 57 152 0.276 5.01 2.06 Intr + 74472 74581 110 1 2 75 86 80 0.536 5.58 2.07 Intr + 75465 75538 74 2 2 47 82 43 0.088 -3.11 2.08 Intr + 75856 75959 104 1 2 95 64 69 0.116 4.10 2.09 Intr + 80056 80203 148 2 1 65 105 148 0.421 12.57 2.10 Term + 85410 85614 205 0 1 53 55 101 0.351 -0.84 2.11 PlyA + 87063 87068 6 1.05 3.00 Prom + 95446 95485 40 -5.55 3.01 Init + 100001 100175 175 1 1 49 37 191 0.768 9.66 3.02 Intr + 118993 119093 101 2 2 45 93 90 0.661 4.11 3.03 Intr + 124634 124787 154 1 1 110 94 112 0.924 12.72 3.04 Intr + 129307 129344 38 2 2 105 123 -2 0.803 2.06 3.05 Intr + 139812 139943 132 2 0 54 89 224 0.997 19.02 3.06 Intr + 140459 140624 166 1 1 64 78 182 0.794 13.41 3.07 Intr + 149915 149978 64 0 1 89 110 36 0.658 2.76 3.08 Intr + 158658 158770 113 0 2 90 70 60 0.457 3.50 3.09 Intr + 159065 159120 56 0 2 121 64 21 0.267 0.78 3.10 Intr + 169403 169515 113 1 2 75 97 58 0.856 3.66 3.11 Intr + 171198 171315 118 0 1 86 99 152 0.989 15.65 3.12 Intr + 173364 173537 174 2 0 71 110 123 0.707 12.01 3.13 Intr + 193318 193455 138 0 0 70 67 157 0.809 11.54 3.14 Term + 197255 197344 90 1 0 113 47 34 0.610 -1.46 3.15 PlyA + 197391 197396 6 1.05 4.04 PlyA - 197517 197512 6 1.05 4.03 Term - 200139 200009 131 1 2 97 42 60 0.268 -0.34 4.02 Intr - 208235 208131 105 0 0 51 106 38 0.065 1.17 4.01 Init - 220299 220077 223 0 1 66 39 174 0.172 9.06 4.00 Prom - 220543 220504 40 -6.65 5.13 PlyA - 220820 220815 6 1.05 5.12 Term - 223656 223474 183 0 0 144 35 216 0.989 18.46 5.11 Intr - 224401 224265 137 2 2 68 110 102 0.998 9.77 5.10 Intr - 224889 224768 122 2 2 91 69 163 0.999 13.92 5.09 Intr - 225668 225466 203 2 2 97 31 144 0.976 6.76 5.08 Intr - 225954 225832 123 0 0 101 93 148 0.999 16.46 5.07 Intr - 228000 227814 187 2 1 95 36 238 0.999 18.07 5.06 Intr - 228183 228106 78 2 0 81 73 123 0.996 7.85 5.05 Intr - 228639 228446 194 0 2 86 82 247 0.983 21.27 5.04 Intr - 229396 229204 193 0 1 56 68 219 0.635 15.27 5.03 Intr - 230113 229905 209 1 2 91 78 111 0.999 7.35 5.02 Intr - 231300 231169 132 0 0 83 62 80 0.965 4.82 5.01 Init - 240833 240669 165 2 0 61 93 135 0.973 10.98 5.00 Prom - 245071 245032 40 -5.35 6.04 PlyA - 245372 245367 6 1.05 6.03 Term - 247337 247175 163 1 1 13 33 272 0.440 10.63 6.02 Intr - 247850 247678 173 2 2 70 69 78 0.287 1.92 6.01 Init - 249988 249902 87 1 0 70 86 53 0.605 3.99 6.00 Prom - 260283 260244 40 -3.75 7.04 PlyA - 260553 260548 6 1.05 7.03 Term - 274258 274123 136 0 1 58 47 119 0.060 1.31 7.02 Intr - 278640 278527 114 2 0 111 42 68 0.060 3.14 7.01 Init - 290117 290032 86 2 2 77 116 3 0.096 2.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_1|82_aa XLEVLARAIRQEKEIKGIQIGKEEVKLSLFADDMIVYLENSEDSSKKLLELGNEFGKVLG YKINVHKSVSLLYTNSDQAEIK >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_1|249_bp ntactggaagtcctagccagagcaatcagacaagagaaagaaataaagggcatccaaatt ggtaaagaggaagtcaagctgtccctatttgctgatgatatgattgtatacctagaaaac tctgaagactcttccaaaaagctcctagaactgggaaatgaattcggcaaagttttagga tacaaaattaatgtacacaaatcagtatctctgctatacaccaacagtgaccaagctgag atcaaataa >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_2|389_aa MTNGTSCGSPGVIQGLQHCTICDEKHIQTHTQVFIMHRPKCRGQAQRLKSKEWLHGPGPE CQCPVTPQEAASCIPTAPAPAVVQRALDTARTTASEGTSCKPWQLPCDVQSAEISTAPQS TATTTLISQQPSTCRCSAFPDPPPVLFVPQPRVSYVAPRSPETKRAVCMIPVRGQVDSNP GQVYGCLRYWNRRRSIPVGDNTVALFWDEQGVKEPVALTKRTRNSMSSSLNNWLFPPGAW NGLVCQVGTTQLLQQGAEKGLQRRAGPESSISGFPQERAAESLHLVAAGVPSTESVGAEA SPDPSHGQCPPVLKSCSGGGVRSAATLDSYGSVSPIVNCTGKGSRLCAPYENLMPDDLRW NSFIPKPPHCPMSMEKLSSTKLVPDAKKG >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_2|1170_bp atgactaatggcacttcgtgtggatctcctggtgttattcaaggtttacagcattgcact atatgtgatgaaaaacatatccaaacacacacgcaggtattcataatgcacaggcctaaa tgtcgaggtcaggcccagaggcttaagagtaaagaatggcttcatgggccaggcccagag tgccaatgccccgtgacacctcaggaggctgcttcttgcatccctactgctccagctcca gctgtggttcaaagggccctagatacagctcggaccactgcttcagagggcacaagctgt aagccttggcagcttccatgtgatgttcagtctgcagaaatttccacagccccccaatct acagcaaccaccaccctgatcagtcagcagccatcaacatgcagatgttctgcctttccg gacccccctcctgtcctgtttgtgccccagcctcgagtctcctacgtggctccccggagc cctgagaccaagcgagcagtttgcatgatcccagttcgcggacaagtggactcaaatcca ggccaagtgtatggctgtctgaggtattggaacagaaggaggtccattcctgttggtgac aacaccgtggccctgttctgggatgagcaaggtgtaaaggaaccagtagctttaaccaag agaacaagaaactcaatgagcagcagcctcaacaactggctctttcctcctggggcctgg aatggactcgtctgccaagtcggcactacacagcttctccagcagggggcggaaaagggc ctgcagcgacgcgcaggacccgagtcctccatttcaggtttcccccaagaaagagcagct gagtccttgcatcttgtggcagctggtgtgcccagcactgagtctgtaggagctgaagcc agcccggacccttctcatgggcagtgcccacctgtgctgaagtcctgcagcggtggcggt gtgagatcagcagcaacattagattcttatgggagcgtgagccctattgtgaactgcaca ggcaagggatctaggttgtgtgctccttatgagaatctgatgcctgatgatctgagatgg aacagtttcattccaaaaccaccccattgccccatgtccatggaaaaattgtcttccaca aaactggtacctgatgccaaaaaaggttga >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_3|543_aa MSDGLDNEEKPPAPPLRMNSNNRDSSALNHSSKPLPMAPEEKNKKARLRSIFPGGGDKTN KKKEKERPEISLPSDFEHTIHVGFDAVTGEFTGIPEQWARLLQTSNITKLEQKKNPQAVL DVLKFYDSKETVNNQKYMSFTSGDKSAHGYIAAHPSSTKTASEPPLAPPVSEEEDEEEEE EEDENEPPPVIAPRPEHTKSIYTRSVVESIASPAVPNKEVTPPSAENANSSTLYRNTDRQ RKKSKMTDEEILEKLRSIVSVGDPKKKYTRFEKIGQGKLWARVVSSVLIEVILWVGPPTP KVFASSSDLFWATQGLLNPMAHTVKQLLRFNVGVAIKQMNLQQQPKKELIINEILVMREN KNPNIVNYLDSYLVGDELWVVMEYLAGGSLTDVVTETCMDEGQIAAVCREITPEQSKRST MVGTPYWMAPEVVTRKAYGPKVDIWSLGIMAIEMVEGEPPYLNENPLRALYLIATNGTPE LQNPERLSAVFRDFLNRCLEMDVDRRGSAKELLQHPFLKLAKPLSSLTPLIIAAKEAIKN SSR >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_3|1632_bp atgtctgacggtctggataatgaagagaaacccccggctcctccactgaggatgaatagt aacaaccgggattcttcagcactcaaccacagctccaaaccacttcccatggcccctgaa gagaagaataagaaagccaggcttcgctctatcttcccaggaggaggggataaaaccaat aagaagaaggagaaagagcgcccagagatctctcttccttcagactttgagcatacgatt catgtggggtttgatgcagtcaccggggaattcactggaattccagagcaatgggcacga ttactccaaacttccaacataacaaaattggaacagaagaagaacccacaagctgttcta gatgttctcaaattctatgattccaaagaaacagtcaacaaccagaaatacatgagcttt acatcaggagataaaagtgcacatggatacatagcagcccatccttcgagtacaaaaaca gcatctgagcctccattggcccctcctgtgtctgaagaagaagatgaagaggaagaagaa gaagaagatgaaaatgagccaccaccagttatcgcaccaagaccagagcatacaaaatca atctatactcgttctgtggttgaatccattgcttcaccagcagtaccaaataaagaggtc acaccaccctctgctgaaaatgccaattccagtactttgtacaggaacacagatcggcaa agaaaaaaatccaagatgacagatgaggagatcttagagaagctaagaagcattgtgagt gttggggacccaaagaaaaaatacacaagatttgaaaaaattggtcaagggaaactatgg gcaagagtggtgtctagtgtgcttatagaagtgattctttgggtaggaccacccacccca aaagtatttgcttcttcatcggatctattttgggccacacaaggcctcttgaatccaatg gcacacactgtgaaacagctgcttagatttaatgttggagtggccataaagcagatgaac cttcaacagcaacccaagaaggaattaattattaatgaaattctggtcatgagggaaaat aagaaccctaatattgttaattatttagatagctacttggtgggtgatgaactatgggta gtcatggaatacttggctggtggctctctgactgatgtggtcacagagacctgtatggat gaaggacagatagcagctgtctgcagagagatcactcctgagcaaagtaaacgaagcact atggtgggaaccccatattggatggcacctgaggtggtgactcgaaaagcttatggtccg aaagttgatatctggtctcttggaattatggcaattgaaatggtggaaggtgaaccccct taccttaatgaaaatccactcagggcattgtatctgatagccactaatggaactccagag ctccagaatcctgagagactgtcagctgtattccgtgactttttaaatcgctgtcttgag atggatgtggataggcgaggatctgccaaggagcttttgcagcatccatttttaaaatta gccaagcctctctccagcctgactcctctgattatcgctgcaaaggaagcaattaagaac agcagccgctaa >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_4|152_aa MKTLEENLGNTIQGIGMGKDFMTKTPKAMATKAKIDKRDLIKLNSVFTAKETIISMNRQP TELEKFFAIYLSDKGRKMSHPITVGHSVYSSQQEIKEKSPGHHPYVCQQRHFGINIERCA LVISGHTSNIKDLDRLTCVKMSLKIQGKRINC >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_4|459_bp atgaaaaccctagaagaaaacctaggcaataccattcagggcataggcatgggcaaagat ttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaacgggatcta attaaactaaacagcgtcttcacagcaaaagaaactatcatcagcatgaacaggcaacct acagaattggagaaattttttgcaatctatctatctgacaaaggaagaaaaatgagtcac cctatcacagtgggccacagtgtatactctagtcaacaggaaattaaagaaaagtcacca ggccaccatccatatgtttgccagcaaagacattttggaatcaacattgaaagatgtgcc ctcgttatctcagggcacactagcaacatcaaagatttggatagactcacctgtgtcaag atgagcttaaaaatccagggaaagagaataaactgttaa >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_5|641_aa MGPPLKLFKNQKYQELKQECIKDSRLFCDPTFLPENDSLFYNRLLPGKVVWKRPQDICDD PHLIVGNISNHQLTQGRLGHKPMVSAFSCLAVQESHWTKTIPNHKEQEWDPQKTEKYAGI FHFRFWHFGEWTEVVIDDLLPTINGDLVFSFSTSMNEFWNALLEKAYAKLLGCYEALDGL TITDIIVDFTGTLAETVDMQKGRYTELVEEKYKLFGELYKTFTKGGLICCSIESPNQEEQ EVETDWGLLKGHTYTMTDIRKIRLGERLVEVFSAEKVYMVRLRNPLGRQEWSGPWSEISE EWQQLTASDRKNLGLVMSDDGEFWMSLEDFCRNFHKLNVCRNVNNPIFGRKELESVLGCW TVDDDPLMNRSGGCYNNRDTFLQNPQYIFTVPEDGHKVIMSLQQKDLRTYRRMGRPDNYI IGFELFKVEMNRKFRLHHLYIQERAGTSTYIDTRTVFLSKYLKKGNYVLVPTMFQHGRTS EFLLRIFSEVPVQLRELTLDMPKMSCWNLARGYPKVVTQITVHSAEDLEKKYANETVNPY LVIKCGKEEVRSPVQKNTVHAIFDTQAIFYRRTTDIPIIVQVWNSRKFCDQFLGQVTLDA DPSDCRDLKSLYLRKKGGPTAKVKQGHISFKVISSDDLTEL >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_5|1926_bp atgggtcctcctctgaagctcttcaaaaaccagaaataccaggaactgaagcaggaatgc atcaaagacagcagacttttctgtgatccaacatttctgcctgagaatgattctcttttc tacaaccgactgcttcctggaaaggtggtgtggaaacgtccccaggacatctgtgatgac ccccatctgattgtgggcaacattagcaaccaccagctgacccaagggagactggggcac aagccaatggtttctgcattttcctgtttggctgttcaggagtctcattggacaaagaca attcccaaccataaggaacaggaatgggaccctcaaaaaacagaaaaatacgctgggata tttcactttcgtttctggcattttggagaatggactgaagtggtgattgatgacttgttg cccaccattaacggagatctggtcttctctttctccacttccatgaatgagttttggaat gctctgctggaaaaagcttatgcaaagctgctaggctgttatgaggccctggatggtttg accatcactgatattattgtggacttcacgggcacattggctgaaactgttgacatgcag aaaggaagatacactgagcttgttgaggagaagtacaagctattcggagaactgtacaaa acatttaccaaaggtggtctgatctgctgttccattgagtctcccaatcaggaggagcaa gaagttgaaactgattggggtctgctgaagggccatacctataccatgactgatattcgc aaaattcgtcttggagagagacttgtggaagtcttcagtgctgagaaggtgtatatggtt cgcctgagaaaccccttgggaagacaggaatggagtggcccctggagtgaaatttctgaa gagtggcagcaactgactgcatcagatcgcaagaacctggggcttgttatgtctgatgat ggagagttttggatgagcttggaggacttttgccgcaactttcacaaactgaatgtctgc cgcaatgtgaacaaccctatttttggccgaaaggagctggaatcggtgttgggatgctgg actgtggatgatgatcccctgatgaaccgctcaggaggctgctataacaaccgtgatacc ttcctgcagaatccccagtacatcttcactgtgcctgaggatgggcacaaggtcattatg tcactgcagcagaaggacctgcgcacttaccgccgaatgggaagacctgacaattacatc attggctttgagctcttcaaggtggagatgaaccgcaaattccgcctccaccacctctac atccaggagcgtgctgggacttccacctatattgacacccgcacagtgtttctgagcaag tacctgaagaagggcaactatgtgcttgtcccaaccatgttccagcatggtcgcaccagc gagtttctcctgagaatcttctctgaagtgcctgtccagctcagggaactgactctggac atgcccaaaatgtcctgctggaacctggctcgtggctacccgaaagtagttactcagatc actgttcacagtgctgaggacctggagaagaagtatgccaatgaaactgtaaacccatat ttggtcatcaaatgtggaaaggaggaagtccgttctcctgtccagaagaatacagttcat gccatttttgacacccaggccattttctacagaaggaccactgacattcctattatagta caggtctggaacagccgaaaattctgtgatcagttcttggggcaggttactctggatgct gaccccagcgactgccgtgatctgaagtctctgtacctgcgtaagaagggtggtccaact gccaaagtcaagcaaggccacatcagcttcaaggttatttccagcgatgatctcactgag ctctaa >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_6|140_aa MAFILGKSVECTVRQIVCEALERERKSLGSLPEVLITITDLPTRAMSDQGHVCSLAHINF ANTSPFFSACPLSSNVHVCDINPGTENSSGNGSSSSSSSSSSSRAPGITQVSREGIRKLT LEFYFLDNLASKKDEETSTL >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_6|423_bp atggcatttatccttggcaaaagtgtagaatgcactgttaggcagatcgtttgtgaggct ttagaaagggagcggaaatcactagggtcacttcctgaagtcctcatcactatcacagat ctgcctaccagggccatgtctgaccagggccatgtctgcagcctagcccatattaacttt gccaacacttcaccatttttctctgcatgtcctctctctagtaatgtgcatgtctgtgat attaaccctgggacagaaaacagcagcggcaacggcagcagcagcagcagcagcagcagc agcagcagcagggctcctgggataactcaggtgagtagagagggaattcgcaaacttacc ctggagttttatttcctggataacttagcgtctaagaaagatgaagaaacttcaactttg tag >gi568815575f:111023104_111320444|GENSCAN_predicted_peptide_7|111_aa MGLSFTYPIIWHQRKCALVQIAHISSPLRTCTCLCPWMTRTRLVIPCKGGESAQSPEYKS KPIIVVGTREQDAYIPKMSSRTLMGMIPKITPPQKRLCQETSPDRNTGTVV >gi568815575f:111023104_111320444|GENSCAN_predicted_CDS_7|336_bp atggggctgtctttcacctaccccatcatctggcaccaaaggaaatgtgcactggtccaa attgcccatatttcttcccctttaaggacctgtacctgcctctgtccttggatgactcgg actcgcttggtgattccatgtaaaggaggggagagtgctcagagtccagagtacaaatcc aagcctatcattgtagtagggacaagagaacaggatgcctatatccccaaaatgagctcc aggacactgatgggaatgatcccaaagatcaccccacctcagaaacgtctgtgccaagag acttccccagatagaaacactgggacagtggtttga