GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:29:40

Sequence gi568815581f_30632048 : 261739 bp : 44.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S  .Begin  ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------- ------- ---- -- -- ---- ---- ----- ----- ------

 1.19 PlyA - 1207052 1207047    6                               1.05
 1.18 Term - 1220261 1220092  170  0  2   83   48   194 0.999  12.94
 1.17 Intr - 1220983 1220765  219  2  0   70   78   199 0.914  15.37
 1.16 Intr - 1226464 1226433   32  0  2   86   97    45 0.753   2.97
 1.15 Intr - 1227251 1226767  485  2  2   73   51   118 0.050  -1.58
 1.14 Intr - 1236065 1235999   67  1  1   90  108    34 0.855   4.51
 1.13 Intr - 1241440 1241331  110  1  2   72   73    82 0.948   4.18
 1.12 Intr - 1243366 1243187  180  1  0   62   82   123 0.660   9.16
 1.11 Intr - 1246280 1246203   78  2  0   89   91    39 0.511   4.05
 1.10 Intr - 1248707 1248506  202  1  1   57   46    73 0.176  -0.71
 1.09 Intr - 1252458 1252377   82  1  1   82   80    71 0.429   4.40
 1.08 Intr - 1254014 1253933   82  2  1   38  100    61 0.931   1.51
 1.07 Intr - 1255258 1255189   70  0  1   12   79   111 0.382   1.78
 1.06 Intr - 1257356 1257238  119  2  2   75   81    84 0.382   5.66
 1.05 Intr - 1260747 1260700   48  0  0   91   92    12 0.613   0.78
 1.04 Intr - 1262690 1262598   93  2  0   96   87    51 0.965   5.96
 1.03 Intr - 1262964 1262923   42  0  0   75   98    35 0.756   1.64
 1.02 Intr - 1267683 1267599   85  2  1   66  108    59 0.998   5.52
 1.01 Init - 1269580 1269489   92  1  2   59   41   136 0.975   5.96
 1.00 Prom - 1272939 1272900   40                              -3.06

 2.00 Prom + 1276727 1276766   40                              -5.06
 2.01 Init + 1305200 1305473  274  1  1  105   70   248 0.160  19.87
 2.02 Intr + 1344474 1344567   94  1  1   30   95    50 0.031  -1.08
 2.03 Intr + 1350952 1351057  106  1  1   83   86    48 0.159   4.32
 2.04 Intr + 1362517 1362674  158  0  2   76   96   122 0.915  10.61
 2.05 Term + 1366611 1366956  346  0  1   79   48   119 0.553   0.97
 2.06 PlyA + 1368732 1368737    6                               1.05

 3.00 Prom + 1371276 1371315   40                              -5.76
 3.01 Init + 1375986 1376085  100  2  1   81   80   280 0.999  26.92
 3.02 Intr + 1385925 1386031  107  1  2   72   87    36 0.159   1.83
 3.03 Intr + 1388650 1388844  195  0  0  106   30    46 0.087   0.21
 3.04 Intr + 1389346 1390778 1433  0  2  106   52   757 0.006  62.25
 3.05 Intr + 1402863 1402934   72  0  0   70   84    49 0.065   1.22
 3.06 Intr + 1403518 1403592   75  1  0   89   93    38 0.077   3.03
 3.07 Intr + 1408491 1408779  289  0  1  -42   38   555 0.221  34.75
 3.08 Intr + 1408891 1409613  723  0  0  -19   63  1000 0.921  78.43
 3.09 Intr + 1410243 1410372  130  2  1   48   55   101 0.504   3.07
 3.10 Intr + 1415714 1415854  141  0  0   82   72   174 0.298  15.52
 3.11 Intr + 1417055 1417347  293  0  2  109   78   290 0.781  26.95
 3.12 Intr + 1417956 1418060  105  2  0  107   94   -15 0.566   1.41
 3.13 Term + 1421219 1421281   63  1  0  117   53    15 0.603  -1.21
 3.14 PlyA + 1421416 1421421    6                               1.05

 4.06 PlyA - 1421952 1421947    6                               1.05
 4.05 Term - 1424257 1424235   23  2  2   81   49    21 0.259  -4.03
 4.04 Intr - 1424383 1424315   69  2  0   86   75    78 0.255   5.45
 4.03 Intr - 1445234 1445140   95  1  2   24   74   102 0.098   2.01
 4.02 Intr - 1447996 1447819  178  2  1   41   22   151 0.156   2.78
 4.01 Intr - 1454919 1454884   36  1  0   34  111    53 0.012   0.43

Suboptimal exons with probability > 0.800

Exnum Type S  .Begin  ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------- ------- ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 1371278 1371061  218  1  2   56   81   201 0.801  14.35
S.002 Init - 1373853 1373816   38  0  2   81   72    34 0.828   0.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f_30632048|GENSCAN_predicted_peptide_1|751_aa
MAEIIQERIEDRLPELEQLERIGLFSHAEIKAIIKKASDLEYKIQRRTLFKEDFINYVQY
EINLLELIQRRRTRIGYSFKKDEIENSIVHRVQGVFQRASAKWKDDVQLWLSYVAFCKKW
LCGLWQPNGKWKIDCLQKAQGNYFFAHCAFIQSAQNFIKKMELMHAEKLRKEKEEFEKAS
MDVENPDYSEEILKGELAWIIYKNSVSIIKGAEFHVSLLSIAQLFDFAKDLQKEIYDDLQ
ALHTDDPLTWDYVARRELEIESQTEEQPTTKQAKAVEVGRKEERCCAVYEEAVKTLPTGE
LHQTMRLERTMTVFRKAHELKLLSECQYKQLSVSLLCYNFLREALEVAVAGTELFRDSGT
MWQLKLQVLIESKSPDIAMLFEEAFVHLKPQKALLAVIGADSVTLKNKYLDWAYRSGGYK
KARAVFKSLQESRPFSVDFFRKMIQFEKEQADWRFCGGSSGGQPGRRVTWPTASRATSRG
PQGMDLQAAGAQAQGAAEPSRGPPLPSARGAPPSPEVSAAWGPGATASRSLGREVAGPGW
AGAAGPGKRLWPASGRPATRSAVWARARAENGQAAGPRGCGAGKAEGVGGRSGSAETAVR
PLGAAADLVNGSLYRLGAERLEARGTQSIPNDSPARGEGTHSEEEGFAMDEEDSDGELNT
WELSEGTNCPPKEQPGDLFNEDWDSELKADQGNPYDADDIQESISQELKPWVCCAPQGDM
IYDPSWHHPPPLIPYYSKMVFETGQFDDAED

>gi568815581f_30632048|GENSCAN_predicted_CDS_1|2256_bp
atggcagagataattcaggaacgcatagaagatcggctcccggaattggaacagctggag
cgcattggactgttcagtcatgcggagattaaggctatcattaagaaggcttccgatcta
gagtacaaaatccagagaagaacccttttcaaggaagactttatcaattatgttcaatat
gaaattaatcttttggagctgatccagagaagaagaacacgcattggatattcatttaag
aaggatgagattgagaattctattgtacaccgggtacaaggtgttttccagcgtgcctca
gcaaaatggaaagacgatgttcaactttggctctcctatgtggctttttgtaagaagtgg
ctttgtggattatggcagccaaatgggaaatggaagatcgattgtcttcagaaagcgcaa
ggcaactatttcttcgcgcactgcgctttcatccagagtgcccaaaactttataaagaag
atggagctgatgcatgctgaaaaactgaggaaggagaaggaagaatttgaaaaagccagt
atggatgtggagaatcctgattattctgaagaaatccttaagggcgagttggcatggatc
atctacaaaaattctgtaagcataattaaaggtgcagaatttcacgtgtcactgctttcg
attgcacagctatttgactttgccaaagatctacaaaaagagatttatgatgaccttcag
gctctacacacagatgatcctctcacttgggattatgtggcaaggcgagaattagagatt
gagtcacagacagaagagcagcctacaacgaaacaagccaaagcagtggaggtcggccgg
aaggaggagaggtgctgtgctgtgtatgaagaggcagtgaagactctgccaacaggtgaa
cttcaccaaaccatgaggttggaaagaaccatgactgtattcaggaaggcacatgaactg
aagcttctgtcagaatgccaatacaagcagttgagtgtttcgttgctgtgttataacttc
ctgagggaagctctggaagtggcagtagctggaactgaattgtttagagactctgggaca
atgtggcagctgaagctgcaggtgctgatcgagtcaaagagccctgacatagccatgctt
tttgaagaagcctttgtgcacctgaaaccccagaaagctctcttagctgtcataggtgcc
gactcagtaaccctgaagaataagtacctggattgggcttatcgaagtggtggctacaaa
aaggccagagctgtgtttaaaagtttacaggagagccgaccattttcagttgactttttc
aggaaaatgattcagtttgaaaaggagcaagctgattggcgcttctgcggcggatcctcg
ggcgggcagccgggccggcgcgtcacgtggcccacggcgtccagggcgaccagccgcggg
ccgcagggcatggaccttcaggccgccggggcccaggcgcagggggccgcggagccgtct
cggggcccgccgctgcctagcgcgcggggggcgccccccagcccggaggtgagcgccgcg
tgggggccaggagcaacagccagccgcagcctggggcgggaagtcgcggggccgggctgg
gcgggggccgcggggccggggaagcgcctctggccggcctcgggccgccccgcgacacgg
agcgctgtttgggctcgcgctcgagctgaaaacggccaggccgcggggccgcggggctgc
ggggcggggaaagccgagggcgtgggtgggcgctctgggtcagcagagacggctgtccgc
ccgctgggcgccgctgcggatttggtaaatgggagtctgtaccggctaggtgctgagcgg
cttgaagcccgtggaacacagagcattcctaatgacagtcctgcccggggtgagggcacc
cattctgaagaggaaggctttgccatggatgaggaggactctgatggagaactgaatacc
tgggagctgtcagaagggacaaactgtccacccaaggaacagcctggcgatctttttaat
gaggactgggactcggagttgaaagcagatcaagggaatccatatgatgctgacgacatc
caggagagcatttctcaagagcttaaaccttgggtgtgctgtgccccacaaggagacatg
atctatgaccccagctggcaccatccgcctccactgataccctattattccaagatggtc
tttgaaacaggacagtttgacgatgctgaagattga

>gi568815581f_30632048|GENSCAN_predicted_peptide_2|325_aa
MAPQKHGGGGGGGSGPSAGSGGGGFGGSAAVAAATASGGKSGGGSCGGGGSYSASSSSSA
AAAAGAAVLPVKKPKMEHVQADHELFLQAFENVNEELPARRKRNREDGEKTFVAQMTVFD
KNRRLQLLDGEYEVAMQEMEECPISKKRATWETILDGKYHPKGARIDVSINECYDGSYAG
NPQDIHRQPGFAFSRNGPVKRTPITHILVCRFIADNQMNHACMLFVENYGQKIIKKNLCR
NFMLHLVSMHDFNLISIMSIDKAVTKLREMQQKLEKGESASPANEEITEEQNGTANGFSE
INSKEKALETDSVSGVSKQSKKQKL

>gi568815581f_30632048|GENSCAN_predicted_CDS_2|978_bp
atggcgcctcagaagcacggcggtgggggagggggcggctcggggcccagcgcggggtcc
gggggaggcggcttcgggggttcggcggcggtggcggcggcgacggcttcgggcggcaaa
tccggcggcgggagctgtggagggggtggcagttactcggcctcctcctcctcctccgcg
gcggcagcggcgggggctgcggtgttaccggtgaagaagccgaaaatggagcacgtccag
gctgaccacgagcttttcctccaggcctttgagaatgtcaatgaagagcttccagccaga
agaaaacgaaatcgtgaggatggggaaaagacatttgttgcacaaatgacagtatttgat
aaaaacaggcgcttacagcttttagatggggaatatgaagtagccatgcaggaaatggaa
gaatgtccaataagcaagaaaagagcaacatgggagactattcttgatgggaagtatcat
ccaaaaggtgctaggatagatgtttctatcaatgagtgttatgatggctcctatgcagga
aatcctcaggatattcatcgccaacctggatttgcttttagtcgcaacggaccagttaag
agaacacctatcacacatattcttgtgtgcaggtttattgctgacaatcaaatgaatcat
gcctgtatgctgtttgtagaaaattatggacagaaaataattaagaagaatttatgtcga
aacttcatgcttcatctagtcagcatgcatgactttaatcttattagcataatgtcaata
gataaagctgttaccaagctccgtgaaatgcagcaaaaattagaaaagggggaatctgct
tcccctgcaaacgaagaaataactgaagaacaaaatgggacagcaaatggatttagtgaa
attaactcaaaagagaaagctttggaaacagatagtgtctcaggggtttcaaaacagagc
aaaaaacaaaaactctga

>gi568815581f_30632048|GENSCAN_predicted_peptide_3|1241_aa
MEEDDDDSDYPEELEDDDEDASYSTESSFSSTPESGKIAVECRPSEEIVDVRWEELHSLI
QVCGDKNSKVSCSPLKEEYSRISPQRRKNSGCQNWTSTSEDPKARSRDFLGKLCRAPTPP
HPTTAGRWSPGTTPMSALPQEPTENLAPFLKELDSAGELPLGPEPFLAAHQDLNDKRTPE
ERLPEVVPLLNRDQNQALVQLPRLKWVQTTDLDRAAGHQADEILVPLDSKVSRPTKFVVS
PKNLKKDLAERWSLPEIVGIPHQLSKPQRQKQTLPDDYLSMDTLYPGSLPPELRVNADEP
PGPPEQVGLSQFHLEPKSQNPETLEDIQSSSLQEEAPAQLLQLPQEVEPSTQQEAPALPP
ESSMESLAQTPLNHEVTVQPPGEDQAHYNLPKFTVKPADVEVTMTSEPKNETESTQAQQE
APIQPPEEAEPSSTALRTTDPPPEHPEVTLPPSDKGQAQHSHLTEATVQPLDLELSITTE
PTTEVKPSPTTEETSAQPPDPGLAITPEPTTEIGHSTALEKTRAPHPDQVQTLHRSLTEV
TGPPTKLESSQDSLVQSETAPEEQKASTSTNICELCTCGDETLSCVGLSPKQRLRQVPVP
EPDTYNGIFTTLILNRNPLTTVEDPYLFELPALKYLDMGTTHITLTTLKNILTMTVELEK
LDGDVISKAVTEMLARTIKYLQPNPASQAKLTMLNTVCKIQGQVKNPGYPQLEGLLSECL
IRHQKELGNESNFSDALLDAGESMKHLAEVKDSLDIEKRQGKIPDEELRQALEKFEDSKE
VAETSMHNLLETNIEQVSQLRALAEAQLNYHWQAMQILDELAEKLKRRMREASSRPKREY
KPKFWEPFDLGEPEQSNGGFPCTTVPKIAASSPFRSSDKSIWTPSRSMPPLDQPSCKVLY
DFQPENHGELGFHEGDVFTLINQINENWYQGMLDSQLGFFLLSYMDVLMPLPSDSLVPPP
RPSIHTGWHPRQVSCLPWAPAAKAVSKPAGATQARALEPGPRKPVSLWHKLPPPATYTPP
QHTSHGTTAPAVLLRAKHKGSTEEASVGNPEGAFMKMLQARKQHMSTQLTIESEAPSDSS
GINLSGFGGDQLEIQLTEQLRSLIPNEDVRKFMSHVIRTLKMECSETHVQGSCAKLMLRT
GLLMKLLSEQQEAKALNVEWDTDQQKTNYINENMEQNEQKEQKSSELMKEVPGDDYKNKL
IFAISVTVILIILIIIFCLIEVNSHKRASEKYKDNPSISGA

>gi568815581f_30632048|GENSCAN_predicted_CDS_3|3726_bp
atggaggaggacgacgatgactccgattatccggaggagctggaagatgacgacgaggac
gccagttactccacggaaagcagcttcagcagcactccagaaagtggtaaaatagctgtg
gagtgcagacccagtgaagagattgtagatgtcagatgggaagaactacacagtttaatt
caagtatgtggagataaaaactcaaaggtcagttgctctcccctgaaggaagagtattct
cggatttcacctcagaggaggaagaattcaggctgccagaactggactagcacttctgaa
gatcccaaggccaggtcccgtgacttccttgggaagctctgccgtgcccccaccccaccc
caccccaccacagcaggccgctggagtcctgggaccaccccgatgtcagccctgcctcag
gaaccaactgaaaatttggctccattcctgaaggaattggattcagctggagagctgccc
ctggggccagagccgttcttggctgcacatcaggacttaaatgacaagcggactccagaa
gaaaggctcccagaggtggttccgcttctcaaccgggatcagaaccaggccctagttcag
cttcctcgcctcaagtgggttcaaactacagatctagatcgggctgcaggtcatcaggca
gatgaaatacttgttccactagacagtaaggtttcaagaccaaccaaatttgttgtttcg
cccaagaacctgaagaaagatctagctgaacgttggagccttcctgagattgttgggatt
ccacaccaattatccaaacctcagcgtcagaaacagactttgccagatgattatttgagt
atggacacactgtatcccggcagcctacctccagaactccgggtgaacgcagatgagcct
ccagggcctcctgagcaagttggactttctcaattccatctagagcccaaaagtcaaaat
ccagagacccttgaagacatccagtcctcttcactccaggaagaagccccagcgcagctt
ctacagctccctcaggaggtagaaccttcaacccagcaggaggccccagctctgcctcca
gagtcctctatggagagtctagctcaaactccactgaatcatgaagtgacagttcaacct
ccaggtgaggatcaagctcattataatttgcccaagtttacagtcaaacctgcagatgtg
gaggttaccatgacttcagagcctaaaaatgagacagaatctacccaagcccagcaggag
gccccaattcagcctcccgaggaggcggaaccttcttctacagccctgaggactacagat
cctcctccagaacaccctgaggtgacacttccaccttcagacaagggtcaggctcagcat
tcacacctgactgaagccacagttcaacctctggacctggagcttagcataactacagag
cctactacagaggttaaaccgtctccaaccacggaggaaacctcagctcagcctccagac
ccggggcttgccataactccagaacccactacagagattggacattccacagccctggag
aagactagagctcctcatccagaccaggttcagactctgcatcgaagcctgactgaagtc
acaggtccacctacaaagttagaatcttcgcaggattcattggtgcagtctgaaactgca
ccagaggaacagaaggcctccacaagcaccaacatatgtgagctctgcacctgcggagat
gagactctgtcatgtgttggtctcagcccaaagcagaggctccgccaagtgcctgtgcca
gagcccgacacctacaatggcatcttcaccaccttaattctcaatcgcaatcctctgact
actgtcgaagatccatatctctttgaactgccggcattaaaatatctagacatgggaaca
acacacatcacacttacaacacttaagaacattctcacgatgactgttgaactggaaaaa
ctagatggagatgtcatcagcaaggcggtgacggaaatgctggcaaggaccatcaagtac
ctgcagcccaacccagcctcacaggctaagctgaccatgctcaacacagtgtgcaagatc
cagggccaggtgaagaaccccggctacccgcagttggaggggctcctgagcgagtgcctg
atccgccaccagaaggagctgggcaacgagtccaacttcagtgacgcactgctggatgcc
ggcgagtccatgaagcacctggcagaggtgaaggactccctggacatagagaagcggcag
ggcaagatccccgatgaggagctgcgtcaggcgctggagaagtttgaggactccaaggag
gtagcagaaaccagcatgcacaacctcctggagaccaacattgagcaggtgagtcaactc
cgggccctggcggaggcgcagctgaactaccactggcaggccatgcagatcctggacgag
ctggcagagaagctcaagcgcaggatgcgggaagcttcctcacgccccaagcgggagtat
aagcccaagttctgggagccctttgacctcggagagcctgagcagtccaatgggggcttc
ccctgcaccacagtccccaagattgcagcttcatcccctttccgatcttccgacaagtcc
atctggactcctagcaggagcatgccgcccctagaccagccaagctgcaaggtgctgtat
gacttccagcctgagaaccatggggagctgggcttccatgagggtgacgtcttcacgctg
atcaaccagatcaacgagaactggtaccagggcatgctggacagccagttgggcttcttc
ctgctcagctacatggacgtgctcatgcctctgcccagtgactcactggtgcccccgccc
cgcccctccatccacactgggtggcacccccgccaggtctcctgccttccatgggctcct
gctgccaaggcggtgtccaagcctgccggcgccacccaggcccgggcccttgagccaggc
ccacggaagcctgtgtcactctggcacaagctgccaccaccagccacctacacacccccc
cagcacacctcgcacgggaccacagccccagccgtgctgctaagggccaagcacaaaggc
tccactgaagaagcatctgtagggaatccagaaggagcgttcatgaagatgttacaagcc
cggaagcagcacatgagcactcagctgactattgagtcggaggcgccctcagacagcagt
ggcatcaacttgtcaggctttgggggtgatcagcttgaaattcagctaaccgagcagcta
cggtccctcatccccaacgaggatgtgagaaagttcatgtctcatgttatccggaccttg
aaaatggaatgttcagaaacacatgtgcaagggagctgtgccaagctcatgttgcgaaca
ggcctcctgatgaagcttctcagcgagcagcaggaagcaaaggcattgaatgtagaatgg
gatacggaccaacaaaaaacaaattatattaatgagaacatggaacagaatgaacagaaa
gagcagaagtcaagtgagctcatgaaagaagttccaggagatgactataagaacaaactc
atcttcgcaatatctgtgactgtaatactaataattttgattataattttttgtcttata
gaggtgaattcacataaaagggcatcagaaaaatacaaagacaacccatcaatatcagga
gcctga

>gi568815581f_30632048|GENSCAN_predicted_peptide_4|133_aa
XSWIREPKILSSARCEGQWLILKFLSYSLIRFLPKLWFQLTPLSEAASEFSCYGNGAAEI
PRNLLVRLIASPSCFTFCHELKLPEIFPEAKQMPALCLYSPQNLTQRPHKTLFVQPEDAK
EELEGAEADNDKG

>gi568815581f_30632048|GENSCAN_predicted_CDS_4|402_bp
ngttcttggattcgagagcctaagatcctgagttcagctcggtgtgagggccagtggttg
atcttgaagtttctttcctactccctcatccgtttcctgcccaagctgtggttccagctg
actcccctatcagaggctgcgagtgagttcagttgttatggcaacggggctgcggagatc
cccaggaacctgctggtccgcctcattgcctcaccttcctgcttcaccttctgccatgag
ttaaagctccctgagatctttccagaagccaagcagatgccagcactgtgcttgtacagc
ccgcagaacttgactcagaggccccacaagaccctctttgtccagccagaagatgcaaag
gaagaattagaaggtgcagaagcagataatgataaaggttag