GENSCAN 1.0    Date run: 7-Nov-116    Time: 15:44:02

Sequence gi568815581f:57005465_57220321 : 214857 bp : 45.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3251   3315   65  1  2   37   46   128 0.439   4.12
 1.02 Intr +   6219   6285   67  0  1   66  109    25 0.201   1.31
 1.03 Intr +  35247  35345   99  2  0   85   60    34 0.020   0.61
 1.04 Intr +  40233  40413  181  2  1   19   -7   295 0.098  12.54
 1.05 Term +  40596  40981  386  1  2  -49   52   658 0.922  43.55
 1.06 PlyA +  41310  41315    6                               1.05

 2.10 PlyA -  42810  42805    6                               1.05
 2.09 Term -  58417  58269  149  2  2   61   40    97 0.208   0.26
 2.08 Intr -  73946  73823  124  2  1   86   62    -7 0.043  -3.34
 2.07 Intr -  79554  79376  179  1  2  -13   93   140 0.549   4.04
 2.06 Intr -  80008  79807  202  1  1   49   11   166 0.589   3.86
 2.05 Intr -  80656  80468  189  1  0   61   76   134 0.949   9.28
 2.04 Intr -  89022  88728  295  2  1   11   84   141 0.507   3.01
 2.03 Intr -  90937  90792  146  1  2   47   82   123 0.638   6.68
 2.02 Intr -  92638  92512  127  0  1   89   47   100 0.898   6.68
 2.01 Init -  96485  96451   35  2  2  101   87    -4 0.800   0.14
 2.00 Prom -  97789  97750   40                              -8.36

 3.00 Prom +  98894  98933   40                              -8.76
 3.01 Init + 100001 101714 1714  1  1   74   75  1183 0.996 104.73
 3.02 Intr + 104561 104694  134  0  2  124   97   117 0.984  16.56
 3.03 Intr + 106334 106460  127  1  1   54   86    85 0.989   5.15
 3.04 Intr + 107027 107154  128  0  2   75   80   122 0.952  10.50
 3.05 Intr + 108995 109172  178  1  1  115   80   249 0.999  26.29
 3.06 Intr + 110647 110797  151  2  1   94   99   313 0.999  32.22
 3.07 Intr + 111396 111463   68  0  2  120   92    37 0.923   5.85
 3.08 Intr + 112917 112990   74  1  2   67   89    67 0.956   3.83
 3.09 Intr + 113518 113580   63  0  0  109  100    15 0.908   3.71
 3.10 Intr + 120212 120425  214  1  1   60   81    72 0.205   1.99
 3.11 Term + 122399 122511  113  0  2   16   48    94 0.224  -3.08
 3.12 PlyA + 122965 122970    6                               1.05

 4.03 PlyA - 123135 123130    6                               1.05
 4.02 Term - 127172 125912 1261  1  1  106   37   425 0.335  30.26
 4.01 Init - 128186 128170   17  2  2   45  100     9 0.308  -2.44
 4.00 Prom - 143033 142994   40                              -1.06

 5.03 PlyA - 143286 143281    6                               1.05
 5.02 Term - 144096 144079   18  0  0  127   44    21 0.109  -0.08
 5.01 Init - 157757 157701   57  2  0   64   91    87 0.623   7.91
 5.00 Prom - 163382 163343   40                              -3.96

 6.00 Prom + 166902 166941   40                              -7.66
 6.01 Init + 171423 171488   66  2  0   52   94    62 0.127   4.27
 6.02 Intr + 182450 182563  114  1  0   68    7   100 0.036   0.54
 6.03 Intr + 189144 189351  208  2  1  105   71    93 0.672   7.95
 6.04 Intr + 198882 199026  145  1  1   -7   90   106 0.067   0.64
 6.05 Term + 206648 206759  112  2  1   62   49    37 0.002  -4.77
 6.06 PlyA + 208628 208633    6                               1.05

 7.02 PlyA - 208807 208802    6                               1.05
 7.01 Sngl - 209551 209162  390  1  0   85   42   181 0.430   9.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_1|265_aa
MHHLNVDRYGTTATLQDSPERCSAELSCSPFPRILLEHVCFPGQHSQPSMVFVLFSRNSL
IAGLSLVHRCGPGIWHVSRPPLENADQHLFTLPQGYGQFAFGIFDDSFEIPTFSPGAQAD
GSKDPERPWETEHQSRPLANGLDAFAQLLNQFENTGPPPADEEKIQYLPTVPVTEEHVGS
GLECPVCKDDYALGEQLPRNHLFHDGCIVHRLEQHDSCPVCRKSLPGHNTATNTPAPGPT
GMNCSSSSSSPSSSSPSKENATSNS

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_1|798_bp
atgcatcacctgaatgtggaccgctatggcaccacggccacactccaggacagccctgaa
agatgctcagctgagctcagctgctcccccttcccccggatcctcctagagcatgtctgc
ttcccgggccagcacagccagccaagtatggtgtttgtcctgttttctagaaactccttg
atagcaggcttaagtctggttcatcgatgtggccctggcatctggcacgtaagccggccg
ccgttggagaacgcggaccagcacctgttcacgctgccgcagggctacggacagtttgct
ttcggcatctttgacgacagcttcgagatccccacgttctctcctggggcgcaggctgac
ggcagcaaggaccctgagagaccgtgggagacagagcatcagtcccggcccctggccaac
ggcctggacgccttcgcacagctcctcaatcagtttgaaaacacgggccccccaccggca
gatgaagagaaaatccagtacctccccaccgtccccgtcaccgaggagcacgtaggctcc
gggctcgagtgccccgtgtgcaaggacgactacgcgctgggcgagcagctgccccgcaac
cacctgttccacgatggctgcatagtgcaccggctggagcagcacgacagctgccccgtc
tgccgaaaaagcctcccgggacacaacacggccacgaacacccccgccccgggcccgact
gggatgaactgctcctcctcgtcgtcctccccctcctccagctcgcccagtaaagagaac
gccacaagtaactcctga

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_2|481_aa
MEKITWIRKTKREVPAASQNSPQKSNGLTLNSADKPGNCRLQPSAKQENKSLAMSSCQEL
GTLSQPNSQKEGQKLNPVVSEYLLGKAFYGQNENPESTTRCSRPSCLLTSATQLKAQSMP
SKVPVPAWIGWLLVGHKPRTPSQQATERWAPPRQGNQPRKEGHWIQKRDKRRGEGIPRLK
GKQSQGKGMRPGETWNRCEYLGLRMQIPPETPLSLRPPDPNSVRTVPQPETRLDHNRPMS
RPIFPVPLARRTELPGAAPAGEPRAPHRPDPAAFGSPAGGPRRSCPRPTSSSATIGVRDA
ESARRRRPPAVPRGPARSAALPQLPHVLAQAESRHLSGARERGRAGRYSNRRNLDSLPRK
PSTGFAALSRRVNQAISCLGTESALEKAEGPTLVTQERWQDFHPSHLTWNPWINQLPLEA
NPETKGAVYTETVAEVTLKPYWIVPDLLLPAHMPLVPVKYSRTLPEENPALNYPEPVWFG
L

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_2|1446_bp
atggaaaaaatcacctggatacgtaaaacaaaaagggaggtacctgcagcttctcagaac
agccctcagaaaagtaatggcctcactctcaactcagctgacaaacctggtaactgcagg
ctacagccttctgctaaacaagaaaacaaatccctggcaatgagcagctgtcaggaactc
gggacactttcccagcccaattcacagaaggaaggccagaagcttaacccagtagtgtct
gagtacctgcttggcaaagccttctacggccaaaatgaaaaccctgaatccactaccagg
tgctctaggcctagttgtctgctgacttcagccacccaactgaaggcccaatccatgcct
tcaaaggtgcctgtccctgcatggatcggctggcttcttgtgggccacaaaccacgcaca
ccttctcagcaagctactgaacgatgggctcccccaagacaaggaaatcaaccaagaaag
gaggggcactggatccagaaacgagacaagagaaggggagagggaattccaaggctgaag
gggaagcagagccagggaaaagggatgcgcccaggagaaacgtggaaccgttgcgagtac
ctggggctacgaatgcaaattccccccgagacgcccctgagcctccgacctccagacccc
aattcggttcgaacggtccctcagcccgaaacacggttggatcacaaccggccgatgtcc
cggcccatcttcccagttcccctggctcgccgcacggaactcccaggggctgcgccggcg
ggcgagccgagggctccccatcgccccgaccccgccgccttcgggtctccggcagggggt
ccccggcgcagctgcccccgacctaccagctcctcggctaccatcggggtgcgggatgcg
gagtccgcgcgccgccgccggcccccggctgtgccccgcggcccggcccgcagcgccgcg
cttccgcagctcccccacgtgctagcccaggcggagagccgtcatctctccggtgcccgg
gagaggggccgggcgggacggtacagcaaccggaggaaccttgattccctgccccgcaag
cccagcaccggttttgccgccttgtctcgaagggtcaaccaggccatctcctgcctcggg
acggagagcgccctggaaaaggcggaggggccgaccttagtcacacaagagcgatggcaa
gattttcacccaagccatctgacttggaacccatggattaaccaactgccactggaggca
aatccagagaccaagggagcagtttatacagaaacagttgcagaagtcactctaaaaccc
tactggattgtcccagaccttctgctcccagctcatatgcctttggttcctgtcaagtat
tccaggacactgccagaagagaaccctgccctgaactaccctgaacctgtctggtttggg
ctctga

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_3|987_aa
MAIQFRSLFPLALPGMLALLGWWWFFSRKKGHVSSHDEQQVEAGAVQLRADPAIKEPLPV
EDVCPKVVSTPPSVTEPPEKELSTVSKLPAEPPALLQTHPPCRRSESSGILPNTTDMRLR
PGTRRDDSTKLELALTGGEAKSIPLECPLSSPKGVLFSSKSAEVCKQDSPFSRVPRKVQP
GYPVVPAEKRSSGERARETGGAEGTGDAVLGEKVLEEALLSREHVLELENSKGPSLASLE
GEEDKGKSSSSQVVGPVQEEEYVAEKLPSRFIESAHTELAKDDAAPAPPVADAKAQDRGV
EGELGNEESLDRNEEGLDRNEEGLDRNEESLDRNEEGLDRNEEIKRAAFQIISQVISEAT
EQVLATTVGKVAGRVCQASQLQGQKEESCVPVHQKTVLGPDTAEPATAEAAVAPPDAGLP
LPGLPAEGSPPPKTYVSCLKSLLSSPTKDSKPNISAHHISLASCLALTTPSEELPDRAGI
LVEDATCVTCMSDSSQSVPLVASPGHCSDSFSTSGLEDSCTETSSSPRDKAITPPLPEST
VPFSNGVLKGELSDLGAEDGWTMDAEADHSGGSDRNSMDSVDSCCSLKKTESFQNAQAGS
NPKKVDLIIWEIEVPKHLVGRLIGKQGRYVSFLKQTSGAKIYISTLPYTQSVQICHIEGS
QHHVDKALNLIGKKFKELNLTNIYAPPLPSLALPSLPMTSWLMLPDGITVEVIVVNQVNA
GHLFVQQHTHPTFHALRSLDQQMYLCYSQPGIPTLPTPVEITVICAAPGADGAWWRAQVV
ASYEETNEVEIRYVDYGGYKRVKVDVLRQIRSDFVTLPFQGAEVLLDSVMPLSDDDQFSP
EADAAMSEMTGNTALLAQVTSYSPTGLPLIQLWSVVGDEAGALTAPTDFFLSIEHLLYAK
HPAGGFKDETNIEPLGSFSVLGVGFDDLGADGAFLSGCLLPLCNGILAIADTDLIGLGYT
LGTGRDLEKVLPLECNVYTKAHQVHTS

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_3|2964_bp
atggcaatccagttccgttcgctcttccccttggcattgcctgggatgctggcgctcctc
ggctggtggtggtttttctctcgtaaaaaaggccatgtcagcagccatgatgagcagcag
gtggaggctggtgctgtgcagctgagggctgaccctgccatcaaggaacctctccccgtg
gaagacgtctgtcccaaagtagtgtccacaccccccagtgtcacagagcctccagaaaag
gaactgtccaccgtgagcaagctgcctgcagagcccccagcattgctccagacacaccca
ccttgccgaagatcagagtcctcgggcattcttcctaacaccacagacatgagattgcga
ccaggaacacgcagagatgacagtacaaagctggagctagccctgacaggtggtgaagcc
aaatcgattcctctagagtgccccctttcatccccaaagggtgtactattctccagcaaa
tcagctgaggtgtgtaagcaagattcccccttcagcagggtgccaaggaaggtccagcca
ggctaccccgtagtccccgcagagaagcgtagctctggggagagggcaagagagacaggt
ggggccgaagggactggtgatgccgtgttgggggaaaaggtgcttgaagaagctctgttg
tctcgggagcatgtcttggaattggagaacagcaagggccccagcctggcctctttagag
ggggaagaagataaggggaagagcagctcatcccaggtggtggggccagtgcaggaggaa
gagtatgtagcagagaagttgccaagtaggttcatcgagtcggctcacacagagctggca
aaggacgatgcggcgccagcacccccagtcgcagacgccaaagcccaggatagaggtgtc
gagggagaactgggcaatgaggagagcttggatagaaatgaggagggcttggatagaaat
gaggagggcttggatagaaatgaggagagcttggatagaaatgaggagggcttggataga
aatgaggagattaagcgggctgccttccagataatctcccaagtgatctcagaagcaacc
gaacaggtgctggccaccacggttggcaaggttgcaggtcgtgtgtgtcaggccagtcag
ctccaagggcagaaggaagagagctgtgtcccagttcaccagaaaactgtcttgggccca
gacactgcggagcctgccacagcagaggcagctgttgccccgccggatgctggcctcccc
ttgccaggcctaccagcagagggctcaccaccaccaaagacctacgtgagctgcctgaag
agccttctgtccagccccaccaaggacagtaagccaaatatctctgcacaccacatctcc
ctggcctcctgcctggcactgaccacccccagtgaagagttgccggaccgggcaggcatc
ctggtggaagatgccacctgtgtcacctgcatgtcagacagcagccaaagtgtccctttg
gtggcttctccaggacactgctcagattctttcagcacttcagggcttgaagactcttgc
acagagaccagctcgagccccagggacaaggccatcaccccgccactgccagaaagtact
gtgcccttcagcaatggggtgctgaagggggagttgtcagacttgggggctgaggatgga
tggaccatggatgcggaagcagatcattcaggaggttctgacaggaacagcatggattcc
gtggatagctgttgcagtctcaagaagactgagagcttccaaaatgcccaggcaggctcc
aaccctaagaaggtcgacctcatcatctgggagatcgaggtgccaaagcacttagtcggt
cggctaattggcaagcaggggcgctatgtgagttttctgaagcaaacatctggtgccaag
atctacatttcaaccctgccttacacccagagcgtccagatctgccacatagaaggctct
caacatcatgtagacaaagcgctgaacttgattgggaagaagttcaaagagctgaacctc
accaatatctacgctcccccattgccttcactggcactgccttctctgccgatgacatcc
tggctcatgctgcctgatggcatcaccgtggaggtcattgtggtcaaccaggtcaatgcc
gggcacctgttcgtgcagcagcacacacaccctaccttccacgcgctgcgcagcctcgac
cagcagatgtacctctgttactctcagcctggaatccccaccttgcccaccccagtggaa
ataacggtcatctgtgccgcccctggtgcggacggggcctggtggcgagcccaagtggtt
gcctcctacgaggagaccaacgaagtggagattcgatacgtggactacggcggatataag
agggtgaaagtagacgtgctccggcaaatcaggtctgactttgtcaccctgccgtttcag
ggagcagaagtccttctggacagtgtgatgcccctgtcagacgatgaccagttttcaccg
gaagcagatgccgccatgagcgagatgacggggaatacagcactgcttgctcaggtgaca
agttacagtccaactggtcttcctctgattcagctgtggagtgtggttggagatgaagca
ggagctctgactgctcccactgatttctttctctccattgagcacctgctgtatgccaag
catcctgctgggggcttcaaagatgaaacaaacattgagcctttagggagcttctctgtt
ttgggagtggggtttgatgatctcggtgcagatggtgctttcctgagtgggtgccttctc
ccactttgtaatgggatcctggcaattgctgacacggatttaattggtctgggctacacc
ctgggcactgggagagatttagaaaaagtattacctttggagtgtaacgtgtacaccaaa
gcacaccaagtgcacacatcttga

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_4|425_aa
MAGKTSRPTLTPSSPPTLTPSSRPTLTPCSTPTLTPSSTPTLTPSNTPTLTPSNTPTLTP
SSPPTLTPSSTPTLTPSSQPILTRSSTPTLTPSSPPTLTPSNTPTLTPSNTPTLTPSSTP
TLTPSSTPTLTPSNTPTLTPSSTPTLTPSSTPTLTPSSQPILTRSSTPTLTPSSPPTLTP
SNTPTLTPSNTPTLTPSNTPTLTPSSTPTLTPSSTPTLTPSSRPILTRSSTPTLTPSSRP
ILTRSSTPTLTPSSTPTLTPSSTPTLTPSSTPTLTPSSRPILTRSSTPTLTPSSRPILTP
SSTPTLTPSSRPILTRSSTPTLTPSSTPTLTPSNTPTLTPSNTPTLTPSNTPTLTPSSTP
TLTPSSTPTLTPSSRPILTPSSTPTLTPSSRPILTRSSTPTLTPCSRPTLYFTASCSLLP
KLPRG

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_4|1278_bp
atggctggaaaaacaagccggcccaccctcaccccctccagcccacccaccctcaccccc
tccagccggcccaccctcaccccctgcagcacacccaccctcaccccctccagcacaccc
accctcaccccctccaacacacccaccctcaccccctccaacacacccaccctcaccccc
tccagcccgcccaccctcaccccctccagcacgcccaccctcaccccctccagccagccc
atcctcacccgctccagcacacccaccctcaccccctccagcccgcccaccctcaccccc
tccaacacacccaccctcaccccctccaacacacccaccctcaccccctccagcacaccc
accctcaccccctccagcacacccaccctcaccccctccaacacacccaccctcaccccc
tccagcacgcccaccctcaccccctccagcacgcccaccctcaccccctccagccagccc
atcctcacccgctccagcacacccaccctcaccccctccagcccacccaccctcaccccc
tccaacacacccaccctcaccccctccaacacacccaccctcaccccctccaacacaccc
accctcaccccctccagcacacccaccctcaccccctccagcacgcccaccctcaccccc
tccagccggcccatcctcacccgctccagcacacccaccctcaccccctccagccggccc
atcctcacccgctccagcacgcccaccctcaccccctccagcacgcccaccctcaccccc
tccagcacacccaccctcaccccctccagcacgcccaccctcaccccctccagccggccc
atcctcacccgctccagcacgcccaccctcaccccctccagccggcccatcctcaccccc
tccagcacacccaccctcaccccctccagccggcccatcctcacccgctccagcacaccc
accctcaccccctccagcacgcccaccctcaccccctccaacacacccaccctcaccccc
tccaacacacccaccctcaccccctccaacacacccaccctcaccccctccagcacaccc
accctcaccccctccagcacacccaccctcaccccctccagccggcccatcctcaccccc
tccagcacacccaccctcaccccctccagccggcccatcctcacccgctccagcacaccc
accctcaccccctgcagccggcccaccctctatttcaccgcttcctgctcacttcttccc
aagttacccagaggctga

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_5|24_aa
MIAALINIAAGEIYMGRDQRSHED

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_5|75_bp
atgattgctgccctcatcaatattgctgccggcgaaatctacatgggacgcgatcagagg
agtcatgaagattag

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_6|214_aa
MGKQFDKLWYIHTTEDYTAAKKTNKTTDALRSAASSLMADEQNVFAEAPLHLLSTLITSQ
KFTLKEALCRFLIKALPKPYACTELYKLGTTIPQPGKRPGTENREKYLVHGIGKIRVCKD
GLWKHLEVLAWCEEHREISWPNRTPVALDASLNQIYIRSVSCLSFPKASWAAYSHSQRAA
LPPHDKSLLEVTGAPRAQPKQVTPTLRDRGPSSV

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_6|645_bp
atggggaaacagtttgataaattatggtacattcatacaacagaagattatacagctgct
aaaaagaccaacaagactactgatgccttgagatcagctgcctcaagtctcatggcagat
gagcagaatgtctttgctgaagcacccttgcatcttctgagcacactcatcacctcgcag
aagttcactttgaaagaggccttgtgtcgctttctgatcaaggctcttcccaagccttat
gcctgtactgagttgtacaagttagggaccaccatccctcaacctgggaaaaggcctggc
acagagaacagagagaaatatttggtgcatggaatcggtaaaataagggtgtgcaaggat
gggctgtggaagcacctagaggtgctggcctggtgtgaagagcatcgggaaatctcatgg
ccaaatcggacccccgtggccttggatgccagcctcaatcagatctacattcgaagtgtc
tcctgcctttccttccccaaagcctcctgggcagcctactcccacagccagagagctgca
ttaccccctcatgacaagtcgctgttggaggtcactggggcaccaagagcccagccgaag
caggtgactccaaccctcagagacagagggccctcaagtgtctga

>gi568815581f:57005465_57220321|GENSCAN_predicted_peptide_7|129_aa
MLQQKALSSYKDFSKGSPEMSDNEPFTASEGWLNRSRNRFGLKNRNTTREAASANEEVAA
TFLAKLKKLINEKGYPPKQVFNCNEIWIFWEKMGNRTYIHKSAKEVPGQKTWKDRLALLP
CDNTAGHMI

>gi568815581f:57005465_57220321|GENSCAN_predicted_CDS_7|390_bp
atgttgcagcagaaagcattgagttcatacaaagacttcagcaagggatcccctgaaatg
agcgacaacgagccatttactgcaagtgagggatggttaaacagatccaggaataggttt
ggactgaaaaatagaaatactactagagaggccgcatctgccaatgaagaagttgctgcc
acatttctggcaaagttgaagaagttaattaacgagaaaggataccctccaaagcaagtc
ttcaattgcaatgaaatctggattttctgggagaagatgggcaatagaacctacattcat
aaaagtgcaaaggaagtaccagggcagaaaacatggaaggacagattagctctgttgcca
tgtgacaatactgcagggcatatgatatag