GENSCAN 1.0	Date run:  4-Nov-116	Time: 18:08:55

Sequence gi568815593r:69264991_69465737 : 200747 bp : 42.29% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1415   1417    3  1  0  113   81     0 0.629   1.85
 1.02 Intr +   4217   4303   87  1  0  100  119    43 0.822   7.85
 1.03 Intr +   7902   8051  150  2  0   67   68    58 0.657   1.14
 1.04 Intr +  11553  11700  148  2  1   61  116   111 0.966  10.09
 1.05 Intr +  14636  14798  163  0  1   69   66   155 0.384   9.51
 1.06 Intr +  20539  20635   97  1  1   49   94    11 0.090  -3.11
 1.07 Intr +  23820  23919  100  2  1   81  103    57 0.293   5.26
 1.08 Intr +  33573  33716  144  1  0   68   94    70 0.324   4.93
 1.09 Term +  43948  44513  566  2  2   44   43   175 0.028   2.17
 1.10 PlyA +  45663  45668    6                               1.05

 2.02 PlyA -  47858  47853    6                               1.05
 2.01 Sngl -  48882  48562  321  0  0  108   47   317 0.822  25.34
 2.00 Prom -  48984  48945   40                              -7.55

 3.00 Prom +  49955  49994   40                              -5.45
 3.01 Init +  61678  61681    4  0  1   96   67     0 0.264  -1.09
 3.02 Intr +  67537  67875  339  2  0   90   93   140 0.919   9.22
 3.03 Term +  68874  69325  452  1  2  -17   47   416 0.666  21.36
 3.04 PlyA +  69609  69614    6                               1.05

 4.04 PlyA -  74819  74814    6                               1.05
 4.03 Term -  87263  87071  193  1  1   90   38   226 0.996  13.71
 4.02 Intr -  90804  90659  146  0  2   73  116    80 0.998   7.46
 4.01 Init -  94652  94584   69  2  0   53  116    40 0.926   4.40
 4.00 Prom -  94890  94851   40                              -1.55

 5.02 PlyA -  99774  99769    6                               1.05
 5.01 Sngl - 100747  99953  795  1  0   76   33   616 0.954  50.52
 5.00 Prom - 101192 101153   40                              -9.35

 6.00 Prom + 103265 103304   40                              -8.65
 6.01 Init + 104408 104474   67  1  1   60  105   101 0.750   8.29
 6.02 Intr + 104555 104705  151  0  1  122   47    27 0.626   0.30
 6.03 Intr + 104889 104943   55  0  1   50   78    39 0.567  -2.84
 6.04 Intr + 108840 109097  258  2  0  120   84   139 0.974  13.44
 6.05 Intr + 109638 109721   84  2  0   41   93    69 0.772   1.80
 6.06 Intr + 116911 117067  157  0  1   73   99    69 0.881   5.16
 6.07 Intr + 119807 119943  137  0  2   90   72    38 0.963   1.77
 6.08 Intr + 121053 121105   53  2  2   77  100    85 0.993   5.39
 6.09 Intr + 121190 121315  126  2  0   87   82    22 0.552   0.37
 6.10 Intr + 121406 121475   70  2  1   42  115    59 0.987   2.27
 6.11 Intr + 124044 124155  112  2  1   78   88    -5 0.919  -2.47
 6.12 Intr + 126841 127023  183  2  0   37   69   118 0.875   3.64
 6.13 Intr + 128216 128250   35  0  2   86  107    47 0.864   3.42
 6.14 Intr + 128364 128510  147  2  0   72   64    84 0.918   3.91
 6.15 Intr + 131407 131556  150  0  0   58   93    71 0.927   4.04
 6.16 Intr + 145436 145560  125  1  2   27  121    50 0.523   0.66
 6.17 Term + 149041 149302  262  1  1   49   43   258 0.894  11.41
 6.18 PlyA + 149516 149521    6                              -1.95

 7.02 PlyA - 149704 149699    6                               1.05
 7.01 Sngl - 150931 150644  288  1  0   51   36   327 0.421  19.15
 7.00 Prom - 151138 151099   40                              -9.75

 8.00 Prom + 151656 151695   40                             -10.15
 8.01 Init + 154396 155541 1146  0  0   18  107   891 0.996  77.95
 8.02 Intr + 159611 159646   36  1  0   97   95    25 0.725   1.64
 8.03 Intr + 167537 167685  149  1  2   68   69   117 0.837   5.91
 8.04 Intr + 167932 168103  172  1  1   86   92   256 0.919  24.82
 8.05 Intr + 194532 194565   34  2  1   99   94    68 0.068   5.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_1|485_aa MVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAGDDL LDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPNCPVETLKEQSNPALAIK RKRTEALEQGQQQCIKPTRASNLSDFLVCVCRQGAAKGEAAKQLRVQEIIAPKGSEDTQN VSNTKRGNLISRTSESRSTQWNVVSRAQPSEPGAMQVDSVRIEGNDRIPSWCVGNPPPLA SGVRSAELSGCQKQLRPGKGQYQSQRVAARTPDSRCYFLTSVLPHQLCRGPCSCLAGAKR FVAFCPCPRDLWNFELERDDLEYLVEEISKQQSIQEMAWVLLKAFHFKREAQHKSSENLQ PDNAIEKKIPFSKDKFKLAADICISNEEPNVNPQDNGENVSRACQRPLQQPLSSQSQKSR RKKWFRGQGPGSPCCVQSKDLVPCVPATPAVAERGQGTAPSVNIEGASPKPWQLPHDVEP AGAQK >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_1|1458_bp atggttccttttttgccaggagattcagaccttgatcagctaacaagaatatttgaaact ttgggcacaccaactgaggaacagtggccggacatgtgtagtcttccagattatgtgaca tttaagagtttccctggaatacctttgcatcacatcttcagtgcagcaggagacgactta ctagatctcatacaaggcttattcttatttaatccatgtgctcgaattacggccacacag gcactgaaaatgaagtatttcagtaatcggccagggccaacacctggatgtcagctgcca agaccaaactgtccagtggaaaccttaaaggagcaatcaaatccagctttggcaataaaa aggaaaagaacagaggccttagaacaaggacagcagcagtgcatcaaacccactcgtgct tcaaatctgtctgacttcctcgtgtgtgtgtgtagacagggagctgccaagggagaagca gccaagcagctaagagtccaggagataatagcgccaaagggatctgaggacactcaaaat gtgtctaacaccaaaagaggtaacttgatctctaggacaagtgagtccaggagcacacag tggaatgtagtgagcagagcacagccttcagagccaggagcaatgcaggtagacagtgtg agaattgagggaaatgataggatacccagctggtgtgtggggaatcccccacccctagca tctggtgtcagaagcgctgagctgagtggttgccaaaagcaactaagacctgggaagggc caatatcaatcacaacgggttgcggccaggacccctgactcccggtgctacttcctcacc tcagtgctccctcaccagctgtgtcggggcccttgttcctgcctggctggagcaaagaga tttgtggcattttgcccctgccctagagatctgtggaactttgaacttgagagagatgat ttagagtatctggtggaagaaatttctaagcagcaaagcattcaagagatggcttgggta ctgttaaaggcattccattttaaaagggaagcacagcataaaagttcagaaaatttgcag cctgacaatgcaatagaaaagaaaatcccattttctaaggataaattcaagctagctgca gatatttgcataagtaatgaggagccaaatgttaatccccaagataatggggaaaatgtc 
tccagggcatgtcagagacctttgcagcagcctctctcatcacagagccagaagtctcgg aggaaaaaatggtttcgtgggcagggcccagggtccccgtgctgtgtgcagtctaaggac ttggtgccctgcgtcccagccactccagcggtggctgaaaggggccaaggtacagctccg tctgtgaatatagagggtgcaagccccaagccttggcagcttccacatgatgttgagcct gcaggtgcacagaagtaa >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_2|106_aa MASVVAVSDGVIKVFNDMKLCKSSTPEKLKKRKKAVLFRLSEDKKNIILKGRARRYWWAM WTRPSTTPMPPLSRCYRIRIASTPSMTQPMRPREQEGGPGVYLLGH >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_2|321_bp atggcctctgttgtggctgtctctgatggtgtcatcaaggtgttcaacgacatgaagttg tgtaagtcttcaacaccagagaagttgaagaagcgcaagaaggcggtgctcttccgcctg agtgaagacaagaagaacatcatcctgaaggggagggcaaggagatactggtgggcaatg tggaccagaccatcgacaacccctatgccacctttgtcaagatgctaccggataaggatt gcctctacgccctctatgacgcaacctatgagaccaagagagcaagaaggaggacctggt gtttatcttctgggccactga >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_3|264_aa MDEAIKLAFREKDLGSLSTTCRVSTNLRIPPLPWVLPTPGSHLPGLTPGTLSRSVAGTGA RTPAQRTAPSYRPGAPLSSLTWQVASSPCACAAAAARPPGHHVTAWAQSQTGNAAGRHTS GGRAVAYLGCRVEAEAAPPAWPLRPARPPQMRAAPRPVPVVQPPAAAPPSVVGSSAAAPR QPGLMAQMATTAAGVAVGSAVGHAITGAFRGGSNAEPARPDITYQEPQGTQPAQQQQPCF YEISFWSVPRTRVTSSSVRVSMRC >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_3|795_bp atggacgaggccattaaactggcattcagggaaaaggatttgggcagtttgtccacaacc tgcagagtttccacgaatctgcgcatcccgcccctgccctgggtcctccccactcccggt tcccacctgccaggtctgaccccgggtaccctctcaaggtccgtggcggggaccggagcc cggactccagctcagcgaacagcaccctcctaccgccccggcgccccactcagctcgctc acctggcaagtagcctcctctccgtgcgcgtgcgccgctgccgccgcgcgtcctccgggc caccacgtgaccgcgtgggcgcagtcccagacaggaaacgccgccggtcgtcacacgtcc ggaggccgagccgtcgcgtacctaggatgccgcgtggaagccgaagccgcacctcccgca tggcccctccggccagccaggccccctcagatgagagctgcacccaggccagtaccagtc gttcagccaccagcagcggcacccccatctgtagttggctcttctgctgctgcgccccgg cagccaggtctgatggcccagatggcaaccactgcagctggcgtggctgtgggctctgct gtgggtcacgccattactggggctttcaggggaggaagtaatgctgagcctgcgaggcct gacatcacttaccaggagcctcagggaacccagccggcacagcagcagcagccttgcttc 
tatgagatcagtttctggagtgtgcccagaaccagggtgacatcaagctctgtgagggtt tcaatgaggtgctga >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_4|135_aa MAVKHNVMILQFDKQQQLGRFFKVVDELDNQMREGGVIVDYHGCDFFPERWFHIVFVLRT DTNVLYERLETRGYNEKKLTDNIQCEIFQVLYEEATASYKEEIVHQLPSNKPEELENNVD QILKWIEQWIKDHNS >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_4|408_bp atggcagttaaacataatgtcatgattttacagtttgacaaacagcagcagctgggaagg ttttttaaggtagttgatgagttagataaccaaatgagagaaggtggagttattgttgat taccatggttgtgatttcttccctgaacgctggtttcatatagtttttgtgctgagaaca gataccaatgtattgtacgaaagacttgaaacaaggggttataatgagaagaaactaaca gacaatattcagtgtgagatttttcaagttctttatgaagaagccacagcatcctacaag gaagaaatcgtgcatcagctgcccagtaataaaccagaagagctagaaaataatgtagat cagatcttgaaatggattgagcagtggatcaaagatcataactcttga >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_5|264_aa MESGKTASPKSMPKDAQMMAQILKDMGITEYEPRVINQMLEFAFRYVTTILDDAKIYSSH AKKATVDADDVRLAIQCRADQSFTSPPPRDFLLDIARQRNQTPLPLIKPYSGPRLPPDRY CLTAPNYRLKSLQKKASTSAGRITVPRLSVGSVTSRPSTPTLGTPTPQTMSVSTKVGTPM SLTGQRFTVQMPTSQSPAVKASIPATSAVQNVLINPSLIGSKNILITTNMMSSQNTANES SNALKRKREDDDDDDDDDDDYDNL >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_5|795_bp atggagtctggcaagacggcttctcccaagagcatgccgaaagatgcacagatgatggca caaatcctgaaggatatggggattacagaatatgagccaagagttataaatcagatgttg gagtttgccttccgatatgtgaccacaattctagatgatgcaaaaatttattcaagccat gctaagaaagctactgttgatgcagatgatgtgcgattggcaatccagtgccgcgctgat cagtcttttacctctcctcccccaagagattttttattagatattgcaaggcaaagaaat caaacccctttgccattgatcaagccatattcaggtcctaggttgccacctgatagatac tgcttaacagctccaaactataggctgaaatctttacagaaaaaggcatcaacttctgcg ggaagaataacagtcccgcggttaagtgttggttcagttactagcagaccaagtactccc acactaggcacaccaaccccacagaccatgtctgtttcaactaaagtagggactcccatg tccctcacaggtcaaaggtttacagtacagatgcctacttctcagtctccagctgtaaaa gcttcaattcctgcaacctcagcagttcagaatgttctgattaatccatcattaatcggg tccaaaaacattcttattaccactaatatgatgtcatcacaaaatactgccaatgaatca tcaaatgcattgaaaagaaaacgtgaagatgatgatgatgacgatgatgatgatgatgac tatgataatctgtaa 
>gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_6|723_aa MPRALCAPSLPQPSPSRPRALTGRSRKGRAAAKAHGPKGPEAHRGAPSLPRRPSRGPLVT WLSMTHFRRKVGGVVPGRLRSSPKRGCSSGKVTDWVDPSFDDFLECSGVSTITATSLGVN NSSHRRKNGPSTLESSRFPARKRGNLSSLEQIYGLENSKEYLSENEPWVDKYKPETQHEL AVHKKKIEEVETWLKAQVLERQPKQGGSILLITGPPGCGKTTTLKILSKEHGIQVQEWIN PVLPDFQKDDFKGMFNTESSFHMFPYQSQIAVFKEFLLRATKYNKLQMLGDDLRTDKKII LVEDLPNQFYRDSHTLHEVLRKYVRIGRCPLIFIISDSLSGDNNQRLLFPKEIQEECSIS NISFNPVAPTIMMKFLNRIVTIEANKNGGKITVPDKTSLELLCQGCSGDIRSAINSLQFS SSKGENNLRPRKKGMSLKSDAVLSKSKRRKKPDRVFENQEVQAIGGKDVSLFLFRALGKI LYCKKYERDTLLVEPEEVVEMSHMPGDLFNLYLHQNYIDFFMEIDDIVRASEFLSFADIL SGDWNTRSLLREYSTSIATRGVMHSNKARGYAHCQGGGSSFRPLHKPQWFLINKKVLIGH LYIFFGKLFTTFKKSVSAQISFIQDIGRLPLKRHFGRLKMEALTDREHGMIDPDSGDEAQ LNGGHSAEESLGEPTQATVPETWSLPLSQNSASELPASQPQPFSAQGDMEENIIIEDYES DGT >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_6|2172_bp atgcccagagcactctgcgcccccagcctgccccagcccagtccctcccggccgcgcgcc ctgaccgggaggagccggaaggggcgggcggcagcaaaagcccacggccccaaaggcccc gaagcccaccgcggcgcccctagcctgccccggcggcccagccgcggcccactggttacc tggctttcgatgacacatttccgtcgcaaagttggaggggtagtccctggtcgcctccgc tcttcgcctaaaaggggatgcagctccgggaaagtaacagactgggttgacccatcattt gatgattttctagagtgtagtggcgtctctactattactgccacatcattaggtgtgaat aactcaagtcatagaagaaaaaatgggccttctacattagaaagcagcagatttccagcg agaaaaagaggaaatctatcttccttagaacagatttatggtttagaaaattcaaaagaa tatctgtctgaaaatgaaccatgggtggataaatataaaccagaaactcagcatgaactt gctgtgcataaaaagaaaattgaagaagtcgaaacctggttaaaagctcaagttttagaa aggcaaccaaaacagggtggatctattttattaataacaggtcctcctggatgtggaaag acaacgaccttaaaaatactatcaaaggagcatggtattcaagtacaagagtggattaat ccagttttaccagacttccaaaaagatgatttcaaggggatgtttaatactgaatcaagc ttccatatgtttccctatcagtctcagatagcagttttcaaagagtttctactaagagcg acaaagtataacaagttacaaatgcttggagatgatctgagaactgataagaagataatt ctggttgaagatttacctaaccagttttatcgggattctcatactttacatgaagttcta aggaagtatgtgaggattggtcgatgtcctcttatatttataatctcggacagtctcagt ggagataataatcaaaggttattgtttcccaaagaaattcaggaagagtgttctatctca 
aatattagtttcaaccctgtggcaccaacaattatgatgaaatttcttaatcgaatagtg actatagaagctaacaagaatggaggaaaaattactgtccctgacaaaacttctctagag ttgctctgtcagggatgttctggtgatatcagaagtgcaataaacagcctccagttttct tcttcaaaaggagaaaacaacttacggccaaggaaaaaaggaatgtctttaaaatcagat gctgtgctgtcaaaatcaaaacgaagaaaaaaacctgatagggtttttgaaaatcaagag gtccaagctattggtggcaaagatgtttctctgtttctcttcagagctttggggaaaatt ctatattgtaaaaaatatgaacgggatacattacttgttgaacctgaggaggtagtagaa atgtcacacatgcctggagacttatttaatttatatcttcaccaaaactacatagatttc ttcatggaaattgatgatattgtgagagccagtgaatttctgagttttgcagatatcctc agtggtgactggaatacacgctctttactcagggaatatagcacatctatagctacgaga ggtgtgatgcattccaacaaagcccgaggatatgctcattgccaaggaggaggatcaagt tttcgacccttgcacaaacctcagtggtttctaataaataaaaaggtgcttattggccat ttgtatattttctttggaaaattgtttacaacttttaaaaaatctgtttcagctcagatt tcttttatccaagatattggaaggctccctctgaagcgacactttggaagattgaaaatg gaagccctgactgacagggaacatggaatgatagaccctgacagcggagatgaagcccag cttaatggaggacattctgcagaggaatctctgggtgaacccactcaagccactgtgccg gaaacctggtctcttcctttgagtcagaatagtgccagtgaactgcctgctagccagccc cagcccttttcagcccaaggagacatggaagaaaacataataatagaagactacgagagt gatgggacatag >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_7|95_aa MSLRVRSGPAGRGSVCLCPAQAPDPYSTLGHSSWKAAAMAGEMAAPAKCHRNHAWSPLHK QNHRFTWLKSFAGPKKRVDFGYFCASEGSGKALRF >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_7|288_bp atgtcactgagagttcgttctggtcccgcaggtcgcggcagcgtgtgtctttgtcctgct caggccccggacccttactccactcttggacactcatcttggaaagccgcggcaatggca ggagaaatggctgccccagcaaagtgtcacaggaatcatgcctggtcacctttacacaaa cagaaccaccggtttacctggcttaaaagttttgctggacccaagaaacgggtagatttc ggatacttctgcgcttccgaggggtcgggaaaggctctgcggttctaa >gi568815593r:69264991_69465737|GENSCAN_predicted_peptide_8|513_aa MSNDGRSRNRDRRYDEVPSDLPYQDTTIRTHPTLHDSERAVSADPLPPPPLPLQPPFGPD FYSSDTEEPAIAPDLKPVRRFVPDSWKNFFRGKKKDPEWDKPVSDIRYISDGVECSPPAS PARPNHRSPLNSCKDPYGGSEGTFSSRKEADAVFPRDPYGSLDRHTQTVRTYSEKVEEYN LRYSYMKSWAGLLRILGVVELLLGAGVFACVTAYIHKDSEWYNLFGYSQPYGMGGVGGLG 
SMYGGYYYTGPKTPFVLVVAGLAWITTIIILVLGMSMYYRTILLDSNWWPLTEFGINVAL FILYMAAAIVYVNDTNRGGLCYYPLFNTPVNAVFCRVEGGQIAAMIFLFVTMIVYLISAL VCLKLWRHEAARRHREYMEQQEINEPSLSSKRKMCEMATSGDRQRDSEVNFKELRTAKMK PELLSGHIPPGHIPKPIVMPDYVAKYPVIQTDDERERYKAVFQDQFSEYKELSAEVQAVL RKFDELDAVMSRLPHHSESRQVLNDGKMLIHTX >gi568815593r:69264991_69465737|GENSCAN_predicted_CDS_8|1539_bp atgtcaaatgatggaagatccaggaatcgggacaggcgctacgatgaggtcccaagcgac ctgccctatcaagataccaccataagaacccacccaactcttcatgacagtgagcgggca gtgagcgctgatcccttgccaccaccccctctcccattacagccaccattcggcccagac ttctactcaagtgacacagaagaaccagctatagcgccagatctcaaaccagtaaggcgc tttgtccctgactcctggaagaactttttcagagggaagaaaaaggaccccgaatgggat aagccggtgtctgatatcaggtacatctccgatggagtggagtgttcaccaccagcctct ccagcaagaccaaaccaccgttcgcccctcaactcctgcaaagatccctacggagggtca gaaggaacctttagttcccggaaagaggctgacgcagtgtttccccgggatccctatgga tctctagaccgacacacacaaacagttcgaacatacagtgagaaggtggaggagtataac ctgagatactcctacatgaagtcgtgggcaggcctgctgagaatactgggtgtggtggag ctgcttttgggggccggtgtctttgcttgtgtcacagcttacattcacaaggacagtgag tggtacaacttgtttggatattcacaaccgtatggcatgggaggcgttggtggattgggc agtatgtatgggggctattactacactggccctaagaccccttttgtactcgtggttgct ggattagcttggatcaccaccattattattctggttcttggcatgtccatgtattaccgg accattcttctggactctaattggtggcccctaactgaatttggaattaacgttgccttg tttattttgtatatggccgcagccatagtctatgtgaatgataccaaccgaggtggcctc tgctactatccgttatttaatacaccagtgaatgcagtgttctgccgggtagaaggagga cagatagctgcaatgatcttcctgtttgtcaccatgatagtttatctcattagtgctttg gtttgcctaaagttatggaggcatgaggcagctcggagacatagagaatatatggaacaa caggagataaatgagccatcattgtcatcgaaaaggaaaatgtgtgaaatggccaccagt ggtgacagacaaagagactcagaagttaatttcaaggaactgagaacagcaaaaatgaaa cctgaactactgagtggacacatccccccaggccacattcctaaacctatcgtgatgccc gactatgtggcaaaataccctgtgattcagacagatgatgagcgagaacgctataaagct gtgttccaagaccagttttcagagtacaaagagctgtctgcagaagttcaggctgtcctg aggaagtttgatgagctggatgcagtgatgagcagattgccacatcattcggaaagccga caggtcctcaacgacgggaagatgctcattcacacagnn